The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate description (metadata) for web pages, and to improve the search performance. However, noise and incompleteness, sparseness, and the volatility of web pages and queries are three major challenges for research work on user click-through log mining. In this paper, we propose a novel iterative reinforced algorithm to utilize the user click-through data to improve search performance. The algorithm fully explores the interrelations between queries and web pages, and effectively finds "virtual queries" for web pages and overcomes the challenges discussed above. Experiment results on a large set of MSN click-through log data show a significant improvement on search performance over the naive query log mining algorithm as well as the baseline search engine.
Description
Optimizing web search using web click-through data
%0 Conference Paper
%1 1031192
%A Xue, Gui-Rong
%A Zeng, Hua-Jun
%A Chen, Zheng
%A Yu, Yong
%A Ma, Wei-Ying
%A Xi, WenSi
%A Fan, WeiGuo
%B CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management
%C New York, NY, USA
%D 2004
%I ACM
%K click learning-to-rank ranking search web
%P 118--126
%R 10.1145/1031171.1031192
%T Optimizing web search using web click-through data
%U http://portal.acm.org/citation.cfm?id=1031171.1031192
%X The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate description (metadata) for web pages, and to improve the search performance. However, noise and incompleteness, sparseness, and the volatility of web pages and queries are three major challenges for research work on user click-through log mining. In this paper, we propose a novel iterative reinforced algorithm to utilize the user click-through data to improve search performance. The algorithm fully explores the interrelations between queries and web pages, and effectively finds "virtual queries" for web pages and overcomes the challenges discussed above. Experiment results on a large set of MSN click-through log data show a significant improvement on search performance over the naive query log mining algorithm as well as the baseline search engine.
%@ 1-58113-874-1
@inproceedings{1031192,
abstract = {The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate description (metadata) for web pages, and to improve the search performance. However, noise and incompleteness, sparseness, and the volatility of web pages and queries are three major challenges for research work on user click-through log mining. In this paper, we propose a novel iterative reinforced algorithm to utilize the user click-through data to improve search performance. The algorithm fully explores the interrelations between queries and web pages, and effectively finds "virtual queries" for web pages and overcomes the challenges discussed above. Experiment results on a large set of MSN click-through log data show a significant improvement on search performance over the naive query log mining algorithm as well as the baseline search engine.},
added-at = {2010-08-05T09:17:33.000+0200},
address = {New York, NY, USA},
author = {Xue, Gui-Rong and Zeng, Hua-Jun and Chen, Zheng and Yu, Yong and Ma, Wei-Ying and Xi, WenSi and Fan, WeiGuo},
biburl = {https://www.bibsonomy.org/bibtex/22c9841b484ade7e9a8c9220662190c16/jaeschke},
booktitle = {CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management},
description = {Optimizing web search using web click-through data},
doi = {10.1145/1031171.1031192},
interhash = {31bb65c1b57888b0529c1a11e981bbe8},
intrahash = {2c9841b484ade7e9a8c9220662190c16},
isbn = {1-58113-874-1},
keywords = {click learning-to-rank ranking search web},
location = {Washington, D.C., USA},
pages = {118--126},
publisher = {ACM},
timestamp = {2014-07-28T15:57:31.000+0200},
title = {Optimizing web search using web click-through data},
url = {http://portal.acm.org/citation.cfm?id=1031171.1031192},
year = 2004
}