P. Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter. SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, page 178--185. New York, NY, USA, ACM Press, (2005)
DOI: 10.1145/1076034.1076067
Abstract
The Open Directory Project is clearly one of the largest collaborative efforts to manually annotate web pages. This effort involves over 65,000 editors and resulted in metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is just about 0.05 percent of the Web pages indexed by Google, is this effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high quality personalized web search. First, we address this by introducing an additional criterion for web page ranking, namely the distance between a user profile defined using ODP topics and the sets of ODP topics covered by each URL returned in regular web search. We empirically show that this enhancement yields better results than current web search using Google. Then, in the second part of the paper, we investigate the boundaries of biasing PageRank on subtopics of the ODP in order to automatically extend these metadata to the whole web.
%0 Conference Paper
%1 chirita-p-2005-178
%A Chirita, Paul Alexandru
%A Nejdl, Wolfgang
%A Paiu, Raluca
%A Kohlschütter, Christian
%B SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
%C New York, NY, USA
%D 2005
%I ACM Press
%K user-profile en www-search ontology adaptive search
%P 178--185
%R 10.1145/1076034.1076067
%T Using ODP metadata to personalize search
%U http://dx.doi.org/10.1145/1076034.1076067
%X The Open Directory Project is clearly one of the largest collaborative efforts to manually annotate web pages. This effort involves over 65,000 editors and resulted in metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is just about 0.05 percent of the Web pages indexed by Google, is this effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high quality personalized web search. First, we address this by introducing an additional criterion for web page ranking, namely the distance between a user profile defined using ODP topics and the sets of ODP topics covered by each URL returned in regular web search. We empirically show that this enhancement yields better results than current web search using Google. Then, in the second part of the paper, we investigate the boundaries of biasing PageRank on subtopics of the ODP in order to automatically extend these metadata to the whole web.
%@ 1595930345
@inproceedings{chirita-p-2005-178,
abstract = {The Open Directory Project is clearly one of the largest collaborative efforts to manually annotate web pages. This effort involves over 65,000 editors and resulted in metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is just about 0.05 percent of the Web pages indexed by Google, is this effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high quality personalized web search. First, we address this by introducing an additional criterion for web page ranking, namely the distance between a user profile defined using ODP topics and the sets of ODP topics covered by each URL returned in regular web search. We empirically show that this enhancement yields better results than current web search using Google. Then, in the second part of the paper, we investigate the boundaries of biasing PageRank on subtopics of the ODP in order to automatically extend these metadata to the whole web.},
added-at = {2012-04-24T14:42:28.000+0200},
address = {New York, NY, USA},
at = {2006-06-06 01:06:51},
author = {Chirita, Paul Alexandru and Nejdl, Wolfgang and Paiu, Raluca and Kohlsch\"utter, Christian},
biburl = {https://www.bibsonomy.org/bibtex/2c51767d6e88dd4d9845c10b75035f817/mgns},
booktitle = {SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval},
doi = {10.1145/1076034.1076067},
id = {685449},
interhash = {4b00e4644e534ded488197bff82116ca},
intrahash = {c51767d6e88dd4d9845c10b75035f817},
isbn = {1595930345},
keywords = {user-profile en www-search ontology adaptive search},
pages = {178--185},
priority = {2},
publisher = {ACM Press},
timestamp = {2012-04-24T14:56:45.000+0200},
title = {Using {ODP} metadata to personalize search},
url = {http://dx.doi.org/10.1145/1076034.1076067},
year = 2005
}