The Query Representation and Understanding (QRU) data set contains a set of similar queries that can be used in web research such as query transformation and relevance ranking. QRU contains similar queries that are related to existing benchmark data sets, such as TREC query sets. The QRU data set was created by extracting 100 TREC queries, training a query-generation model and a commercial search engine, generating similar queries from TREC queries with the model, and removal of mistakenly generated queries.
Query log data for ad targeting
A WWW2006 paper out of Microsoft Research, "Finding Advertising Keywords on Web Pages" (PDF), claims that query log data is particularly useful for ad targeting.
Specifically, the researchers extracted from MSN query logs the keywords some people used to find a given page. They tested using that as one of many features for ad targeting. In their results, it was one of the most effective features.
Very interesting. It has always been harder to target ads to content than to search results because intent is much less clear.
By using the query log data in this way, the researchers were effectively using the intent of the searchers that arrived at the page as a proxy for the intent of everyone who arrived at the page.
Query log data for ad targeting
A WWW2006 paper out of Microsoft Research, "Finding Advertising Keywords on Web Pages" (PDF), claims that query log data is particularly useful for ad targeting.
Specifically, the researchers extracted from MSN query logs the keywords some people used to find a given page. They tested using that as one of many features for ad targeting. In their results, it was one of the most effective features.
Very interesting. It has always been harder to target ads to content than to search results because intent is much less clear.
By using the query log data in this way, the researchers were effectively using the intent of the searchers that arrived at the page as a proxy for the intent of everyone who arrived at the page.
With this Web page, we are opening some aspects of hakia R&D to the view of our users. We undertook highly specific research tasks solely dedicated to the advancement of the core-competency in Web search. The main challenge is to make science work in a co
M. Thein, and M. Thwin. International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), volume 2 of IFIP Advances in Information and Communication Technology, page 13-32. Springer, (December 2012)
J. Singh, W. Nejdl, and A. Anand. Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval, page 183--192. New York, NY, USA, ACM, (2016)
R. Jäschke, B. Krause, A. Hotho, and G. Stumme. Proceedings of the Second International Conference on Weblogs and Social Media(ICWSM 2008), AAAI Press, (2008)
B. Krause, A. Hotho, and G. Stumme. Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, 4956, page 101-113. Springer, (2008)
B. Krause, A. Hotho, and G. Stumme. Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, 4956, page 101-113. Springer, (2008)
R. Jäschke, B. Krause, A. Hotho, and G. Stumme. Proceedings of the Second International Conference on Weblogs and Social Media(ICWSM 2008), AAAI Press, (2008)
R. Baeza-Yates, C. Hurtado, and M. Mendoza. Current Trends in Database Technology - EDBT 2004 Workshops, volume 3268 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2005)
Z. Al Bawab, G. Mills, and J. Crespo. Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, page 397--405. New York, NY, USA, ACM, (2012)