Web search engines have changed our lives - enabling instant access to information about subjects that are both deeply important to us, as well as passing whims. The search engines that provide answers to our search queries also log those queries, in order to improve their algorithms. Academic research on search queries has shown that they can provide valuable information on diverse topics including word and phrase similarity, topical seasonality and may even have potential for sociology, as well as providing a barometer of the popularity of many subjects. At the same time, individuals are rightly concerned about what the consequences of accidental leaking or deliberate sharing of this information may mean for their privacy. In this talk I will cover the applications which have benefited from mining query logs, the risks that privacy can be breached by sharing query logs, and current algorithms for mining logs in a way to prevent privacy breaches.
R. Jäschke, and S. Rudolph. Contributions to the 11th International Conference on Formal Concept Analysis, page 19--34. Technische Universität Dresden, (May 2013)
T. Joachims. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 133--142. New York, NY, USA, ACM, (2002)
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, page 154--161. New York, NY, USA, ACM, (2005)