the data here is useful for testing classification / clustering, and the accuracy of indexing techniques. However the datasets are too small to make claims about the efficiency of indexing.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
English translation of selected chapters of the WikiWord thesis "Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia" by Daniel Kinzler. Translation by the author.
R. Neßelrath, и J. Alexandersson. Proceedings of the 6th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems. Twenty-First International Joint Conference On Artificial Intelligence (IJCAI -09), in Conjunction with 6th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems (KRPD-09), July 12, Pasadena, California, United States, стр. 46-51. IJCAI 2009, (июля 2009)
M. Sahami, S. Dumais, D. Heckerman, и E. Horvitz. Learning for Text Categorization: Papers from the 1998 Workshop, Madison, Wisconsin, AAAI Technical Report WS-98-05, (1998)
D. Willems, и L. Vuurpijl. Proceedings of the Ninth international conference on document analysis and recognition, стр. 869-873. Curitiba, Brazil, (2007)
J. Esparza, и F. Reiter. 31st International Conference on Concurrency Theory (CONCUR 2020), том 171 из Leibniz International Proceedings in Informatics (LIPIcs), стр. 10:1--10:16. Dagstuhl, Germany, Schloss Dagstuhl--Leibniz-Zentrum für Informatik, (2020)Preprint: <a href="https://arxiv.org/abs/2007.03291">Link</a><br>#conference.
Y. Yang, и J. Pedersen. Proceedings of ICML-97, 14th International Conference on Machine Learning, стр. 412--420. Nashville, US, Morgan Kaufmann Publishers, San Francisco, US, (1997)
Y. Yang, и J. Pedersen. Proceedings of ICML-97, 14th International Conference on Machine Learning, стр. 412--420. Nashville, US, Morgan Kaufmann Publishers, San Francisco, US, (1997)