20 Newsgroups
Abstract
This data set consists of 20000 messages taken from 20 Usenet newsgroups.
Information files:
description of the data
Data files:
20_newsgroups.tar.gz (17.3M; 61.6M uncompressed)
mini_newsgroups.tar.gz A subset composed of 100 articles from each newsgroup. (1.9M; 6.2M uncompressed)
The journal Semantic Web – Interoperability, Usability, Applicability brings together researchers from various fields which share the vision and need for more effective and meaningful ways to share information across agents and services on the future internet and elsewhere. As such, Semantic Web technologies shall support the seamless integration of data, on-the-fly composition and interoperation of Web services, as well as more intuitive search engines. The semantics – or meaning – of information, however, cannot be defined without a context, which makes personalization, trust, and provenance core topics for Semantic Web research. New retrieval paradigms, user interfaces, and visualization techniques have to unleash the power of the Semantic Web and at the same time hide its complexity from the user. Based on this vision, the journal welcome contributions ranging from theoretical and foundational research over methods and tools to descriptions of concrete ontologies and applications in all areas. We especially welcome papers which add a social, spatial, and temporal dimension to Semantic Web research, as well as application-oriented papers making use of formal semantics.
The journal is peer-reviewed and will be published quarterly.
J. Freyne, S. Anand, I. Guy, and A. Hotho. Proceedings of the fifth ACM conference on Recommender systems, page 383--384. New York, NY, USA, ACM, (2011)
A. Sonnenbichler. (2010)cite arxiv:1006.4271
Comment: Presented at the International Network For Social Network Analysis
(INSNA): Sunbelt Conference 2009, San Diego, California, USA. 9 pages, 6
figures.
J. Illig, A. Hotho, R. Jäschke, and G. Stumme. Knowledge Processing and Data Analysis, volume 6581 of Lecture Notes in Computer Science, page 136--149. Berlin/Heidelberg, Springer, (2011)
B. Krause, A. Hotho, and G. Stumme. Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, 4956, page 101-113. Springer, (2008)