copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying Document Topics Using the Wikipedia Category Network

P. Schonhofen. Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on, page 456--462. Washington, DC, USA, IEEE, (December 2006)
DOI: 10.1109/WI.2006.92

Abstract

In the size and coverage of Wikipedia, a freely available online encyclopedia has reached the point where it can be utilized similar to an ontology or taxonomy to identify the topics discussed in a document. In this paper we show that even a simple algorithm that exploits only the titles and categories of Wikipedia articles can characterize documents by Wikipedia categories surprisingly well. We test the reliability of our method by predicting categories of Wikipedia articles themselves based on their bodies, and by performing classification and clustering on 20 newsgroups and RCV1, representing documents by their Wikipedia categories instead of their texts

Description

CiteULike: Identifying Document Topics Using the Wikipedia Category Network

Links and resources

BibTeX key: citeulike:1839949
entry type: inproceedings
address: Washington, DC, USA
booktitle: Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
year: 2006
month: dec
institution: Comput. & Autom. Res. Inst., Hungarian Acad. of Sci., Budapest
pages: 456--462
publisher: IEEE
series: WI '06
posted-at: 2012-04-26 06:40:15
priority: 4
isbn: 0-7695-2747-7
citeulike-article-id: 1839949
citeulike-linkout-1: http://dx.doi.org/10.1109/WI.2006.92
citeulike-linkout-2: http://ieeexplore.ieee.org/xpls/abs\_all.jsp?arnumber=4061411
citeulike-linkout-0: http://portal.acm.org/citation.cfm?id=1249180
DOI: 10.1109/WI.2006.92
url: http://dx.doi.org/10.1109/WI.2006.92

@peterr's tags highlighted

Cite this publication

search on

Meta data

Last update 12 years ago
Created 12 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying Document Topics Using the Wikipedia Category Network

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Identifying Document Topics Using the Wikipedia Category Network

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying Document Topics Using the Wikipedia Category Network

Comments and Reviews
(0)