Categorizing gigabytes: experiments on the RCV1 Corpus
D. Tikk, Z. Bánsághi, and G. Biró. Proc. of the 6th Int. Symp. of Hungarian Researchers on Computational Intelligence (HUCI 2005), page 267--276. Budapest, Hungary, (November 2005)
Abstract
This paper presents categorization results performed by means of HITEC
categorizer tool on the new benchmark document collection of text cat-
egorization, the Reuters Corpus Volume 1 (RCV1). RCV1 is an archive
of over 800,000 manually categorized newswire stories made available by
Reuters in 2000 for research purposes. This collection was released to
take place of the Reuters-21578 collection that has been used widespread
in the text retrieval community. This paper intend to add some interesting
result to the characterization of RCV1 and HITEC categorizer.
%0 Conference Paper
%1 rcv1Cf
%A Tikk, Domonkos
%A Bánsághi, Zoltán
%A Biró, György
%B Proc. of the 6th Int. Symp. of Hungarian Researchers on Computational Intelligence (HUCI 2005)
%C Budapest, Hungary
%D 2005
%K cf corpus description rcv1 reuters section
%P 267--276
%T Categorizing gigabytes: experiments on the RCV1 Corpus
%X This paper presents categorization results performed by means of HITEC
categorizer tool on the new benchmark document collection of text cat-
egorization, the Reuters Corpus Volume 1 (RCV1). RCV1 is an archive
of over 800,000 manually categorized newswire stories made available by
Reuters in 2000 for research purposes. This collection was released to
take place of the Reuters-21578 collection that has been used widespread
in the text retrieval community. This paper intend to add some interesting
result to the characterization of RCV1 and HITEC categorizer.
%@ 963 7154 43 4
@inproceedings{rcv1Cf,
abstract = {This paper presents categorization results performed by means of HITEC
categorizer tool on the new benchmark document collection of text cat-
egorization, the Reuters Corpus Volume 1 (RCV1). RCV1 is an archive
of over 800,000 manually categorized newswire stories made available by
Reuters in 2000 for research purposes. This collection was released to
take place of the Reuters-21578 collection that has been used widespread
in the text retrieval community. This paper intend to add some interesting
result to the characterization of RCV1 and HITEC categorizer.},
added-at = {2012-11-20T04:53:46.000+0100},
address = {Budapest, Hungary},
author = {Tikk, Domonkos and B{\'a}ns{\'a}ghi, Zolt{\'a}n and Bir{\'o}, Gy{\"o}rgy},
biburl = {https://www.bibsonomy.org/bibtex/242e60081109a08080e04bfe105b603e7/jil},
booktitle = {Proc. of the 6th Int. Symp. of Hungarian Researchers on Computational Intelligence (HUCI 2005)},
description = {Domonkos Tikk — Research | Homepage},
interhash = {f6baa736642c86e00e01f43026bed138},
intrahash = {42e60081109a08080e04bfe105b603e7},
isbn = {963 7154 43 4},
keywords = {cf corpus description rcv1 reuters section},
month = {November 18--19},
pages = {267--276},
timestamp = {2013-11-23T20:11:51.000+0100},
title = {Categorizing gigabytes: experiments on the RCV1 Corpus},
year = 2005
}