
Keyword extraction from a single document using word co-occurrence statistical information

International Journal on Artificial Intelligence Tools (2004)

Abstract

This paper explains a keyword extraction algorithm based solely on a single document. First, frequent terms are extracted. Then, co-occurrences of each term with the frequent terms are counted. If a term appears frequently with a particular subset of terms, the term is likely to have important meaning. The degree of bias of the co-occurrence distribution is measured by the χ²-measure. We show that our keyword extraction performs well without the need for a corpus. In this paper, a term is defined as a word or a word sequence. We do not intend to limit the meaning in a terminological sense. A word sequence is written as a phrase.
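
The steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `extract_keywords`, the parameters `num_frequent` and `top_n`, the naive regex tokenizer, the use of sentences as the co-occurrence window, and the way expected counts are estimated (from the pooled co-occurrence totals) are all assumptions made for illustration. Terms are single words here, with no phrase detection or stop-word handling.

```python
import re
from collections import Counter


def extract_keywords(text, num_frequent=3, top_n=5):
    """Rank terms by how biased their co-occurrence with frequent terms is."""
    # Naive sentence and word splitting (an assumption, not from the paper).
    token_lists = [re.findall(r"[a-z]+", s.lower())
                   for s in re.split(r"[.!?]+", text)]
    token_lists = [tokens for tokens in token_lists if tokens]

    # Step 1: select the most frequent terms of the document.
    freq = Counter(w for tokens in token_lists for w in tokens)
    frequent = [w for w, _ in freq.most_common(num_frequent)]

    # Step 2: count, per term, the sentences it shares with each frequent term.
    cooc = {w: Counter() for w in freq}
    for tokens in token_lists:
        present = set(tokens)
        for w in present:
            for g in frequent:
                if g in present and g != w:
                    cooc[w][g] += 1

    # Expected share of each frequent term, pooled over all terms (assumed).
    col_total = Counter()
    for counts in cooc.values():
        col_total.update(counts)
    grand_total = sum(col_total.values())

    # Step 3: chi-square statistic of observed vs. expected co-occurrence.
    scores = {}
    for w, counts in cooc.items():
        n_w = sum(counts.values())
        chi2 = 0.0
        for g in frequent:
            p_g = col_total[g] / grand_total if grand_total else 0.0
            expected = n_w * p_g
            if expected > 0:
                chi2 += (counts[g] - expected) ** 2 / expected
        scores[w] = chi2

    # Highest bias first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]
```

Calling `extract_keywords(document_text, num_frequent=10, top_n=5)` returns up to five candidate terms from that document alone, with no reference corpus. On real text, very common function words would dominate the frequent-term set, so a stop-word filter would likely be needed before the scores become meaningful.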
