Improving Word Sense Disambiguation Using Topic Features
J. Cai, W. Lee, and Y. Teh. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), page 1015--1023. (2007)
Abstract
This paper presents a novel approach for exploiting the global context for the task of word sense disambiguation (WSD). This is done by using topic features constructed using the latent Dirichlet allocation (LDA) algorithm on unlabeled data. The features are incorporated into a modified naive Bayes network alongside other features such as part-of-speech of neighboring words, single words in the surrounding context, local collocations, and syntactic patterns. In both the English all-words task and the English lexical sample task, the method achieved significant improvement over the simple naive Bayes classifier and higher accuracy than the best official scores on Senseval-3 for both task.
%0 Conference Paper
%1 Cai:EtAl:07
%A Cai, Junfu
%A Lee, Wee Sun
%A Teh, Yee Whye
%B Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
%D 2007
%K 2007 bayesian conll emnlp topic wsd
%P 1015--1023
%T Improving Word Sense Disambiguation Using Topic Features
%U http://acl.ldc.upenn.edu/D/D07/D07-1108.pdf
%X This paper presents a novel approach for exploiting the global context for the task of word sense disambiguation (WSD). This is done by using topic features constructed using the latent Dirichlet allocation (LDA) algorithm on unlabeled data. The features are incorporated into a modified naive Bayes network alongside other features such as part-of-speech of neighboring words, single words in the surrounding context, local collocations, and syntactic patterns. In both the English all-words task and the English lexical sample task, the method achieved significant improvement over the simple naive Bayes classifier and higher accuracy than the best official scores on Senseval-3 for both task.
@inproceedings{Cai:EtAl:07,
abstract = {This paper presents a novel approach for exploiting the global context for the task of word sense disambiguation (WSD). This is done by using topic features constructed using the latent Dirichlet allocation (LDA) algorithm on unlabeled data. The features are incorporated into a modified naive Bayes network alongside other features such as part-of-speech of neighboring words, single words in the surrounding context, local collocations, and syntactic patterns. In both the English all-words task and the English lexical sample task, the method achieved significant improvement over the simple naive Bayes classifier and higher accuracy than the best official scores on Senseval-3 for both task.},
added-at = {2007-07-09T17:31:48.000+0200},
author = {Cai, Junfu and Lee, Wee Sun and Teh, Yee Whye},
biburl = {https://www.bibsonomy.org/bibtex/288143110902fef92d5b620bd1488c15c/seandalai},
booktitle = {Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)},
interhash = {498b813746fa68bf563535314c0180f4},
intrahash = {88143110902fef92d5b620bd1488c15c},
keywords = {2007 bayesian conll emnlp topic wsd},
pages = {1015--1023},
timestamp = {2007-07-09T17:31:48.000+0200},
title = {Improving Word Sense Disambiguation Using Topic Features},
url = {http://acl.ldc.upenn.edu/D/D07/D07-1108.pdf},
year = 2007
}