Twitter as a Corpus for Sentiment Analysis and Opinion Mining.
A. Pak, and P. Paroubek. LREC, European Language Resources Association, (2010)
Abstract
Microblogging today has become a very popular communication tool among Internet users. Millions of users share opinions on different
aspects of life everyday. Therefore microblogging web-sites are rich sources of data for opinion mining and sentiment analysis. Because
microblogging has appeared relatively recently, there are a few research works that were devoted to this topic. In our paper, we focus
on using Twitter, the most popular microblogging platform, for the task of sentiment analysis. We show how to automatically collect a
corpus for sentiment analysis and opinion mining purposes. We perform linguistic analysis of the collected corpus and explain discovered
phenomena. Using the corpus, we build a sentiment classifier, that is able to determine positive, negative and neutral sentiments for a
document. Experimental evaluations show that our proposed techniques are efficient and performs better than previously proposed
methods. In our research, we worked with English, however, the proposed technique can be used with any other language.
%0 Conference Paper
%1 pak2010twitter
%A Pak, Alexander
%A Paroubek, Patrick
%B LREC
%D 2010
%E Calzolari, Nicoletta
%E Choukri, Khalid
%E Maegaard, Bente
%E Mariani, Joseph
%E Odijk, Jan
%E Piperidis, Stelios
%E Rosner, Mike
%E Tapias, Daniel
%I European Language Resources Association
%K corpus k3 opinion sentiment twitter
%T Twitter as a Corpus for Sentiment Analysis and Opinion Mining.
%U http://dblp.uni-trier.de/db/conf/lrec/lrec2010.html#PakP10
%X Microblogging today has become a very popular communication tool among Internet users. Millions of users share opinions on different
aspects of life everyday. Therefore microblogging web-sites are rich sources of data for opinion mining and sentiment analysis. Because
microblogging has appeared relatively recently, there are a few research works that were devoted to this topic. In our paper, we focus
on using Twitter, the most popular microblogging platform, for the task of sentiment analysis. We show how to automatically collect a
corpus for sentiment analysis and opinion mining purposes. We perform linguistic analysis of the collected corpus and explain discovered
phenomena. Using the corpus, we build a sentiment classifier, that is able to determine positive, negative and neutral sentiments for a
document. Experimental evaluations show that our proposed techniques are efficient and performs better than previously proposed
methods. In our research, we worked with English, however, the proposed technique can be used with any other language.
%@ 2-9517408-6-7
@inproceedings{pak2010twitter,
abstract = {Microblogging today has become a very popular communication tool among Internet users. Millions of users share opinions on different
aspects of life everyday. Therefore microblogging web-sites are rich sources of data for opinion mining and sentiment analysis. Because
microblogging has appeared relatively recently, there are a few research works that were devoted to this topic. In our paper, we focus
on using Twitter, the most popular microblogging platform, for the task of sentiment analysis. We show how to automatically collect a
corpus for sentiment analysis and opinion mining purposes. We perform linguistic analysis of the collected corpus and explain discovered
phenomena. Using the corpus, we build a sentiment classifier, that is able to determine positive, negative and neutral sentiments for a
document. Experimental evaluations show that our proposed techniques are efficient and performs better than previously proposed
methods. In our research, we worked with English, however, the proposed technique can be used with any other language.},
added-at = {2016-02-10T14:12:06.000+0100},
author = {Pak, Alexander and Paroubek, Patrick},
biburl = {https://www.bibsonomy.org/bibtex/2219d528a505a821f460864441c499147/asmelash},
booktitle = {LREC},
editor = {Calzolari, Nicoletta and Choukri, Khalid and Maegaard, Bente and Mariani, Joseph and Odijk, Jan and Piperidis, Stelios and Rosner, Mike and Tapias, Daniel},
ee = {http://www.lrec-conf.org/proceedings/lrec2010/summaries/385.html},
interhash = {ac930b0459a3c8a2fc2d74c52a475026},
intrahash = {219d528a505a821f460864441c499147},
isbn = {2-9517408-6-7},
keywords = {corpus k3 opinion sentiment twitter},
publisher = {European Language Resources Association},
timestamp = {2016-02-10T14:37:45.000+0100},
title = {Twitter as a Corpus for Sentiment Analysis and Opinion Mining.},
url = {http://dblp.uni-trier.de/db/conf/lrec/lrec2010.html#PakP10},
year = 2010
}