Use of a genetic algorithm in brill's
transformation-based part-of-speech tagger
G. Wilson, and M. Heywood. GECCO 2005: Proceedings of the 2005 conference on
Genetic and evolutionary computation, 2, page 2067--2073. Washington DC, USA, ACM Press, (25-29 June 2005)
Abstract
The tagging problem in natural language processing is
to find a way to label every word in a text as a
particular part of speech, e.g., proper noun. An
effective way of solving this problem with high
accuracy is the transformation-based or "Brill"
tagger. In Brill's system, a number of transformation
templates are specified a priori that are instantiated
and ranked during a greedy search-based algorithm. This
paper describes a variant of Brill's implementation
that instead uses a genetic algorithm to generate the
instantiated rules and provide an adaptive ranking.
Based on tagging accuracy, the new system provides a
better hybrid evolutionary computation solution to the
part-of-speech (POS) problem than the previous attempt.
Although not able to make up for the use of a priori
knowledge used by Brill, the method appears to point
the way for an improved solution to the tagging
problem.
GECCO 2005: Proceedings of the 2005 conference on
Genetic and evolutionary computation
year
2005
month
25-29 June
pages
2067--2073
publisher
ACM Press
volume
2
organisation
ACM SIGEVO (formerly ISGEC)
publisher_address
New York, NY, 10286-1405, USA
isbn
1-59593-010-8
notes
GECCO-2005 A joint meeting of the fourteenth
international conference on genetic algorithms
(ICGA-2005) and the tenth annual genetic programming
conference (GP-2005).
ACM Order Number 910052
%0 Conference Paper
%1 1068352
%A Wilson, Garnett
%A Heywood, Malcolm
%B GECCO 2005: Proceedings of the 2005 conference on
Genetic and evolutionary computation
%C Washington DC, USA
%D 2005
%E Beyer, Hans-Georg
%E O'Reilly, Una-May
%E Arnold, Dirk V.
%E Banzhaf, Wolfgang
%E Blum, Christian
%E Bonabeau, Eric W.
%E Cantu-Paz, Erick
%E Dasgupta, Dipankar
%E Deb, Kalyanmoy
%E Foster, James A.
%E de
Jong, Edwin D.
%E Lipson, Hod
%E Llora, Xavier
%E Mancoridis, Spiros
%E Pelikan, Martin
%E Raidl, Guenther R.
%E Soule, Terence
%E Tyrrell, Andy M.
%E Watson, Jean-Paul
%E Zitzler, Eckart
%I ACM Press
%K Applications, Brill Real World algorithms, experimentation, genetic language languages natural processing, programming, tagger,
%P 2067--2073
%T Use of a genetic algorithm in brill's
transformation-based part-of-speech tagger
%U http://doi.acm.org/10.1145/1068009.1068352
%V 2
%X The tagging problem in natural language processing is
to find a way to label every word in a text as a
particular part of speech, e.g., proper noun. An
effective way of solving this problem with high
accuracy is the transformation-based or "Brill"
tagger. In Brill's system, a number of transformation
templates are specified a priori that are instantiated
and ranked during a greedy search-based algorithm. This
paper describes a variant of Brill's implementation
that instead uses a genetic algorithm to generate the
instantiated rules and provide an adaptive ranking.
Based on tagging accuracy, the new system provides a
better hybrid evolutionary computation solution to the
part-of-speech (POS) problem than the previous attempt.
Although not able to make up for the use of a priori
knowledge used by Brill, the method appears to point
the way for an improved solution to the tagging
problem.
%@ 1-59593-010-8
@inproceedings{1068352,
abstract = {The tagging problem in natural language processing is
to find a way to label every word in a text as a
particular part of speech, e.g., proper noun. An
effective way of solving this problem with high
accuracy is the transformation-based or {"}Brill{"}
tagger. In Brill's system, a number of transformation
templates are specified a priori that are instantiated
and ranked during a greedy search-based algorithm. This
paper describes a variant of Brill's implementation
that instead uses a genetic algorithm to generate the
instantiated rules and provide an adaptive ranking.
Based on tagging accuracy, the new system provides a
better hybrid evolutionary computation solution to the
part-of-speech (POS) problem than the previous attempt.
Although not able to make up for the use of a priori
knowledge used by Brill, the method appears to point
the way for an improved solution to the tagging
problem.},
added-at = {2008-06-19T17:35:00.000+0200},
address = {Washington DC, USA},
author = {Wilson, Garnett and Heywood, Malcolm},
biburl = {https://www.bibsonomy.org/bibtex/28fd2759fe79f9a31dcedabed101f083b/brazovayeye},
booktitle = {{GECCO 2005}: Proceedings of the 2005 conference on
Genetic and evolutionary computation},
editor = {Beyer, Hans-Georg and O'Reilly, Una-May and Arnold, Dirk V. and Banzhaf, Wolfgang and Blum, Christian and Bonabeau, Eric W. and Cantu-Paz, Erick and Dasgupta, Dipankar and Deb, Kalyanmoy and Foster, James A. and {de
Jong}, Edwin D. and Lipson, Hod and Llora, Xavier and Mancoridis, Spiros and Pelikan, Martin and Raidl, Guenther R. and Soule, Terence and Tyrrell, Andy M. and Watson, Jean-Paul and Zitzler, Eckart},
interhash = {683ea73863a23b5d7686c518443e45fc},
intrahash = {8fd2759fe79f9a31dcedabed101f083b},
isbn = {1-59593-010-8},
keywords = {Applications, Brill Real World algorithms, experimentation, genetic language languages natural processing, programming, tagger,},
month = {25-29 June},
notes = {GECCO-2005 A joint meeting of the fourteenth
international conference on genetic algorithms
(ICGA-2005) and the tenth annual genetic programming
conference (GP-2005).
ACM Order Number 910052},
organisation = {ACM SIGEVO (formerly ISGEC)},
pages = {2067--2073},
publisher = {ACM Press},
publisher_address = {New York, NY, 10286-1405, USA},
timestamp = {2008-06-19T17:54:14.000+0200},
title = {Use of a genetic algorithm in brill's
transformation-based part-of-speech tagger},
url = {http://doi.acm.org/10.1145/1068009.1068352},
volume = 2,
year = 2005
}