«Traditionally, unification grammars are hand-coded. This is extremely time consuming, expensive and very difficult to scale. [...] we have developed a new method for automatically extracting wide-coverage probabilistic unification (LFG) grammars from treebank resources. To achieve this, we first automatically annotate the treebank (such as Penn-II) with feature-structure information (LFG f-structures, approximating to basic predicate-argument structure). From the f-structure annotated treebank, we then automatically extract wide-coverage, probabilistic LFG approximations to parse new text»
M. Volk, T. Marek, and Y. Samuelsson. Proceedings of the Workshop on Human Judgements in Computational Linguistics, page 51--57. Manchester, Association for Computational Linguistics, Association for Computational Linguistics, (2008)
H. Dyvik, P. Meurer, V. Rosén, and K. De Smedt. Proceedings of the Eighth International Workshop on Treebanks and Linguistic Theories, page 71--82. Milano, EDUCatt, (2009)
J. Tiedemann, and G. Kotzé. Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning, page 33--39. Borovets, Bulgaria, Association for Computational Linguistics, Association for Computational Linguistics, (2009)
V. Zhechev, and A. Way. Proceedings of the 22nd International Conference on Computational Linguistics, 1, page 1105--1112. Manchester, Association for Computational Linguistics, Association for Computational Linguistics, (2008)
V. Rosén, and K. De Smedt. Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007, page 152--159. Tartu, University of Tartu, (2007)
M. Hearne, S. Ozdowska, and J. Tinsley. Actes de la 15e Conférence Annuelle sur le Traitement Automatique des Langues Naturelles (TALN '08), Avignon, France, ATALA, (2008)
Y. Samuelsson, and M. Volk. Treebanking for Discourse and Speech. Proceedings of the NODALIDA 2005 Special Session on Treebanks for Spoken Language and Discourse, volume 32 of Copenhagen Studies in Language, page 147. Forlaget Samfundslitteratur, København, (2005)
V. Rosén, P. Meurer, and K. de Smedt. Proceedings of the 7th International Workshop on Treebanks and Linguistic Theories (TLT7), page 127--133. Utrecht, LOT, (2009)