Position-aware Attention and Supervised Data Improve Slot Filling
Y. Zhang, V. Zhong, D. Chen, G. Angeli, and C. Manning. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 35–45, Copenhagen, Denmark, September 2017. Association for Computational Linguistics.
DOI: 10.18653/v1/D17-1004
Abstract
Organized relational knowledge in the form of "knowledge graphs" is important for many applications. However, the ability to populate knowledge bases with facts automatically extracted from documents has improved frustratingly slowly. This paper simultaneously addresses two issues that have held back prior work. We first propose an effective new model, which combines an LSTM sequence model with a form of entity position-aware attention that is better suited to relation extraction. Then we build TACRED, a large (119,474 examples) supervised relation extraction dataset obtained via crowdsourcing and targeted towards TAC KBP relations. The combination of better supervised data and a more appropriate high-capacity model enables much better relation extraction performance. When the model trained on this new dataset replaces the previous relation extraction component of the best TAC KBP 2015 slot filling system, its F1 score increases markedly from 22.2% to 26.7%.
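
The abstract describes the model only at a high level. As a rough illustration of how an entity position-aware attention layer over LSTM states could be wired up, the PyTorch sketch below scores each token from its hidden state, a sentence summary vector, and position embeddings relative to the subject and object entities. The module name, tensor shapes, and the concatenation of the two position embeddings are illustrative assumptions, not the paper's exact formulation.

# Minimal sketch (not the authors' code): entity position-aware attention
# over LSTM hidden states for relation extraction. Assumed shapes:
#   h    : B x T x H   LSTM hidden state for each token
#   q    : B x H       sentence summary (e.g. the final LSTM state)
#   ps   : B x T x P   position embeddings relative to the subject entity
#   po   : B x T x P   position embeddings relative to the object entity
#   mask : B x T       boolean, True for real tokens, False for padding
import torch
import torch.nn as nn
import torch.nn.functional as F

class PositionAwareAttention(nn.Module):  # hypothetical module name
    def __init__(self, hidden_size, pos_size, attn_size):
        super().__init__()
        self.w_h = nn.Linear(hidden_size, attn_size, bias=False)   # token-state term
        self.w_q = nn.Linear(hidden_size, attn_size, bias=False)   # summary term
        self.w_p = nn.Linear(2 * pos_size, attn_size, bias=False)  # subj/obj position term
        self.v = nn.Linear(attn_size, 1, bias=False)                # scoring vector

    def forward(self, h, q, ps, po, mask):
        # Score each token: u_i = v^T tanh(W_h h_i + W_q q + W_p [p_i^subj; p_i^obj])
        u = self.v(torch.tanh(
            self.w_h(h)
            + self.w_q(q).unsqueeze(1)
            + self.w_p(torch.cat([ps, po], dim=-1))
        )).squeeze(-1)                                   # B x T
        u = u.masked_fill(~mask, float("-inf"))          # ignore padding positions
        a = F.softmax(u, dim=-1)                         # attention weights over tokens
        return torch.bmm(a.unsqueeze(1), h).squeeze(1)   # B x H weighted summary

The weighted summary would then feed a linear classifier over the relation label set; the attention parameterization actually used in the paper may differ in detail.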
@inproceedings{zhang-etal-2017-position,
address = {Copenhagen, Denmark},
author = {Zhang, Yuhao and Zhong, Victor and Chen, Danqi and Angeli, Gabor and Manning, Christopher D.},
booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
doi = {10.18653/v1/D17-1004},
month = sep,
pages = {35--45},
publisher = {Association for Computational Linguistics},
title = {Position-aware Attention and Supervised Data Improve Slot Filling},
url = {https://www.aclweb.org/anthology/D17-1004},
year = 2017
}