A maximum entropy approach to information extraction from semi-structured and free text
H. Chieu, and H. Ng. Eighteenth national conference on Artificial intelligence, page 786--791. Menlo Park, CA, USA, American Association for Artificial Intelligence, (2002)
Abstract
In this paper, we present a classification-based approach towards single-slot as well as multi-slot information extraction (IE). For single-slot IE, we worked on the domain of Seminar Announcements, where each document contains information on only one seminar. For multi-slot IE, we worked on the domain of Management Succession. For this domain, we restrict ourselves to extracting information sentence by sentence, in the same way as (Soderland 1999). Each sentence can contain information on several management succession events. By using a classification approach based on a maximum entropy framework, our system achieves higher accuracy than the best previously published results in both domains.
%0 Conference Paper
%1 Chieu2002
%A Chieu, Hai L.
%A Ng, Hwee T.
%B Eighteenth national conference on Artificial intelligence
%C Menlo Park, CA, USA
%D 2002
%I American Association for Artificial Intelligence
%K information_extraction probabilistic uncertainty
%P 786--791
%T A maximum entropy approach to information extraction from semi-structured and free text
%U http://portal.acm.org/citation.cfm?id=777092.777213
%X In this paper, we present a classification-based approach towards single-slot as well as multi-slot information extraction (IE). For single-slot IE, we worked on the domain of Seminar Announcements, where each document contains information on only one seminar. For multi-slot IE, we worked on the domain of Management Succession. For this domain, we restrict ourselves to extracting information sentence by sentence, in the same way as (Soderland 1999). Each sentence can contain information on several management succession events. By using a classification approach based on a maximum entropy framework, our system achieves higher accuracy than the best previously published results in both domains.
%@ 0262511290
@inproceedings{Chieu2002,
abstract = {In this paper, we present a classification-based approach towards single-slot as well as multi-slot information extraction (IE). For single-slot IE, we worked on the domain of Seminar Announcements, where each document contains information on only one seminar. For multi-slot IE, we worked on the domain of Management Succession. For this domain, we restrict ourselves to extracting information sentence by sentence, in the same way as (Soderland 1999). Each sentence can contain information on several management succession events. By using a classification approach based on a maximum entropy framework, our system achieves higher accuracy than the best previously published results in both domains.},
added-at = {2009-03-12T15:42:50.000+0100},
address = {Menlo Park, CA, USA},
author = {Chieu, Hai L. and Ng, Hwee T.},
biburl = {https://www.bibsonomy.org/bibtex/2ce783a233880869ae5a52520e02e248f/lillejul},
booktitle = {Eighteenth national conference on Artificial intelligence},
citeulike-article-id = {2972713},
interhash = {8bb0f54e66366fc9e1b962a8d1b0d34d},
intrahash = {ce783a233880869ae5a52520e02e248f},
isbn = {0262511290},
keywords = {information_extraction probabilistic uncertainty},
location = {Edmonton, Alberta, Canada},
pages = {786--791},
posted-at = {2008-07-08 15:20:41},
priority = {2},
publisher = {American Association for Artificial Intelligence},
timestamp = {2009-04-22T10:29:37.000+0200},
title = {A maximum entropy approach to information extraction from semi-structured and free text},
url = {http://portal.acm.org/citation.cfm?id=777092.777213},
year = 2002
}