copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Conditional Random Fields For Local Adaptive Reference Extraction

M. Toepfer, P. Kluegl, A. Hotho, and F. Puppe.. Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet, Kassel, Germany, (2010)

Abstract

The accurate extraction of bibliographic information from scientific publications is an active field of research. Machine learning and sequence labeling approaches like Conditional Random Fields (CRF) are often applied for this reference extraction task, but still suffer from the ambiguity of reference notation. Reference sections apply a predefined style guide and contain only homogeneous references. Therefore, other references of the same paper or journal often provide evidence how the fields of a reference are correctly labeled. We propose a novel approach that exploits the similarities within a document. Our process model uses information of unlabeled documents directly during the extraction task in order to automatically adapt to the perceived style guide. This is implemented by changing the manifestation of the features for the applied CRF. The experimental results show considerable improvements compared to the common approach. We achieve an average F1 score of 96.7% and an instance accuracy of 85.4% on the test data set.

Links and resources

BibTeX key: kdml21
entry type: inproceedings
address: Kassel, Germany
booktitle: Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet
year: 2010
crossref: lwa2010
presentation_start: 2010-10-04 16:46:00
session: kdml1
track: kdml
presentation_end: 2010-10-04 17:08:00
room: 0446
Document: http://www.kde.cs.uni-kassel.de/conf/lwa10/papers/kdml21.pdf

@lwa2010's tags highlighted

Cite this publication

@inproceedings{kdml21, abstract = {The accurate extraction of bibliographic information from scientific publications is an active field of research. Machine learning and sequence labeling approaches like Conditional Random Fields (CRF) are often applied for this reference extraction task, but still suffer from the ambiguity of reference notation. Reference sections apply a predefined style guide and contain only homogeneous references. Therefore, other references of the same paper or journal often provide evidence how the fields of a reference are correctly labeled. We propose a novel approach that exploits the similarities within a document. Our process model uses information of unlabeled documents directly during the extraction task in order to automatically adapt to the perceived style guide. This is implemented by changing the manifestation of the features for the applied CRF. The experimental results show considerable improvements compared to the common approach. We achieve an average F1 score of 96.7% and an instance accuracy of 85.4% on the test data set.}, added-at = {2010-10-05T14:15:12.000+0200}, address = {Kassel, Germany}, author = {Toepfer, Martin and Kluegl, Peter and Hotho, Andreas and Puppe., Frank}, biburl = {https://www.bibsonomy.org/bibtex/237242cd584805b2e4cea0c486008889d/lwa2010}, booktitle = {Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen {\&} Adaptivitaet}, crossref = {lwa2010}, editor = {Atzmüller, Martin and Benz, Dominik and Hotho, Andreas and Stumme, Gerd}, interhash = {d8f45281363701bfe7f979b1e13ee269}, intrahash = {37242cd584805b2e4cea0c486008889d}, keywords = {adaptive conditional extraction field information learning local machine random reference room:0446 session:kdml2 workshop:kdml}, presentation_end = {2010-10-04 17:08:00}, presentation_start = {2010-10-04 16:46:00}, room = {0446}, session = {kdml1}, timestamp = {2010-10-05T14:15:13.000+0200}, title = {Conditional Random Fields For Local Adaptive Reference Extraction}, track = {kdml}, url = {http://www.kde.cs.uni-kassel.de/conf/lwa10/papers/kdml21.pdf}, year = 2010 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Conditional Random Fields For Local Adaptive Reference Extraction

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Conditional Random Fields For Local Adaptive Reference Extraction

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Conditional Random Fields For Local Adaptive Reference Extraction

Comments and Reviews
(0)