POS tagging and its applications for mathematics

Abstract

Content analysis of scientific publications is a nontrivial task, but a useful and important one for scientific information services. In the Gutenberg era it was a domain of human experts; in the digital age many machine-based methods, e.g., graph analysis tools and machine-learning techniques, have been developed for it. Natural Language Processing (NLP) is a powerful machine-learning approach to semiautomatic speech and language processing, which is also applicable to mathematics. The well established methods of NLP have to be adjusted for the special needs of mathematics, in particular for handling mathematical formulae. We demonstrate a mathematics-aware part of speech tagger and give a short overview about our adaptation of NLP methods for mathematical publications. We show the use of the tools developed for key phrase extraction and classification in the database zbMATH.

BibTeX key: schoneberg_pos_2014
entry type: article
year: 2014
journal: arXiv:1406.2880 cs
url: http://arxiv.org/abs/1406.2880

BibSonomy

POS tagging and its applications for mathematics

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on