Inproceedings,

Semi-automatic extraction and modeling of ontologies using Wikipedia XML Corpus

L. Silva, and L. Jayaratne.
Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the, page 446-451. (August 2009)
DOI: 10.1109/ICADIWT.2009.5273871

Abstract

This paper introduces WikiOnto: a system that assists in the extraction and modeling of topic ontologies in a semi-automatic manner using a preprocessed document corpus derived from Wikipedia. Based on the Wikipedia XML Corpus, we present a three-tiered framework for extracting topic ontologies in quick time and a modeling environment to refine these ontologies. Using natural language processing (NLP) and other machine learning (ML) techniques along with a very rich document corpus, this system proposes a solution to a task that is generally considered extremely cumbersome. The initial results of the prototype suggest strong potential of the system to become highly successful in ontology extraction and modeling and also inspire further research on extracting ontologies from other semi-structured document corpora as well.

BibTeX key: silva2009semiautomatic
entry type: inproceedings
booktitle: Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the
year: 2009
month: Aug.
pages: 446-451
timestamp: 2010-02-23 12:54:40
username: dbenz
intrahash: 66bec053541e521fbe68c0119806ae49
file: silva2009semiautomatic.pdf:silva2009semiautomatic.pdf:PDF
interhash: c1996cb9e69de56e2bb2f8e763fe0482
groups: public
DOI: 10.1109/ICADIWT.2009.5273871
url: http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=5273826&arnumber=5273871&count=156&index=116

BibSonomy

Semi-automatic extraction and modeling of ontologies using Wikipedia XML Corpus

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on