Inproceedings,

Distributed Web2.0 Crawling for Ontology Evolution

A. Juffinger, T. Neidhart, M. Granitzer, K. R., A. Weichselbraun, G. Wohlgenannt, and A. Scharl.
Proceedings of the Second International Conference on Digital Information Management (ICDIM'07), Lyon, France, (October 2007)

Abstract

Semantic Web technologies in general and ontology- based approaches in particular are considered the foundation for the next generation of information services. While ontologies enable software agents to exchange knowledge and information in a standardised, intelligent manner, describing todays vast amount of information in terms of ontological knowledge and to track the evolution of such ontologies remains a challenge. In this paper we describe Web2.0 crawling for ontology evolution. The World Wide Web, or Web for short, is due its evolutionary properties and social network characteristics a perfect fitting data source to evolve an ontology. The decentralised structure of the Internet, the huge amount of data and upcoming Web2.0 technologies arise several challenges for a crawling system. In this paper we present a distributed crawling system with standard browser integration. The proposed system is a high performance, sitescript based noise reducing crawler which loads standard browser equivalent content from Web2.0 resources. Furthermore we describe the integration of this spider into our ontology evolution framework.

BibTeX key: juffinger2007
entry type: inproceedings
address: Lyon, France
booktitle: Proceedings of the Second International Conference on Digital Information Management (ICDIM'07)
year: 2007
month: October
timestamp: 2007.10.31
owner: albert

BibSonomy

Distributed Web2.0 Crawling for Ontology Evolution

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on