Automating RDF Dataset Transformation and Enrichment
{. Sherif, A. Ngonga Ngomo, and J. Lehmann. 12th Extended Semantic Web Conference, Portoroz, Slovenia, 31st May - 4th June 2015, Springer, (2015)
Abstract
With the adoption of RDF across several domains, come growing requirements pertaining to the completeness and quality of RDF datasets. Currently, this problem is most commonly addressed by manually devising means of enriching an input dataset. The few tools that aim at supporting this endeavour usually focus on supporting the manual definition of enrichment pipelines. In this paper, we present a supervised learning approach based on a refinement operator for enriching RDF datasets. We show how we can use exemplary descriptions of enriched resources to generate accurate enrichment pipelines. We evaluate our approach against eight manually defined enrichment pipelines and show that our approach can learn accurate pipelines even when provided with a small number of training examples.
%0 Conference Paper
%1 DEER_2015
%A Sherif, Mohamed Ahmed
%A Ngonga Ngomo, Axel-Cyrille
%A Lehmann, Jens
%B 12th Extended Semantic Web Conference, Portoroz, Slovenia, 31st May - 4th June 2015
%D 2015
%I Springer
%K 2015 MOLE deer dice geoknow group_aksw lehmann ngonga sherif simba
%T Automating RDF Dataset Transformation and Enrichment
%U http://svn.aksw.org/papers/2015/ESWC_DEER/public.pdf
%X With the adoption of RDF across several domains, come growing requirements pertaining to the completeness and quality of RDF datasets. Currently, this problem is most commonly addressed by manually devising means of enriching an input dataset. The few tools that aim at supporting this endeavour usually focus on supporting the manual definition of enrichment pipelines. In this paper, we present a supervised learning approach based on a refinement operator for enriching RDF datasets. We show how we can use exemplary descriptions of enriched resources to generate accurate enrichment pipelines. We evaluate our approach against eight manually defined enrichment pipelines and show that our approach can learn accurate pipelines even when provided with a small number of training examples.
@inproceedings{DEER_2015,
abstract = {With the adoption of RDF across several domains, come growing requirements pertaining to the completeness and quality of RDF datasets. Currently, this problem is most commonly addressed by manually devising means of enriching an input dataset. The few tools that aim at supporting this endeavour usually focus on supporting the manual definition of enrichment pipelines. In this paper, we present a supervised learning approach based on a refinement operator for enriching RDF datasets. We show how we can use exemplary descriptions of enriched resources to generate accurate enrichment pipelines. We evaluate our approach against eight manually defined enrichment pipelines and show that our approach can learn accurate pipelines even when provided with a small number of training examples.},
added-at = {2024-11-01T19:15:08.000+0100},
author = {Sherif, {Mohamed Ahmed} and {Ngonga Ngomo}, Axel-Cyrille and Lehmann, Jens},
bdsk-url-1 = {http://svn.aksw.org/papers/2015/ESWC_DEER/public.pdf},
biburl = {https://www.bibsonomy.org/bibtex/2d1332ac10237bd3e3e2b89e871b029c6/aksw},
booktitle = {12th Extended Semantic Web Conference, Portoro{\v{z}}, Slovenia, 31st May - 4th June 2015},
interhash = {1d53de0d674b92f7dff3ae46593c355f},
intrahash = {d1332ac10237bd3e3e2b89e871b029c6},
keywords = {2015 MOLE deer dice geoknow group_aksw lehmann ngonga sherif simba},
publisher = {Springer},
timestamp = {2024-11-01T19:15:08.000+0100},
title = {Automating {RDF} Dataset Transformation and Enrichment},
url = {http://svn.aksw.org/papers/2015/ESWC_DEER/public.pdf},
year = 2015
}