@gergie

OWL Reasoning with WebPIE: Calculating the Closure of 100 Billion Triples

, , , , and . Proceedings of the 7th Extended Semantic Web Conference (ESWC), volume 6088 of Lecture Notes in Computer Science, page 213--227. Berlin, Heidelberg, Springer, (May 2010)

Abstract

In previous work we have shown that the MapReduce frame- work for distributed computation can be deployed for highly scalable inference over RDF graphs under the RDF Schema semantics. Unfortu- nately, several key optimizations that enabled the scalable RDFS infer- ence do not generalize to the richer OWL semantics. In this paper we analyze these problems, and we propose solutions to overcome them. Our solutions allow distributed computation of the closure of an RDF graph under the OWL Horst semantics. We demonstrate the WebPIE inference engine, built on top of the Hadoop platform and deployed on a compute cluster of 64 machines. We have evaluated our approach using some real-world datasets (UniProt and LDSR, about 0.9-1.5 billion triples) and a synthetic benchmark (LUBM, up to 100 billion triples). Results show that our implementation is scal- able and vastly outperforms current systems when comparing supported language expressivity, maximum data size and inference speed.

Links and resources

Tags

    community

    • @sb3000
    • @gergie
    • @dblp
    @gergie's tags highlighted