Construction of disambiguated Folksonomy ontologies using Wikipedia
N. Tomuro, and A. Shepitsen. Proceedings of the 2009 Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources, page 42--50. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)
Abstract
One of the difficulties in using Folksonomies in computational systems is <i>tag ambiguity:</i> tags with multiple meanings. This paper presents a novel method for building Folksonomy tag ontologies in which the nodes are disambiguated. Our method utilizes a clustering algorithm called DSCBC, which was originally developed in Natural Language Processing (NLP), to derive <i>committees</i> of tags, each of which corresponds to one meaning or domain. In this work, we use Wikipedia as the external knowledge source for the domains of the tags. Using the committees, an ambiguous tag is identified as one which belongs to more than one committee. Then we apply a hierarchical agglomerative clustering algorithm to build an ontology of tags. The nodes in the derived ontology are disambiguated in that an ambiguous tag appears in several nodes in the ontology, each of which corresponds to one meaning of the tag. We evaluate the derived ontology for its <i>ontological density</i> (how close similar tags are placed), and its usefulness in applications, in particular for a personalized tag retrieval task. The results showed marked improvements over other approaches.
Description
Construction of disambiguated Folksonomy ontologies using Wikipedia
%0 Conference Paper
%1 Tomuro:2009:CDF:1699765.1699772
%A Tomuro, Noriko
%A Shepitsen, Andriy
%B Proceedings of the 2009 Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources
%C Stroudsburg, PA, USA
%D 2009
%I Association for Computational Linguistics
%K clustering disambiguate disambiguity folksonomy hierarchical ontology tagsense2012
%P 42--50
%T Construction of disambiguated Folksonomy ontologies using Wikipedia
%U http://dl.acm.org/citation.cfm?id=1699765.1699772
%X One of the difficulties in using Folksonomies in computational systems is <i>tag ambiguity:</i> tags with multiple meanings. This paper presents a novel method for building Folksonomy tag ontologies in which the nodes are disambiguated. Our method utilizes a clustering algorithm called DSCBC, which was originally developed in Natural Language Processing (NLP), to derive <i>committees</i> of tags, each of which corresponds to one meaning or domain. In this work, we use Wikipedia as the external knowledge source for the domains of the tags. Using the committees, an ambiguous tag is identified as one which belongs to more than one committee. Then we apply a hierarchical agglomerative clustering algorithm to build an ontology of tags. The nodes in the derived ontology are disambiguated in that an ambiguous tag appears in several nodes in the ontology, each of which corresponds to one meaning of the tag. We evaluate the derived ontology for its <i>ontological density</i> (how close similar tags are placed), and its usefulness in applications, in particular for a personalized tag retrieval task. The results showed marked improvements over other approaches.
%@ 978-1-932432-55-8
@inproceedings{Tomuro:2009:CDF:1699765.1699772,
abstract = {One of the difficulties in using Folksonomies in computational systems is <i>tag ambiguity:</i> tags with multiple meanings. This paper presents a novel method for building Folksonomy tag ontologies in which the nodes are disambiguated. Our method utilizes a clustering algorithm called DSCBC, which was originally developed in Natural Language Processing (NLP), to derive <i>committees</i> of tags, each of which corresponds to one meaning or domain. In this work, we use Wikipedia as the external knowledge source for the domains of the tags. Using the committees, an ambiguous tag is identified as one which belongs to more than one committee. Then we apply a hierarchical agglomerative clustering algorithm to build an ontology of tags. The nodes in the derived ontology are disambiguated in that an ambiguous tag appears in several nodes in the ontology, each of which corresponds to one meaning of the tag. We evaluate the derived ontology for its <i>ontological density</i> (how close similar tags are placed), and its usefulness in applications, in particular for a personalized tag retrieval task. The results showed marked improvements over other approaches.},
acmid = {1699772},
added-at = {2012-04-03T15:27:42.000+0200},
address = {Stroudsburg, PA, USA},
author = {Tomuro, Noriko and Shepitsen, Andriy},
biburl = {https://www.bibsonomy.org/bibtex/25015f34647b93101bc09e2bd672dec46/psinger},
booktitle = {Proceedings of the 2009 Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources},
description = {Construction of disambiguated Folksonomy ontologies using Wikipedia},
interhash = {17673253102b6f472ded9173ffaa2ebe},
intrahash = {5015f34647b93101bc09e2bd672dec46},
isbn = {978-1-932432-55-8},
keywords = {clustering disambiguate disambiguity folksonomy hierarchical ontology tagsense2012},
location = {Suntec, Singapore},
numpages = {9},
pages = {42--50},
publisher = {Association for Computational Linguistics},
series = {People's Web '09},
timestamp = {2012-04-03T15:27:42.000+0200},
title = {Construction of disambiguated Folksonomy ontologies using Wikipedia},
url = {http://dl.acm.org/citation.cfm?id=1699765.1699772},
year = 2009
}