Towards the semantic web: Collaborative tag suggestions
Z. Xu, Y. Fu, J. Mao, and D. Su. Collaborative Web Tagging Workshop at WWW2006, Edinburgh, Scotland, May, (2006)
Abstract
Content organization over the Internet went through several
interesting phases of evolution: from structured directories to
unstructured Web search engines and more recently, to tagging
as a way for aggregating information, a step towards the
semantic web vision. Tagging allows ranking and data
organization to directly utilize inputs from end users, enabling
machine processing of Web content. Since tags are created by
individual users in a free form, one important problem facing
tagging is to identify most appropriate tags, while eliminating
noise and spam. For this purpose, we define a set of general
criteria for a good tagging system. These criteria include high
coverage of multiple facets to ensure good recall, least effort to
reduce the cost involved in browsing, and high popularity to
ensure tag quality. We propose a collaborative tag suggestion
algorithm using these criteria to spot high-quality tags. The
proposed algorithm employs a goodness measure for tags derived
from collective user authorities to combat spam. The goodness
measure is iteratively adjusted by a reward-penalty algorithm,
which also incorporates other sources of tags, e.g., content-based
auto-generated tags. Our experiments based on My Web 2.0 show
that the algorithm is effective.
%0 Journal Article
%1 xu2006tsw
%A Xu, Z.
%A Fu, Y.
%A Mao, J.
%A Su, D.
%D 2006
%J Collaborative Web Tagging Workshop at WWW2006, Edinburgh, Scotland, May
%K collaboration recommender semanticweb tagging web2.0
%T Towards the semantic web: Collaborative tag suggestions
%X Content organization over the Internet went through several
interesting phases of evolution: from structured directories to
unstructured Web search engines and more recently, to tagging
as a way for aggregating information, a step towards the
semantic web vision. Tagging allows ranking and data
organization to directly utilize inputs from end users, enabling
machine processing of Web content. Since tags are created by
individual users in a free form, one important problem facing
tagging is to identify most appropriate tags, while eliminating
noise and spam. For this purpose, we define a set of general
criteria for a good tagging system. These criteria include high
coverage of multiple facets to ensure good recall, least effort to
reduce the cost involved in browsing, and high popularity to
ensure tag quality. We propose a collaborative tag suggestion
algorithm using these criteria to spot high-quality tags. The
proposed algorithm employs a goodness measure for tags derived
from collective user authorities to combat spam. The goodness
measure is iteratively adjusted by a reward-penalty algorithm,
which also incorporates other sources of tags, e.g., content-based
auto-generated tags. Our experiments based on My Web 2.0 show
that the algorithm is effective.
@article{xu2006tsw,
abstract = {Content organization over the Internet went through several
interesting phases of evolution: from structured directories to
unstructured Web search engines and more recently, to tagging
as a way for aggregating information, a step towards the
semantic web vision. Tagging allows ranking and data
organization to directly utilize inputs from end users, enabling
machine processing of Web content. Since tags are created by
individual users in a free form, one important problem facing
tagging is to identify most appropriate tags, while eliminating
noise and spam. For this purpose, we define a set of general
criteria for a good tagging system. These criteria include high
coverage of multiple facets to ensure good recall, least effort to
reduce the cost involved in browsing, and high popularity to
ensure tag quality. We propose a collaborative tag suggestion
algorithm using these criteria to spot high-quality tags. The
proposed algorithm employs a goodness measure for tags derived
from collective user authorities to combat spam. The goodness
measure is iteratively adjusted by a reward-penalty algorithm,
which also incorporates other sources of tags, e.g., content-based
auto-generated tags. Our experiments based on My Web 2.0 show
that the algorithm is effective.},
added-at = {2007-05-16T16:20:56.000+0200},
author = {Xu, Z. and Fu, Y. and Mao, J. and Su, D.},
biburl = {https://www.bibsonomy.org/bibtex/2fc8c03e106e0934a550b76e3b94bab6d/beate},
interhash = {e18fd92b0ffa21b9f0cbb3a2fe15b873},
intrahash = {fc8c03e106e0934a550b76e3b94bab6d},
journal = {Collaborative Web Tagging Workshop at WWW2006, Edinburgh, Scotland, May},
keywords = {collaboration recommender semanticweb tagging web2.0},
timestamp = {2008-12-09T16:32:51.000+0100},
title = {{Towards the semantic web: Collaborative tag suggestions}},
year = 2006
}