
Identifying Web Spam with the Wisdom of the Crowds

Y. Liu, F. Chen, W. Kong, H. Yu, M. Zhang, S. Ma, and L. Ru. ACM Trans. Web, (March 2012)
DOI: 10.1145/2109205.2109207

Abstract

Combating Web spam has become one of the top challenges for Web search engines. State-of-the-art spam-detection techniques are usually designed for specific, known types of Web spam and are incapable of dealing with newly appearing spam types efficiently. With user-behavior analyses from Web access logs, a spam page-detection algorithm is proposed based on a learning scheme. The main contributions are the following. (1) User-visiting patterns of spam pages are studied, and a number of user-behavior features are proposed for separating Web spam pages from ordinary pages. (2) A novel spam-detection framework is proposed that can detect various kinds of Web spam, including newly appearing ones, with the help of the user-behavior analysis. Experiments on large-scale practical Web access log data show the effectiveness of the proposed features and the detection framework.

Links and resources

BibTeX key: citeulike:10560379
entry type: article
address: New York, NY, USA
year: 2012
month: mar
journal: ACM Trans. Web
number: 1
publisher: ACM
volume: 6
citeulike-article-id: 10560379
citeulike-linkout-1: http://dx.doi.org/10.1145/2109205.2109207
priority: 2
posted-at: 2012-04-12 19:07:53
issn: 1559-1131
citeulike-linkout-0: http://portal.acm.org/citation.cfm?id=2109207
DOI: 10.1145/2109205.2109207
url: http://dx.doi.org/10.1145/2109205.2109207


Meta data

Last update 4 years ago
Created 7 years ago

