copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Relaxed online SVMs for spam filtering

D. Sculley, and G. Wachman. Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, page 415--422. New York, NY, USA, ACM, (2007)
DOI: 10.1145/1277741.1277813

Abstract

Spam is a key problem in electronic communication, including large-scale email systems and the growing number of blogs. Content-based filtering is one reliable method of combating this threat in its various forms, but some academic researchers and industrial practitioners disagree on how best to filter spam. The former have advocated the use of Support Vector Machines (SVMs) for content-based filtering, as this machine learning methodology gives state-of-the-art performance for text classification. However, similar performance gains have yet to be demonstrated for online spam filtering. Additionally, practitioners cite the high cost of SVMs as reason to prefer faster (if less statistically robust) Bayesian methods. In this paper, we offer a resolution to this controversy. First, we show that online SVMs indeed give state-of-the-art classification performance on online spam filtering on large benchmark data sets. Second, we show that nearly equivalent performance may be achieved by a Relaxed Online SVM (ROSVM) at greatly reduced computational cost. Our results are experimentally verified on email spam, blog spam, and splog detection tasks.

Description

Relaxed online SVMs for spam filtering

Links and resources

BibTeX key: sculley2007relaxed
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
year: 2007
pages: 415--422
publisher: ACM
series: SIGIR '07
acmid: 1277813
location: Amsterdam, The Netherlands
isbn: 978-1-59593-597-7
numpages: 8
DOI: 10.1145/1277741.1277813
url: http://doi.acm.org/10.1145/1277741.1277813

@beate's tags highlighted

Cite this publication

@inproceedings{sculley2007relaxed, abstract = {Spam is a key problem in electronic communication, including large-scale email systems and the growing number of blogs. Content-based filtering is one reliable method of combating this threat in its various forms, but some academic researchers and industrial practitioners disagree on how best to filter spam. The former have advocated the use of Support Vector Machines (SVMs) for content-based filtering, as this machine learning methodology gives state-of-the-art performance for text classification. However, similar performance gains have yet to be demonstrated for online spam filtering. Additionally, practitioners cite the high cost of SVMs as reason to prefer faster (if less statistically robust) Bayesian methods. In this paper, we offer a resolution to this controversy. First, we show that online SVMs indeed give state-of-the-art classification performance on online spam filtering on large benchmark data sets. Second, we show that nearly equivalent performance may be achieved by a Relaxed Online SVM (ROSVM) at greatly reduced computational cost. Our results are experimentally verified on email spam, blog spam, and splog detection tasks.}, acmid = {1277813}, added-at = {2011-04-13T11:21:13.000+0200}, address = {New York, NY, USA}, author = {Sculley, D. and Wachman, Gabriel M.}, biburl = {https://www.bibsonomy.org/bibtex/26d352cba27cebac53debf90abaa8ca79/beate}, booktitle = {Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval}, description = {Relaxed online SVMs for spam filtering}, doi = {10.1145/1277741.1277813}, interhash = {2cb1f8322431fa82bacd951de9b8d4cf}, intrahash = {6d352cba27cebac53debf90abaa8ca79}, isbn = {978-1-59593-597-7}, keywords = {online-learning spam-detection svm}, location = {Amsterdam, The Netherlands}, numpages = {8}, pages = {415--422}, publisher = {ACM}, series = {SIGIR '07}, timestamp = {2011-04-13T11:21:37.000+0200}, title = {Relaxed online SVMs for spam filtering}, url = {http://doi.acm.org/10.1145/1277741.1277813}, year = 2007 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Relaxed online SVMs for spam filtering

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Relaxed online SVMs for spam filtering

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Relaxed online SVMs for spam filtering

Comments and Reviews
(0)