Support vector machines for spam categorization

被引:771
|
作者
Drucker, H [1 ]
Wu, DH
Vapnik, VN
机构
[1] AT&T Bell Labs, Res, Red Bank, NJ 07701 USA
[2] Monmouth Univ, Dept Elect Engn, W Long Branch, NJ 07764 USA
[3] Rensselaer Polytech Inst, Troy, NY 12181 USA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1999年 / 10卷 / 05期
关键词
boosting algorithms; classification; e-mail; feature representation; Ripper; Rocchio; support vector machines;
D O I
10.1109/72.788645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the use of support vector machines (SVM's) In classifying e-mail as spam or nonspam by comparing it to three other classification algorithms: Ripper, Rocchio, and boosting decision trees, These four algorithms were tested on two different data sets: one data set where the number of features were constrained to the 1000 best features and another data set where the dimensionality was over 7000, SVM's performed best when using binary features. For both data sets, boosting trees and SVM's had acceptable test performance in terms of accuracy and speed. However, SVM's had significantly less training time.
引用
收藏
页码:1048 / 1054
页数:7
相关论文
共 50 条
  • [1] Performance analysis of Naive Bayes classification, support vector machines and neural networks for spam categorization
    Tantug, A. C. neyd
    Eryigit, G. lsen
    [J]. APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 495 - 504
  • [2] Evolutionary support vector machines for spam filtering
    Stoean, Ruxandra
    Stoean, Catalin
    Preuss, Mike
    Dumitrescu, D.
    [J]. 5TH ROEDUNET IEEE INTERNATIONAL CONFERENCE, PROCEEDINGS, 2006, : 261 - 265
  • [3] Online Spam Filtering Using Support Vector Machines
    Amayri, Ola
    Bouguila, Nizar
    [J]. ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 337 - 340
  • [4] Document categorization using support vector machines
    Villasana, Sergio
    Seijas, Cesar
    Caralli, Antonino
    Jimenez, Jesus
    Pacheco, Jose
    [J]. INGENIERIA UC, 2008, 15 (03): : 45 - 52
  • [5] Transductive support vector machine for personal inboxes spam categorization
    Xu, Chao
    Zhou, Yiming
    [J]. CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 459 - 463
  • [6] Using of support vector machines for link spam detection
    Sharapov, Ruslan V.
    Sharapova, Ekaterina V.
    [J]. INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [7] A study of spam filtering using support vector machines
    Ola Amayri
    Nizar Bouguila
    [J]. Artificial Intelligence Review, 2010, 34 : 73 - 108
  • [8] A study of spam filtering using support vector machines
    Amayri, Ola
    Bouguila, Nizar
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (01) : 73 - 108
  • [9] Extreme Learning Machines and Support Vector Machines Models for Email spam detection
    Olatunji, Sunday Olusanya
    [J]. 2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
  • [10] A Method of Spam Filtering Based on Weighted Support Vector Machines
    Chen Xiao-li
    Liu Pei-yu
    Zhu Zhen-fang
    Qiu Ye
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 947 - 950