An empirical study of three machine learning methods for spam filtering

被引:43
|
作者
Lai, Chih-Chin [1 ]
机构
[1] Natl Univ Tainan, Dept Comp Sci & Informat Engn, Tainan 700, Taiwan
关键词
spam filtering; machine learning;
D O I
10.1016/j.knosys.2006.05.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing volumes of unsolicited bulk e-mail (also known as spam) are bringing more annoyance for most Internet users. Using a classifier based on a specific machine-learning technique to automatically filter out spam e-mail has drawn many researchers' attention. This paper is a comparative study the performance of three commonly used machine learning methods in spam filtering. On the other hand, we try to integrate two spam filtering methods to obtain better performance. A set of systematic experiments has been conducted with these methods which are applied to different parts of an e-mail. Experiments show that using the header only can achieve satisfactory performance, and the idea of integrating disparate methods is a promising way to fight spam. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:249 / 254
页数:6
相关论文
共 50 条
  • [1] A review of machine learning approaches to Spam filtering
    Guzella, Thiago S.
    Caminhas, Walmir M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (07) : 10206 - 10222
  • [2] A survey of machine learning techniques for Spam filtering
    Saad, Omar
    Darwish, Ashraf
    Faraj, Ramadan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2012, 12 (02): : 66 - 73
  • [3] A Survey of Machine Learning Techniques for Spam Filtering
    Saad, Omar
    Hassanien, Aboul Ella
    Darwish, Ashraf
    Faraj, Ramadan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (01): : 103 - 110
  • [4] A Comparative Study of Machine Learning Techniques in Blog Comments Spam Filtering
    Romero, C.
    Valdez, M. Garcia
    Alanis, A.
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [5] A Machine Learning based Web Spam Filtering Approach
    Kumar, Santosh
    Gao, Xiaoying
    Welch, Ian
    Mansoori, Masood
    IEEE 30TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS IEEE AINA 2016, 2016, : 973 - 980
  • [6] An empirical performance comparison of machine learning methods for spam e-mail categorization
    Lai, CC
    Tsai, MC
    HIS'04: FOURTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 44 - 48
  • [7] Comparative Study of Feature Reduction and Machine Learning Methods for Spam Detection
    Agarwal, Basant
    Mittal, Namita
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 761 - 769
  • [8] SMS Spam Filtering using Supervised Machine Learning Algorithms
    Navaney, Pavas
    Dubey, Gaurav
    Rana, Ajay
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE CONFLUENCE 2018 ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING, 2018, : 43 - 48
  • [9] Architecture of adaptive spam filtering based on machine learning algorithms
    Islam, Md Rafiqul
    Zhou, Wanlei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2007, 4494 : 458 - +
  • [10] A novel machine learning approach to spam filtering based on relevance vector machine
    School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
    不详
    J. Comput. Inf. Syst., 2008, 5 (2203-2210): : 2203 - 2210