An empirical study of three machine learning methods for spam filtering

被引:43
|
作者
Lai, Chih-Chin [1 ]
机构
[1] Natl Univ Tainan, Dept Comp Sci & Informat Engn, Tainan 700, Taiwan
关键词
spam filtering; machine learning;
D O I
10.1016/j.knosys.2006.05.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing volumes of unsolicited bulk e-mail (also known as spam) are bringing more annoyance for most Internet users. Using a classifier based on a specific machine-learning technique to automatically filter out spam e-mail has drawn many researchers' attention. This paper is a comparative study the performance of three commonly used machine learning methods in spam filtering. On the other hand, we try to integrate two spam filtering methods to obtain better performance. A set of systematic experiments has been conducted with these methods which are applied to different parts of an e-mail. Experiments show that using the header only can achieve satisfactory performance, and the idea of integrating disparate methods is a promising way to fight spam. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:249 / 254
页数:6
相关论文
共 50 条
  • [41] Comparative Study of Machine Learning Algorithms for SMS Spam Detection
    Alzahrani, Amani
    Rawat, Danda B.
    2019 IEEE SOUTHEASTCON, 2019,
  • [42] Active Learning based Spam Filtering Method
    Zhang, Wei
    Gao, Feng
    Lv, Di
    Xue, Feng
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 3302 - 3306
  • [43] An empirical study of spam and spam vulnerable email accounts
    Dhinakaran, Cynthia
    Chae, Cheol-Joo
    Lee, Jae-Kwang
    Nagamalai, Dhinaharan
    PROCEEDINGS OF FUTURE GENERATION COMMUNICATION AND NETWORKING, MAIN CONFERENCE PAPERS, VOL 1, 2007, : 407 - +
  • [44] Application of Natural Language Processing and Machine Learning Boosted with Swarm Intelligence for Spam Email Filtering
    Bacanin, Nebojsa
    Zivkovic, Miodrag
    Stoean, Catalin
    Antonijevic, Milos
    Janicijevic, Stefana
    Sarac, Marko
    Strumberger, Ivana
    MATHEMATICS, 2022, 10 (22)
  • [45] SMS Spam Filtering on Multiple Background Datasets Using Machine Learning Techniques: A Novel Approach
    Kaliyar, Rohit Kumar
    Narang, Pratik
    Goswami, Anurag
    PROCEEDINGS OF THE 2018 IEEE 8TH INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC 2018), 2018, : 59 - 65
  • [46] An Empirical Study on Data Balancing in Machine Learning Based Software Traceability Methods
    Wang, Bangchao
    Wang, Zihan
    Wan, Hongyan
    Li, Xingfu
    Deng, Yang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [47] Machine learning experiment management tools: a mixed-methods empirical study
    Idowu, Samuel
    Osman, Osman
    Struber, Daniel
    Berger, Thorsten
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (04)
  • [48] Efficient feature selection methods in chinese spam filtering
    Xu, Yan
    Information Technology Journal, 2013, 12 (20) : 5492 - 5496
  • [49] Artificial Immune System Based Methods for Spam Filtering
    Tan, Ying
    Mi, Guyue
    Zhu, Yuanchun
    Deng, Chao
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 2484 - 2488
  • [50] Comparative study of three machine learning methods for software fault prediction
    Wang, Qi
    Zhu, Jie
    Yu, Bo
    Journal of Shanghai Jiaotong University (Science), 2005, 10 E (02) : 117 - 121