Email Spam Filtering

Cited by: 15
Authors
Puertas Sanz, Enrique [1]
Gomez Hidalgo, Jose Maria [2]
Cortizo Perez, Jose Carlos [3]
Affiliations
[1] Univ Europea Madrid, Madrid 28670, Spain
[2] Optenet, Madrid 28230, Spain
[3] AINet Solut, Madrid 28943, Spain
DOI
10.1016/S0065-2458(08)00603-7
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, email Spam has become an increasingly important problem, with a large economic impact on society. In this work, we present the problem of Spam, how it affects us, and how we can fight against it. We discuss legal, economic, and technical measures used to stop these unsolicited emails. Among the technical measures, those based on content analysis have been particularly effective in filtering Spam, so we focus on them and explain in detail how they work. In summary, we explain the structure and process of the different Machine Learning methods used for this task, and how they can be made cost-sensitive through methods such as threshold optimization, instance weighting, or MetaCost. We also discuss how to evaluate Spam filters using basic metrics, TREC metrics, and the receiver operating characteristic convex hull method, which best suits classification problems in which target conditions are not known, as is the case here. We then describe how actual filters are used in practice. Finally, we present the methods spammers use to attack Spam filters and what we can expect in the coming years in the battle of Spam filters against spammers.
Pages: 45-114
Number of pages: 70
Related papers
50 in total
  • [41] Machine learning for email spam filtering: review, approaches and open research problems
    Dada, Emmanuel Gbenga
    Bassi, Joseph Stephen
    Chiroma, Haruna
    Abdulhamid, Shafi'i Muhammad
    Adetunmbi, Adebayo Olusola
    Ajibuwa, Opeyemi Emmanuel
    HELIYON, 2019, 5 (06)
  • [42] Spam Filtering Email Classification (SFECM) using Gain and Graph Mining Algorithm
    Chae, M. K.
    Alsadoon, Abeer
    Prasad, P. W. C.
    Elchouemi, A.
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017
  • [43] Online active multi-field learning for efficient email spam filtering
    Liu, Wuying
    Wang, Ting
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (01) : 117 - 136
  • [44] Analysis of Naive Bayes Algorithm for Email Spam Filtering across Multiple Datasets
    Rusland, Nurul Fitriah
    Wahid, Norfaradilla
    Kasim, Shahreen
    Hafit, Hanayanti
    INTERNATIONAL RESEARCH AND INNOVATION SUMMIT (IRIS2017), 2017, 226
  • [45] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
    Wuying Liu
    Ting Wang
    International Journal of Computational Intelligence Systems, 2012, 5 : 505 - 518
  • [46] Spam Filtering Email Classification (SFECM) using Gain and Graph Mining Algorithm
    Chae, M. K.
    Alsadoon, Abeer
    Prasad, P. W. C.
    Sreedharan, Sasikumaran
    2017 2ND INTERNATIONAL CONFERENCE ON ANTI-CYBER CRIMES (ICACC), 2017 : 217 - 222
  • [47] Trusting Spam Reporters: A Reporter-Based Reputation System for Email Filtering
    Zheleva, Elena
    Kolcz, Aleksander
    Getoor, Lise
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2009, 27 (01)
  • [48] Tensor Flow-powered Spam Email Filtering: An Evaluation of Performance and Robustness
    Kankrale, Rajendra
    Jadhav, Tushar
    Kharat, Pravin A.
    Deshmukh, Trupti
    Pardeshi, Nilesh G.
    Karmode, Sayali
    Gore, Santosh
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 509 - 515
  • [49] Discovering Classification Rules for Email Spam Filtering with an Ant Colony Optimization Algorithm
    El-Alfy, El-Sayed M.
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009 : 1778 - 1783
  • [50] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
    Liu, Wuying
    Wang, Ting
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2012, 5 (03) : 505 - 518