Email Spam Filtering

Cited by: 15
Authors
Puertas Sanz, Enrique [1]
Gomez Hidalgo, Jose Maria [2]
Cortizo Perez, Jose Carlos [3]
Affiliations
[1] Univ Europea Madrid, Madrid 28670, Spain
[2] Optenet, Madrid 28230, Spain
[3] AINet Solut, Madrid 28943, Spain
DOI
10.1016/S0065-2458(08)00603-7
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, email Spam has become an increasingly important problem, with a major economic impact on society. In this work, we present the problem of Spam, how it affects us, and how we can fight against it. We discuss legal, economic, and technical measures used to stop these unsolicited emails. Among the technical measures, those based on content analysis have been particularly effective in filtering Spam, so we focus on them and explain in detail how they work. In particular, we explain the structure and operation of the different Machine Learning methods used for this task, and how they can be made cost-sensitive through methods such as threshold optimization, instance weighting, or MetaCost. We also discuss how to evaluate Spam filters using basic metrics, TREC metrics, and the receiver operating characteristic convex hull method, which best suits classification problems in which the target conditions are not known, as is the case here. We then describe how actual filters are used in practice. Finally, we present the methods spammers use to attack Spam filters and what we can expect in the coming years in the battle between Spam filters and spammers.
Pages: 45-114
Page count: 70
Related papers
50 in total
  • [22] A Three-Way Decision Approach to Email Spam Filtering
    Zhou, Bing
    Yao, Yiyu
    Luo, Jigang
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 28 - 39
  • [23] A suffix tree approach to anti-spam email filtering
    Rajesh Pampapathi
    Boris Mirkin
    Mark Levene
    Machine Learning, 2006, 65 : 309 - 338
  • [24] On the Relative Age of Spam and Ham Training Samples for Email Filtering
    Cormack, Gordon V.
    da Cruz, Jose-Marcio Martins
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 744 - 745
  • [25] Comparison of Deep and Traditional Learning Methods for Email Spam Filtering
    Sheneamer, Abdullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) : 560 - 565
  • [26] Spam Email Filtering Using Network-Level Properties
    Cortez, Paulo
    Correia, Andre
    Sousa, Pedro
    Rocha, Miguel
    Rio, Miguel
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 476 - +
  • [27] Filtering obfuscated email spam by means of phonetic string matching
    Freschi, Valerio
    Seraghiti, Andrea
    Bogliolo, Alessandro
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 505 - 509
  • [28] A survey of learning-based techniques of email spam filtering
    Blanzieri, Enrico
    Bryl, Anton
    ARTIFICIAL INTELLIGENCE REVIEW, 2008, 29 (01) : 63 - 92
  • [30] A Study of Neighbor Users Selection in Email Networks for Spam Filtering
    Wang, Yongchao
    Chao, Yuyan
    He, Lifeng
    ICCNS 2018: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORK SECURITY, 2018, : 22 - 26