Email Spam Filtering

被引:15
|
作者
Puertas Sanz, Enrique [1 ]
Gomez Hidalgo, Jose Maria [2 ]
Cortizo Perez, Jose Carlos [3 ]
机构
[1] Univ Europea Madrid, Madrid 28670, Spain
[2] Optenet, Madrid 28230, Spain
[3] AINet Solut, Madrid 28943, Spain
关键词
D O I
10.1016/S0065-2458(08)00603-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, email Spam has become an increasingly important problem, with a big economic impact in society. In this work, we present the problem of Spam, how it affects us, and how we can fight against it. We discuss legal, economic, and technical measures used to stop these unsolicited emails. Among all the technical measures, those based on content analysis have been particularly effective in filtering Spam, so we focus on them, explaining how they work in detail. In summary, we explain the structure and the process of different Machine Learning methods used for this task, and how we can make them to be cost sensitive through several methods like threshold optimization, instance weighting, or MetaCost. We also discuss how to evaluate Spam filters using basic metrics, TREC metrics, and the receiver operating characteristic convex bull method, that best suits classification problems in which target conditions are not known, as it is the case. We also describe how actual filters are used in practice. We also present different methods used by spammers to attack Spam filters and what we can expect to find in the coming years in the battle of Spam filters against spammers.
引用
收藏
页码:45 / 114
页数:70
相关论文
共 50 条
  • [1] Selected methods of spam filtering in email
    Miszalska, Izabella
    Zabierowski, Wojciech
    Napieralski, Andrzej
    2007 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2007, : 507 - 513
  • [2] Symbiotic filtering for spam email detection
    Lopes, Clotilde
    Cortez, Paulo
    Sousa, Pedro
    Rocha, Miguel
    Rio, Miguel
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9365 - 9372
  • [3] Filtering and email pricing as solutions to spam
    Eaton, B. Curtis
    MacDonald, Ian A.
    Meriluoto, Laura
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2013, 46 (03): : 881 - 899
  • [4] Email Spam Filtering Based on the MNMF Algorithm
    Liu, Zun-xiong
    Tian, Shan-shan
    Huang, Zhi-qiang
    Liu, Jiang-wei
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (01): : 31 - 44
  • [5] Spam filtering and email-mediated applications
    Li, Wenbin
    Zhong, Ning
    Yao, Y. Y.
    Liu, Jiming
    Liu, Chunnian
    WEB INTELLIGENCE MEETS BRAIN INFORMATICS, 2007, 4845 : 382 - 405
  • [6] Efficient Feature Set for Spam Email Filtering
    Varghese, Reshma
    Dhanya, K. A.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 732 - 737
  • [7] Unsupervised feature learning for spam email filtering
    Diale, Melvin
    Celik, Turgay
    Van Der Walt, Christiaan
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 74 : 89 - 104
  • [8] Filtering spam email based on retry patterns
    Lieven, Peter
    Scheuermann, Bjoern
    Stini, Michael
    Mauve, Martin
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 1515 - 1520
  • [9] Structured ensemble learning for email spam filtering
    Liu, W. (wyliu@nudt.edu.cn), 2012, Science Press (49):
  • [10] On extendable software architecture for spam email filtering
    Ma, Wanli
    Tran, Dat
    Sharma, Dharmendra
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 924 - +