Adaptive email spam filtering based on information theory

被引:0
|
作者
Zhang, Xin [1 ]
Dai, Wenyuan [1 ]
Xue, Gui-Rong [1 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
关键词
adaptive; email spam; information theory;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most previous email spam filtering techniques rely on traditional classification learning which assumes the data from training and test sets are drawn from the same underlying distribution. However, in practice, this identical-distribution assumption often violates. In general, email service providers collect training data from various public available resources, while the tasks focus on users' individual inboxes. Topics in the mail-boxes vary among different users, and distributions shift as a result. In this paper, we propose an adaptive email spam filtering algorithm based on information theory which relaxes the identical-distribution assumption and adapts the knowledge learned from one distribution to another. Our work focuses on the content analysis which minimizes the loss in mutual information between email instances and word features, before and after classification. We present theoretical and empirical analyses to show that our algorithm is able to solve the adaptive email spam filtering problem well. The experimental results show that our algorithm greatly improves the accuracy of email filtering, against the traditional classification algorithms, while scaling very well.
引用
收藏
页码:159 / 170
页数:12
相关论文
共 50 条
  • [1] ADAPTIVE PRIVACY POLICY PREDICTION FOR EMAIL SPAM FILTERING
    Rajendran, P.
    Hemalatha, S. M.
    Janaki, M.
    Durkananthini, B.
    2016 WORLD CONFERENCE ON FUTURISTIC TRENDS IN RESEARCH AND INNOVATION FOR SOCIAL WELFARE (STARTUP CONCLAVE), 2016,
  • [2] Efficient spam email filtering using adaptive ontology
    Youn, Seongwook
    McLeod, Dennis
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 249 - +
  • [3] Email Spam Filtering
    Puertas Sanz, Enrique
    Gomez Hidalgo, Jose Maria
    Cortizo Perez, Jose Carlos
    ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
  • [4] Email Spam Filtering Based on the MNMF Algorithm
    Liu, Zun-xiong
    Tian, Shan-shan
    Huang, Zhi-qiang
    Liu, Jiang-wei
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (01): : 31 - 44
  • [5] Filtering spam email based on retry patterns
    Lieven, Peter
    Scheuermann, Bjoern
    Stini, Michael
    Mauve, Martin
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 1515 - 1520
  • [6] A Collaborative Abstraction Based Email Spam Filtering with Fingerprints
    Rajendran, P.
    Tamilarasi, A.
    Mynavathi, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 123 (02) : 1913 - 1923
  • [7] A Collaborative Abstraction Based Email Spam Filtering with Fingerprints
    P. Rajendran
    A. Tamilarasi
    R. Mynavathi
    Wireless Personal Communications, 2022, 123 : 1913 - 1923
  • [8] Identification of Spam Email Based on Information from Email Header
    bin Abd Razak, Shukor
    Bin Mohamad, Ahmad Fahrulrazie
    2013 13TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2013, : 347 - 353
  • [9] Selected methods of spam filtering in email
    Miszalska, Izabella
    Zabierowski, Wojciech
    Napieralski, Andrzej
    2007 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2007, : 507 - 513
  • [10] Symbiotic filtering for spam email detection
    Lopes, Clotilde
    Cortez, Paulo
    Sousa, Pedro
    Rocha, Miguel
    Rio, Miguel
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9365 - 9372