Filtering Spam by Using Factors Hyperbolic Trees

被引:0
|
作者
Hou, Hailong [1 ]
Chen, Yan [1 ]
Beyah, Raheem [1 ]
Zhang, Yan-Qing [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
spam; Bayesian algorithm; Ranked Term Frequency; fuzzy logic; factors hyperbolic trees;
D O I
10.1109/GLOCOM.2008.ECP.362
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most of current anti-spam techniques, like the Bayesian anti-spam algorithm, primarily use lexical matching for filtering unsolicited bulk E-mails (UBE) and unsolicited commercial E-mails (UCE). However, precision of spam filtering is usually low when the lexical matching algorithms are used in real dynamic environments. For example, an E-mail of refrigerator advertisements is useful for most families, but it is useless for Eskimos. The lexical matching anti-spam algorithms cannot distinguish such processed E-mails that are junk to most people but are useful for others. We propose a Factors Hyperbolic Tree (FHT) based algorithm that, unlike the lexical matching algorithms, handles spam filtering in a dynamic environment by considering various relevant factors. The new Ranked Term Frequency (RTF) algorithm is proposed to extract indicators from E-mails that are related to environmental factors. Type-1 and Type-2 fuzzy logic systems are used to evaluate the indicators and determine whether E-mails are spam based on the environmental factors. Additionally, weights of factors in a FHT database are continuously updated according to dynamic conditional factors in a real environment. Simulation results show that the FHT algorithm filters out spam with high precision. Furthermore, the FHT algorithm is more efficient than other methods when it filters E-mails with complex influencing factors. The main contribution of this paper is that the FHT based algorithm can filter E-mails based on influencing factors instead of matched words to allow dynamic filtering of spam E-mails.
引用
收藏
页数:5
相关论文
共 50 条
  • [11] Adaptive spam mail filtering using genetic algorithm
    Sanpakdee, U
    Walairacht, A
    Walairacht, S
    8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U441 - U445
  • [12] Spam filtering using statistical data compression models
    Bratko, Andrej
    Cormack, Gordon V.
    Filipic, Bogdan
    Lynam, Thomas R.
    Zupan, Blaz
    JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 2673 - 2698
  • [13] Email Spam Filtering
    Puertas Sanz, Enrique
    Gomez Hidalgo, Jose Maria
    Cortizo Perez, Jose Carlos
    ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
  • [14] Online Spam Filtering Using Support Vector Machines
    Amayri, Ola
    Bouguila, Nizar
    ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 337 - 340
  • [15] Adaptive spam filtering using dynamic feature spaces
    Zhou, Yan
    Mulekar, Madhuri S.
    Nerellapalli, Praveen
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (04) : 627 - 646
  • [16] Using LPP and LS-SVM For Spam Filtering
    Sun, Xia
    Zhang, Qingzhou
    Wang, Ziqiang
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 451 - 454
  • [17] Using Live Spam Beater (LiSB) Framework for Spam Filtering during SMTP Transactions
    Gomez-Meire, Silvana
    Gabriel Marquez, Cesar
    Patricia Aray-Cappello, Eliana
    Mendez, Jose R.
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [18] Image spam filtering using convolutional neural networks
    Fan Aiwan
    Yang Zhaofeng
    PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) : 1029 - 1037
  • [19] Using visual features for anti-SPAM filtering
    Wu, CT
    Cheng, KT
    Zhu, Q
    Wu, KL
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 2925 - 2928
  • [20] Image spam filtering using convolutional neural networks
    Fan Aiwan
    Yang Zhaofeng
    Personal and Ubiquitous Computing, 2018, 22 : 1029 - 1037