Filtering Spam by Using Factors Hyperbolic Trees

被引:0
|
作者
Hou, Hailong [1 ]
Chen, Yan [1 ]
Beyah, Raheem [1 ]
Zhang, Yan-Qing [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
spam; Bayesian algorithm; Ranked Term Frequency; fuzzy logic; factors hyperbolic trees;
D O I
10.1109/GLOCOM.2008.ECP.362
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most of current anti-spam techniques, like the Bayesian anti-spam algorithm, primarily use lexical matching for filtering unsolicited bulk E-mails (UBE) and unsolicited commercial E-mails (UCE). However, precision of spam filtering is usually low when the lexical matching algorithms are used in real dynamic environments. For example, an E-mail of refrigerator advertisements is useful for most families, but it is useless for Eskimos. The lexical matching anti-spam algorithms cannot distinguish such processed E-mails that are junk to most people but are useful for others. We propose a Factors Hyperbolic Tree (FHT) based algorithm that, unlike the lexical matching algorithms, handles spam filtering in a dynamic environment by considering various relevant factors. The new Ranked Term Frequency (RTF) algorithm is proposed to extract indicators from E-mails that are related to environmental factors. Type-1 and Type-2 fuzzy logic systems are used to evaluate the indicators and determine whether E-mails are spam based on the environmental factors. Additionally, weights of factors in a FHT database are continuously updated according to dynamic conditional factors in a real environment. Simulation results show that the FHT algorithm filters out spam with high precision. Furthermore, the FHT algorithm is more efficient than other methods when it filters E-mails with complex influencing factors. The main contribution of this paper is that the FHT based algorithm can filter E-mails based on influencing factors instead of matched words to allow dynamic filtering of spam E-mails.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Spam filtering using spam mail communities
    Deepak, P
    Parameswaran, S
    2005 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2005, : 377 - 383
  • [2] Filtering spam
    Editor & Publisher, 1999, (Suppl):
  • [3] Filtering spam
    Baker, B
    INTERNET WORLD, 1998, 9 (01): : 14 - 14
  • [4] Image spam filtering using visual information
    Biggio, Battista
    Fumera, Giorgio
    Pillai, Ignazio
    Roli, Fabio
    14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2007, : 105 - +
  • [5] Spam filtering using Kolmogorov complexity analysis
    Richard, G.
    Doncescu, A.
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2008, 4 (01) : 136 - 148
  • [6] On the Study of Anomaly-based Spam Filtering Using Spam as Representation of Normality
    Laorden, Carlos
    Ugarte-Pedrero, Xabier
    Santos, Igor
    Sanz, Borja
    Nieves, Javier
    Bringas, Pablo G.
    2012 IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2012, : 693 - 695
  • [7] Adaptive filtering of SPAM
    Pelletier, L
    Almhana, J
    Choulakian, V
    SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 218 - 224
  • [8] Spam filtering scheme
    Wang, Jing (wngjing@hotmail.com), 1600, Northeast University (35):
  • [9] Short Messages Spam Filtering Using Sentiment Analysis
    Ezpeleta, Enaitz
    Zurutuza, Urko
    Gomez Hidalgo, Jose Maria
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 142 - 153
  • [10] Adaptive spam mail filtering using genetic algorithm
    Sanpakdee, U
    Walairacht, A
    Walairacht, S
    8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U441 - U445