Filtering Spam by Using Factors Hyperbolic Trees

被引:0
|
作者
Hou, Hailong [1 ]
Chen, Yan [1 ]
Beyah, Raheem [1 ]
Zhang, Yan-Qing [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
spam; Bayesian algorithm; Ranked Term Frequency; fuzzy logic; factors hyperbolic trees;
D O I
10.1109/GLOCOM.2008.ECP.362
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most of current anti-spam techniques, like the Bayesian anti-spam algorithm, primarily use lexical matching for filtering unsolicited bulk E-mails (UBE) and unsolicited commercial E-mails (UCE). However, precision of spam filtering is usually low when the lexical matching algorithms are used in real dynamic environments. For example, an E-mail of refrigerator advertisements is useful for most families, but it is useless for Eskimos. The lexical matching anti-spam algorithms cannot distinguish such processed E-mails that are junk to most people but are useful for others. We propose a Factors Hyperbolic Tree (FHT) based algorithm that, unlike the lexical matching algorithms, handles spam filtering in a dynamic environment by considering various relevant factors. The new Ranked Term Frequency (RTF) algorithm is proposed to extract indicators from E-mails that are related to environmental factors. Type-1 and Type-2 fuzzy logic systems are used to evaluate the indicators and determine whether E-mails are spam based on the environmental factors. Additionally, weights of factors in a FHT database are continuously updated according to dynamic conditional factors in a real environment. Simulation results show that the FHT algorithm filters out spam with high precision. Furthermore, the FHT algorithm is more efficient than other methods when it filters E-mails with complex influencing factors. The main contribution of this paper is that the FHT based algorithm can filter E-mails based on influencing factors instead of matched words to allow dynamic filtering of spam E-mails.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Multi-objective Spam Filtering Using an Evolutionary Algorithm
    Dudley, James
    Barone, Luigi
    While, Lyndon
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 123 - 130
  • [42] Filtering Chinese Image Spam Using Pseudo-OCR
    XU Bin
    LI Ruiguang
    LIU Yashu
    YAN Hanbing
    LI Siyuan
    ZHANG Honggang
    Chinese Journal of Electronics, 2015, 24 (01) : 134 - 139
  • [43] Spam Filtering in Twitter Using Sender-Receiver Relationship
    Song, Jonghyuk
    Lee, Sangho
    Kim, Jong
    RECENT ADVANCES IN INTRUSION DETECTION, 2011, 6961 : 301 - +
  • [44] Using Cellular Automata for Improving KNN Based Spam Filtering
    Barigou, Fatiha
    Beldjilali, Bouziane
    Atmani, Baghdad
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2014, 11 (04) : 345 - 353
  • [45] SMS Spam Filtering using Supervised Machine Learning Algorithms
    Navaney, Pavas
    Dubey, Gaurav
    Rana, Ajay
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE CONFLUENCE 2018 ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING, 2018, : 43 - 48
  • [46] Towards Filtering Spam Mails using Dimensionality Reduction Methods
    Thomas, Josin
    Raj, Nisha S.
    Vinod, P.
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 163 - 168
  • [47] Filtering short message spam of group sending using CAPTCHA
    He, Peizhou
    Sun, Yong
    Zheng, Wei
    Wen, Xiangming
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 558 - 561
  • [48] Collaborative spam filtering using e-mail networks
    Kong, Joseph S.
    Rezaei, Behnam A.
    Sarshar, Nima
    Roychowdhury, Vwani P.
    Boykin, P. Oscar
    COMPUTER, 2006, 39 (08) : 67 - +
  • [49] Spam Filtering using Association Rules and Naive Bayes Classifier
    Yang, Tianda
    Qian, Kai
    Lo, Dan Chia-Tien
    Al Nasr, Kamal
    Qian, Ying
    PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 638 - 642
  • [50] Filtering Image Spam Using File Properties and Color Histogram
    He, Peizhou
    Wen, Xiangming
    Zheng, Wei
    Lin, Xinqi
    2008 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 276 - 279