Filtering Spam by Using Factors Hyperbolic Trees

被引：0

作者：

Hou, Hailong ^{[1
]}

Chen, Yan ^{[1
]}

Beyah, Raheem ^{[1
]}

Zhang, Yan-Qing ^{[1
]}

机构：

[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA

来源：

GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE | 2008年

关键词：

spam; Bayesian algorithm; Ranked Term Frequency; fuzzy logic; factors hyperbolic trees;

D O I：

10.1109/GLOCOM.2008.ECP.362

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Most of current anti-spam techniques, like the Bayesian anti-spam algorithm, primarily use lexical matching for filtering unsolicited bulk E-mails (UBE) and unsolicited commercial E-mails (UCE). However, precision of spam filtering is usually low when the lexical matching algorithms are used in real dynamic environments. For example, an E-mail of refrigerator advertisements is useful for most families, but it is useless for Eskimos. The lexical matching anti-spam algorithms cannot distinguish such processed E-mails that are junk to most people but are useful for others. We propose a Factors Hyperbolic Tree (FHT) based algorithm that, unlike the lexical matching algorithms, handles spam filtering in a dynamic environment by considering various relevant factors. The new Ranked Term Frequency (RTF) algorithm is proposed to extract indicators from E-mails that are related to environmental factors. Type-1 and Type-2 fuzzy logic systems are used to evaluate the indicators and determine whether E-mails are spam based on the environmental factors. Additionally, weights of factors in a FHT database are continuously updated according to dynamic conditional factors in a real environment. Simulation results show that the FHT algorithm filters out spam with high precision. Furthermore, the FHT algorithm is more efficient than other methods when it filters E-mails with complex influencing factors. The main contribution of this paper is that the FHT based algorithm can filter E-mails based on influencing factors instead of matched words to allow dynamic filtering of spam E-mails.

引用

页数：5

共 50 条

[1] Spam filtering using spam mail communities
Deepak, P
Parameswaran, S
2005 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2005, : 377 - 383
[2] Filtering spam
Editor & Publisher, 1999, (Suppl):
[3] Filtering spam
Baker, B
INTERNET WORLD, 1998, 9 (01): : 14 - 14
[4] Image spam filtering using visual information
Biggio, Battista
Fumera, Giorgio
Pillai, Ignazio
Roli, Fabio
14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2007, : 105 - +
[5] Spam filtering using Kolmogorov complexity analysis
Richard, G.
Doncescu, A.
INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2008, 4 (01) : 136 - 148
[6] On the Study of Anomaly-based Spam Filtering Using Spam as Representation of Normality
Laorden, Carlos
Ugarte-Pedrero, Xabier
Santos, Igor
Sanz, Borja
Nieves, Javier
Bringas, Pablo G.
2012 IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2012, : 693 - 695
[7] Adaptive filtering of SPAM
Pelletier, L
Almhana, J
Choulakian, V
SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 218 - 224
[8] Spam filtering scheme
Wang, Jing (wngjing@hotmail.com), 1600, Northeast University (35):
[9] Short Messages Spam Filtering Using Sentiment Analysis
Ezpeleta, Enaitz
Zurutuza, Urko
Gomez Hidalgo, Jose Maria
TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 142 - 153
[10] Adaptive spam mail filtering using genetic algorithm
Sanpakdee, U
Walairacht, A
Walairacht, S
8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U441 - U445

← 1 2 3 4 5 →