Filtering Spam by Using Factors Hyperbolic Trees

被引：0

作者：

Hou, Hailong ^{[1
]}

Chen, Yan ^{[1
]}

Beyah, Raheem ^{[1
]}

Zhang, Yan-Qing ^{[1
]}

机构：

[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA

来源：

GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE | 2008年

关键词：

spam; Bayesian algorithm; Ranked Term Frequency; fuzzy logic; factors hyperbolic trees;

D O I：

10.1109/GLOCOM.2008.ECP.362

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Most of current anti-spam techniques, like the Bayesian anti-spam algorithm, primarily use lexical matching for filtering unsolicited bulk E-mails (UBE) and unsolicited commercial E-mails (UCE). However, precision of spam filtering is usually low when the lexical matching algorithms are used in real dynamic environments. For example, an E-mail of refrigerator advertisements is useful for most families, but it is useless for Eskimos. The lexical matching anti-spam algorithms cannot distinguish such processed E-mails that are junk to most people but are useful for others. We propose a Factors Hyperbolic Tree (FHT) based algorithm that, unlike the lexical matching algorithms, handles spam filtering in a dynamic environment by considering various relevant factors. The new Ranked Term Frequency (RTF) algorithm is proposed to extract indicators from E-mails that are related to environmental factors. Type-1 and Type-2 fuzzy logic systems are used to evaluate the indicators and determine whether E-mails are spam based on the environmental factors. Additionally, weights of factors in a FHT database are continuously updated according to dynamic conditional factors in a real environment. Simulation results show that the FHT algorithm filters out spam with high precision. Furthermore, the FHT algorithm is more efficient than other methods when it filters E-mails with complex influencing factors. The main contribution of this paper is that the FHT based algorithm can filter E-mails based on influencing factors instead of matched words to allow dynamic filtering of spam E-mails.

引用

页数：5

共 50 条

[11] Adaptive spam mail filtering using genetic algorithm
Sanpakdee, U
Walairacht, A
Walairacht, S
8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U441 - U445
[12] Spam filtering using statistical data compression models
Bratko, Andrej
Cormack, Gordon V.
Filipic, Bogdan
Lynam, Thomas R.
Zupan, Blaz
JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 2673 - 2698
[13] Email Spam Filtering
Puertas Sanz, Enrique
Gomez Hidalgo, Jose Maria
Cortizo Perez, Jose Carlos
ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
[14] Online Spam Filtering Using Support Vector Machines
Amayri, Ola
Bouguila, Nizar
ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 337 - 340
[15] Adaptive spam filtering using dynamic feature spaces
Zhou, Yan
Mulekar, Madhuri S.
Nerellapalli, Praveen
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (04) : 627 - 646
[16] Using LPP and LS-SVM For Spam Filtering
Sun, Xia
Zhang, Qingzhou
Wang, Ziqiang
2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 451 - 454
[17] Using Live Spam Beater (LiSB) Framework for Spam Filtering during SMTP Transactions
Gomez-Meire, Silvana
Gabriel Marquez, Cesar
Patricia Aray-Cappello, Eliana
Mendez, Jose R.
APPLIED SCIENCES-BASEL, 2022, 12 (20):
[18] Image spam filtering using convolutional neural networks
Fan Aiwan
Yang Zhaofeng
PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) : 1029 - 1037
[19] Using visual features for anti-SPAM filtering
Wu, CT
Cheng, KT
Zhu, Q
Wu, KL
2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 2925 - 2928
[20] Image spam filtering using convolutional neural networks
Fan Aiwan
Yang Zhaofeng
Personal and Ubiquitous Computing, 2018, 22 : 1029 - 1037

← 1 2 3 4 5 →