On enhancing the performance of spam mail filtering system using semantic enrichment

被引:0
|
作者
Kim, HJ [1 ]
Kim, HN [1 ]
Jung, JJ [1 ]
Jo, GS [1 ]
机构
[1] Inha Univ, Sch Comp & Informat Engn, Intelligent E Commerce Syst Lab, Inchon 402751, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosive growth of the Internet, e-mails are regarded as one of the most important methods to send e-mails as a substitute for traditional communications. As e-mail has become a major mean of communication in the Internet age, exponentially growing spam mails have been raised as a main problem. As a result of this problem, researchers have suggested many methodologies to solve it. Especially, Bayesian classifier-based systems show high performances to filter spam mail and many commercial products available. However, they have several problems. First, it has a cold start problem, that is, training phase has to be done before execution of the system. The system must be trained about spam and non-spam mail. Second, its cost for filtering spam mail is higher than rule-based systems. Last problem, we focus on, is that the filtering performance is decreased when E-mail has only a few terms which represent its contents. To solve this problem, we suggest spam mail filtering system using concept indexing and Semantic Enrichment. For the performance evaluation, we compare our experimental results with those of Bayesian classifier which is widely used in spam mail filtering. The experimental result shows that the proposed system has improved performance in comparison with Bayesian classifier respectively.
引用
收藏
页码:1095 / 1100
页数:6
相关论文
共 50 条
  • [41] Stacking classifiers for anti-spam filtering of e-mail
    Sakkis, G
    Androutsopoulos, I
    Paliouras, G
    Karkaletsis, V
    Spyropoulos, CD
    Stamatopoulos, P
    PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2001, : 44 - 50
  • [42] An SMS Spam Filtering System Using Support Vector Machine
    Joe, Inwhee
    Shim, Hyetaek
    FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 577 - 584
  • [43] E-mail Spam Filtering using Genetic Algorithm based on Probabilistic Weights and Words Count
    Bhattacharya, Pronaya
    Singh, Arunendra
    INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2020, 12 (01): : 40 - 49
  • [44] Time-efficient spam e-mail filtering using n-gram models
    Ciltik, Ali
    Gungor, Tunga
    PATTERN RECOGNITION LETTERS, 2008, 29 (01) : 19 - 33
  • [45] Spam mail templates using genetic algorithm
    Walairacht, Aranya
    IMECS 2007: International Multiconference of Engineers and Computer Scientists, Vols I and II, 2007, : 137 - 140
  • [46] The Adaptive SPAM Mail Detection System using Clustering based on Text Mining
    Hong, Sung-Sam
    Kong, Jong-Hwan
    Han, Myung-Mook
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (06): : 2186 - 2196
  • [47] Spam filtering based on supervised latent semantic features extraction
    Zeng, Qingpeng
    Wu, Shuixiu
    Wang, Mingwen
    Journal of Computational Information Systems, 2008, 4 (03): : 1299 - 1306
  • [48] Research on a distributed spam filtering system
    Zhang, Qiuyu
    Sun, Jingtao
    Huang, Wenhan
    DCABES 2007 PROCEEDINGS, VOLS I AND II, 2007, : 513 - 517
  • [49] Enhancing Media Enrichment by Semantic Extraction
    Krug, Michael
    Wiedemann, Fabian
    Gaedke, Martin
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 111 - 114
  • [50] Feature selection by fuzzy inference and its application to spam-mail filtering
    Kim, JW
    Kang, SJ
    COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 361 - 366