Spam E-Mail Classification Based on the IFWB Algorithm

被引:0
|
作者
Jou, Chichang [1 ]
机构
[1] Tamkang Univ, Dept Informat Management, New Taipei City 25137, Taiwan
关键词
spam classification; incremental forgetting; misclassification cost; CONCEPT DRIFT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of spam e-mails has been addressed for some time. Most of the solutions are based on spam e-mail classification and filtering. However, the content of spam e-mails drifts with new concepts or social events. Thus, several spam classifiers perform effectively when their models are initially established, and their performances deteriorate with time. A learning mechanism is required to adjust the classification parameters for new and old e-mails. Because of the spread of spam e-mails, the number of spam e-mails is larger than that of legitimate e-mails. Therefore, most classifiers produce high recall for spam e-mails and low recall for legitimate e-mails. Based on the Bayesian algorithm, we propose an incremental forgetting weighted algorithm with a misclassification cost mechanism that extracts features by IGICF (Information Gain and Inverse Class Frequency) to address the problem of concept drift and data skew in spam e-mail classification. We implemented the algorithm and performed detailed tests on the effectiveness of the mechanism.
引用
收藏
页码:314 / 324
页数:11
相关论文
共 50 条
  • [1] Spam Classification Based on E-Mail Path Analysis
    Palla, Srikanth
    Dantu, Ram
    Cangussu, Joao W.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2008, 2 (02) : 46 - 69
  • [2] Cloud e-mail security: An accurate e-mail spam classification based on enhanced binary differential evolution (BDE) algorithm
    Hamed, Nadir O.
    Samak, Ahmed H.
    Ahmad, Mostafa A.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5943 - 5955
  • [3] Voting-based Classification for E-mail Spam Detection
    Al-Shboul, Bashar
    Hakh, Heba
    Faris, Hossam
    Aljarah, Ibrahim
    Alsawalqah, Hamad
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2016, 10 (01) : 29 - 42
  • [4] E-mail Spam Classification Using Grasshopper Optimization Algorithm and Neural Networks
    Ghaleb, Sanaa A. A.
    Mohamad, Mumtazimah
    Fadzli, Syed Abdullah
    Ghanem, Waheed A. H. M.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 4749 - 4766
  • [5] A Multiobjective Evolutionary Algorithm for Spam E-mail Filtering
    Lopez-Herrera, A. G.
    Herrera-Viedma, E.
    Herrera, F.
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 366 - +
  • [6] An improved Bayes algorithm for filtering spam e-mail
    Wang, Meizhen
    Li, Zhitang
    Wu, Hantao
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2009, 37 (08): : 27 - 30
  • [7] Content Based Spam E-mail Filtering
    Liu, Pingchuan
    Moh, Teng-Sheng
    [J]. 2016 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2016, : 218 - 224
  • [8] E-mail, hold the spam
    Hoyle, J
    [J]. JOURNAL OF THE AMERICAN DENTAL ASSOCIATION, 2000, 131 (10): : 1426 - 1426
  • [9] Development of Proposed Ensemble Model for Spam e-mail Classification
    Shrivas, Akhilesh Kumar
    Dewangan, Amit Kumar
    Ghosh, S. M.
    Singh, Devendra
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (03): : 411 - 423
  • [10] The Research and Implementation of Spam E-mail Filtering Based on Improved Bayesian Algorithm
    Zhang, Sifa
    Zuo, Fengmei
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, 2008, : 132 - 135