Email Filtering based on Supervised Learning and Mutual Information Feature Selection

Authors
Gad, Walaa [1]
Rady, Sherine [1]
Affiliation
[1] Ain Shams Univ, Dept Informat Syst, Fac Comp & Informat Sci, Cairo, Egypt
Keywords
email filtering; supervised learning; classification; mutual information; feature selection
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Electronic mail is one of today's most important ways to communicate and transfer information. Because it is delivered quickly and is easy to access, it is used in almost every aspect of communication at work and in daily life. However, the growing number of email users has led to a dramatic increase in spam over the past few years. In this paper, we propose an email-filtering approach based on a supervised classifier and mutual information. The proposed model has the advantage of combining supervised machine learning with feature selection. Term frequency (TF) is used to assign relevance weights to the words of each email class. We conduct experiments to compare six different classifiers. The results show that the proposed approach performs well in terms of precision, recall, and accuracy.
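The abstract does not give the exact formulas, so the following is only a minimal sketch of how mutual information feature selection and term-frequency weighting might fit together. It assumes binary term presence for the mutual information score, a four-email toy corpus, and hypothetical function names (`mutual_information`, `select_terms`, `tf_vector`); none of these come from the paper, and the classifiers it compares are not reproduced here.

```python
import math
from collections import Counter


def mutual_information(docs, labels, term):
    """Mutual information (in bits) between presence of `term` and the class label.

    Assumption: each document is a list of tokens and the term is treated
    as a binary feature (present / absent).
    """
    n = len(docs)
    joint = Counter((term in doc, label) for doc, label in zip(docs, labels))
    present = Counter(term in doc for doc in docs)
    classes = Counter(labels)
    mi = 0.0
    for (is_present, label), count in joint.items():
        p_joint = count / n
        p_term = present[is_present] / n
        p_class = classes[label] / n
        mi += p_joint * math.log2(p_joint / (p_term * p_class))
    return mi


def select_terms(docs, labels, k):
    """Return the k vocabulary terms with the highest mutual information."""
    vocab = sorted({word for doc in docs for word in doc})
    ranked = sorted(
        vocab,
        key=lambda word: mutual_information(docs, labels, word),
        reverse=True,
    )
    return ranked[:k]


def tf_vector(doc, terms):
    """Term-frequency weights of the selected terms within one document."""
    counts = Counter(doc)
    return [counts[t] / len(doc) for t in terms]


# Hypothetical toy corpus, for illustration only.
emails = [
    "win free prize now".split(),
    "free offer win cash".split(),
    "meeting agenda for monday".split(),
    "project meeting notes attached".split(),
]
labels = ["spam", "spam", "ham", "ham"]

selected = select_terms(emails, labels, k=3)
features = [tf_vector(doc, selected) for doc in emails]
```

The resulting `features` rows could then be passed, together with `labels`, to any supervised classifier. Scoring terms before weighting them keeps the feature vectors short, which is the usual motivation for combining feature selection with classification.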
Pages: 147-152
Page count: 6