Hybrid momentum accelerated bat algorithm with GWO based optimization approach for spam classification

Authors
Pradip Dhal
Chandrashekhar Azad
Affiliations
[1] National Institute of Technology, Department of Computer Science and Engineering, ITER
[2] Siksha 'O' Anusandhan (Deemed to Be University), Department of Computer Science and Engineering
Keywords
Spam detection; Feature selection; Bat algorithm; Grey wolf optimization
DOI
Not available
Abstract
Spam emails have become more prevalent, necessitating more effective and reliable anti-spam filters. Spam exposes Internet users to security threats and youngsters to inappropriate content. The enormous data flow between billions of people and the very large number of features (attributes) make the filtering task tedious and complex. When the data are high dimensional (i.e., the number of features is very large), a Feature Selection (FS) technique is essential for overcoming accuracy, time, and space complexity problems. Various researchers have successfully filtered and detected spam emails using Machine Learning (ML) methods. This work proposes a hybrid binary Metaheuristic Algorithm (MA) based FS approach for classifying email spam. The proposed FS approach combines two MAs: the Bat Algorithm (BA) and Grey Wolf Optimization (GWO). A novel concept of bat momentum is introduced, replacing the bat velocity of the original BA. The two quantities, velocity and momentum, have entirely different effects on the particles (i.e., bats), although they always point in the same direction. To provide the best possible set of features for the FS process, the proposed approach uses an amalgamation technique to reach both the global and the local optimum solutions. To obtain the global optimum solution, a new momentum-based equation is added to the BA in place of its velocity equation. The GWO property is incorporated into this momentum-based equation to improve the search capability of the FS process. A novel convergence timer is also introduced, which can eliminate convergence issues in the iterative algorithm if they arise. A novel GWO-based Lévy flight update is introduced to produce the local optimum solution.
We evaluated the proposed method on two benchmark spam corpora (Spambase and SpamAssassin) with significantly different properties. The proposed FS approach was tested with various classification and clustering algorithms to check its robustness and how the model behaves on unseen data. Compared with multiple state-of-the-art and existing approaches, the proposed method is superior in boosting classification accuracy while minimizing the number of selected features and the misclassification of legitimate emails as spam.
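The abstract does not state the fitness function or the binarization step used by the proposed method. As a rough illustration only, the Python sketch below shows one formulation that is common in wrapper-based binary metaheuristic feature selection: a sigmoid transfer function turns a continuous position into a 0/1 feature mask, and the fitness is a weighted sum of classification error and the fraction of features kept. The names `wrapper_fitness`, `binarize`, `toy_error`, and the weight `alpha` are hypothetical and are not taken from the paper; `toy_error` is a made-up stand-in for a real classifier's validation error.

```python
import math
import random


def wrapper_fitness(mask, error_rate, alpha=0.99):
    """Weighted sum of classification error and feature ratio (lower is better).

    `alpha` is an assumed trade-off weight, not a value from the paper.
    """
    n_selected = sum(mask)
    if n_selected == 0:
        return float("inf")  # an empty subset cannot train a classifier
    return alpha * error_rate(mask) + (1.0 - alpha) * n_selected / len(mask)


def binarize(position, rng):
    """Sigmoid transfer: map a continuous position vector to a 0/1 mask."""
    mask = []
    for x in position:
        x = max(-50.0, min(50.0, x))  # clamp to keep math.exp from overflowing
        prob = 1.0 / (1.0 + math.exp(-x))
        mask.append(1 if rng.random() < prob else 0)
    return mask


def toy_error(mask):
    """Hypothetical error model: only features 0 and 2 reduce the error."""
    relevant = (0, 2)
    return 0.5 - 0.2 * sum(mask[i] for i in relevant)


if __name__ == "__main__":
    rng = random.Random(0)
    candidate = binarize([2.0, -2.0, 2.0, -2.0], rng)
    print(candidate, wrapper_fitness(candidate, toy_error))
```

In a real experiment, `toy_error` would be replaced by the cross-validated error of a classifier trained on the selected columns, and the position vector would come from the optimizer's update equations, which are defined in the paper itself rather than here.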
Pages: 26929 - 26969
Page count: 40