The Role of Feature Selection in Machine Learning for Detection of Spam and Phishing Attacks

被引:4
|
作者
Salihovic, Ina [1 ]
Serdarevic, Haris [1 ]
Kevric, Jasmin [1 ]
机构
[1] Int Burch Univ, Sarajevo 71000, Bosnia & Herceg
关键词
Phishing; Spam emails; Machine learning; Feature selection;
D O I
10.1007/978-3-030-02577-9_47
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the increase in Internet use throughout the world, expansion in network security is indispensable since it decreases the chances of privacy spoofing, identity or information theft and bank frauds. Two of the most frequent network security breaches involve phishing and spam emails as they are an easy way to pass a virus or a malicious site, which can lead to extensive frauds. Despite the fact that there is an abundance of tools for detection and blocking of these types of messages and websites, society is still trying to combat and rise above said problem. The purpose of this paper was to exclude the human factor in security breaches executed in this manner with the use of various machine learning algorithms. For the purpose of training and testing of the most successful algorithms (Random Forest, k-Nearest Neighbor, Artificial Neural Network, Support Vector Machine, Logistic Regression, Naive Bayes) paper used two separate bases, UCIs Phishing Websites Data Set and Spam Emails Dataset together with Weka software, and found that the best results for both of them are achieved with the Random Forest algorithm. However, databases responded differently to feature selection algorithms, as the best result for phishing (97.33% accuracy) was accomplished through Ranker + Principal Components Optimization, and the best result for spam (94.24% accuracy) was accomplished through BestFirst + CfsSubsEval Optimization in Weka. These findings provide a base platform for future work towards a faster and more accurate online fraud detection.
引用
收藏
页码:476 / 483
页数:8
相关论文
共 50 条
  • [1] Feature Selection Approach for Phishing Detection Based on Machine Learning
    Wei, Yi
    Sekiya, Yuji
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON APPLIED CYBER SECURITY (ACS) 2021, 2022, 378 : 61 - 70
  • [2] Using machine learning to deal with Phishing and Spam Detection: An overview
    El Kouari, Oumaima
    Benaboud, Hafssa
    Lazaar, Saiida
    [J]. 3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,
  • [3] Explainable machine learning for phishing feature detection
    Calzarossa, Maria Carla
    Giudici, Paolo
    Zieni, Rasha
    [J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2024, 40 (01) : 362 - 373
  • [4] Phishing Website Detection Using Machine Learning Classifiers Optimized by Feature Selection
    Mehanovic, Dzelila
    Kevric, Jasmin
    [J]. TRAITEMENT DU SIGNAL, 2020, 37 (04) : 563 - 569
  • [5] A Genetic Programming Approach to Feature Selection and Construction for Ransomware, Phishing and Spam Detection
    Al-Sahaf, Harith
    Welch, Ian
    [J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 332 - 333
  • [6] Phishing Attacks Detection using Machine Learning and Deep Learning Models
    Aljabri, Malak
    Mirza, Samiha
    [J]. 2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 175 - 180
  • [7] Phishing Attacks Detection A Machine Learning-Based Approach
    Salahdine, Fatima
    El Mrabet, Zakaria
    Kaabouch, Naima
    [J]. 2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 250 - 255
  • [8] Phishing Attacks Detection Using Ensemble Machine Learning Algorithms
    Innab, Nisreen
    Osman, Ahmed Abdelgader Fadol
    Ataelfadiel, Mohammed Awad Mohammed
    Abu-Zanona, Marwan
    Elzaghmouri, Bassam Mohammad
    Zawaideh, Farah H.
    Alawneh, Mouiad Fadeil
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1325 - 1345
  • [9] Unveiling suspicious phishing attacks: enhancing detection with an optimal feature vectorization algorithm and supervised machine learning
    Tamal, Maruf A.
    Islam, Md K.
    Bhuiyan, Touhid
    Sattar, Abdus
    Prince, Nayem Uddin
    [J]. FRONTIERS IN COMPUTER SCIENCE, 2024, 6
  • [10] Feature Selections for the Machine Learning based Detection of Phishing Websites
    Buber, Ebubekir
    Demir, Onder
    Sahingoz, Ozgur Koray
    [J]. 2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,