Feature-based performance comparison of machine learning algorithms for phishing detection through uniform resource locator

被引:1
|
作者
Savas, Taki [1 ]
Savas, Serkan [2 ]
机构
[1] Interprobe Intelligence & Analyt Ankara, Ankara, Turkey
[2] Cankiri Karatekin Univ, Muhendisl Fak, Bilgisayar Muh Bolumu, Cankiri, Turkey
关键词
Cybersecurity; phishing; machine learning; domain; cyber-attack detection; CLASSIFICATION; MODEL;
D O I
10.2339/politeknik.1035286
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recently, phishing attacks are very common. Such attacks are carried out with the aim of obtaining personal information of individuals or defrauding individuals. There are multiple types of phishing attacks. One of these types is the common attacks carried out through the uniform resource locator (URL). The purpose of this study is to classify whether URL addresses are malicious or not using different machine learning algorithms. Eight different machine learning algorithms including support vector machines, random forest, Gaussian Naive Bayes, logistic regression, k-nearest neighbor, decision trees, multilayer perceptrons and XGBoost algorithms were used in the study. Data were obtained from USOM, Alexa, and Phishtank to be used for training and testing purposes. Feature extraction was performed limited by applying various data pre-processing steps to these data. As a result of the research, the accuracy of 99.8% in more than one model has been achieved, and the success of machine learning algorithms in this area has been proven.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Feature-Based Performance Comparison of Machine Learning Algorithms for Phishing Detection through Uniform Resource Locator
    Savas, Taki
    Savas, Serkan
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2022, 25 (03): : 1261 - 1270
  • [2] Phishing uniform resource locator detection using machine learning: A step towards secure system
    Mahajan, Shilpa
    SECURITY AND PRIVACY, 2023, 6 (06)
  • [3] Detection of phishing websites using an efficient feature-based machine learning framework
    Routhu Srinivasa Rao
    Alwyn Roshan Pais
    Neural Computing and Applications, 2019, 31 : 3851 - 3873
  • [4] Detection of phishing websites using an efficient feature-based machine learning framework
    Rao, Routhu Srinivasa
    Pais, Alwyn Roshan
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08): : 3851 - 3873
  • [5] Modeling Hybrid Feature-Based Phishing Websites Detection Using Machine Learning Techniques
    Das Guptta S.
    Shahriar K.T.
    Alqahtani H.
    Alsalman D.
    Sarker I.H.
    Annals of Data Science, 2024, 11 (01) : 217 - 242
  • [6] A Comparison of Machine Learning Algorithms for Multilingual Phishing Detection
    Staples, Dakota
    Hakak, Saqib
    Cook, Paul
    2023 20TH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST, PST, 2023, : 399 - 404
  • [7] Phishing Hybrid Feature-Based Classifier by Using Recursive Features Subset Selection and Machine Learning Algorithms
    Zuhair, Hiba
    Selamat, Ali
    RECENT TRENDS IN DATA SCIENCE AND SOFT COMPUTING, IRICT 2018, 2019, 843 : 267 - 277
  • [8] Detecting Phishing Websites Using an Efficient Feature-based Machine Learning Framework
    Sundaram, K. Mohana
    Sasikumar, R.
    Meghana, Atthipalli Sai
    Anuja, Arava
    Praneetha, Chandolu
    REVISTA GEINTEC-GESTAO INOVACAO E TECNOLOGIAS, 2021, 11 (02): : 2106 - 2112
  • [9] Phishing detection based on machine learning and feature selection methods
    Almseidin M.
    Abu Zuraiq A.M.
    Al-kasassbeh M.
    Alnidami N.
    International Journal of Interactive Mobile Technologies, 2019, 13 (12) : 71 - 183
  • [10] Feature Selections for the Machine Learning based Detection of Phishing Websites
    Buber, Ebubekir
    Demir, Onder
    Sahingoz, Ozgur Koray
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,