Feature-based performance comparison of machine learning algorithms for phishing detection through uniform resource locator

被引:1
|
作者
Savas, Taki [1 ]
Savas, Serkan [2 ]
机构
[1] Interprobe Intelligence & Analyt Ankara, Ankara, Turkey
[2] Cankiri Karatekin Univ, Muhendisl Fak, Bilgisayar Muh Bolumu, Cankiri, Turkey
关键词
Cybersecurity; phishing; machine learning; domain; cyber-attack detection; CLASSIFICATION; MODEL;
D O I
10.2339/politeknik.1035286
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recently, phishing attacks are very common. Such attacks are carried out with the aim of obtaining personal information of individuals or defrauding individuals. There are multiple types of phishing attacks. One of these types is the common attacks carried out through the uniform resource locator (URL). The purpose of this study is to classify whether URL addresses are malicious or not using different machine learning algorithms. Eight different machine learning algorithms including support vector machines, random forest, Gaussian Naive Bayes, logistic regression, k-nearest neighbor, decision trees, multilayer perceptrons and XGBoost algorithms were used in the study. Data were obtained from USOM, Alexa, and Phishtank to be used for training and testing purposes. Feature extraction was performed limited by applying various data pre-processing steps to these data. As a result of the research, the accuracy of 99.8% in more than one model has been achieved, and the success of machine learning algorithms in this area has been proven.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Recent Research on Phishing Detection Through Machine Learning Algorithm
    Quang, Do Nguyet
    Selamat, Ali
    Krejcar, Ondrej
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2021, 12798 LNAI : 495 - 508
  • [32] Adversarial Autoencoder Data Synthesis for Enhancing Machine Learning-Based Phishing Detection Algorithms
    Shirazi, Hossein
    Muramudalige, Shashika R.
    Ray, Indrakshi
    Jayasumana, Anura P.
    Wang, Haonan
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (04) : 2411 - 2422
  • [33] A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators
    Aldakheel, Eman Abdullah
    Zakariah, Mohammed
    Gashgari, Ghada Abdalaziz
    Almarshad, Fahdah A.
    Alzahrani, Abdullah I. A.
    SENSORS, 2023, 23 (09)
  • [34] Phishing Website Detection Based on Machine Learning: A Survey
    Singh, Charu
    Meenu
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 398 - 404
  • [35] Spear Phishing Emails Detection Based on Machine Learning
    Ding, Xiong
    Liu, Baoxu
    Jiang, Zhengwei
    Wang, Qiuyun
    Xin, Liling
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 354 - 359
  • [36] Machine learning-based phishing attack detection
    Hossain S.
    Sarma D.
    Chakma R.J.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (09): : 378 - 388
  • [37] Machine Learning-Based Phishing Attack Detection
    Hossain, Sohrab
    Sarma, Dhiman
    Chakma, Rana Joyti
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 378 - 388
  • [38] Machine Learning Based Phishing Web Sites Detection
    Huu Hieu Nguyen
    Duc Thai Nguyen
    AETA 2015: RECENT ADVANCES IN ELECTRICAL ENGINEERING AND RELATED SCIENCES, 2016, 371 : 123 - 131
  • [39] Enhancing Phishing Detection: A Machine Learning Approach With Feature Selection and Deep Learning Models
    Nayak, Ganesh S.
    Muniyal, Balachandra
    Belavagi, Manjula C.
    IEEE ACCESS, 2025, 13 : 33308 - 33320
  • [40] Sarcasm Detection in Tweets: A Feature-based Approach using Supervised Machine Learning Models
    Rahaman, Arifur
    Kuri, Ratnadip
    Islam, Syful
    Hossain, Md Javed
    Kabir, Mohammed Humayun
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (06) : 454 - 460