Feature-based performance comparison of machine learning algorithms for phishing detection through uniform resource locator

被引:1
|
作者
Savas, Taki [1 ]
Savas, Serkan [2 ]
机构
[1] Interprobe Intelligence & Analyt Ankara, Ankara, Turkey
[2] Cankiri Karatekin Univ, Muhendisl Fak, Bilgisayar Muh Bolumu, Cankiri, Turkey
关键词
Cybersecurity; phishing; machine learning; domain; cyber-attack detection; CLASSIFICATION; MODEL;
D O I
10.2339/politeknik.1035286
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recently, phishing attacks are very common. Such attacks are carried out with the aim of obtaining personal information of individuals or defrauding individuals. There are multiple types of phishing attacks. One of these types is the common attacks carried out through the uniform resource locator (URL). The purpose of this study is to classify whether URL addresses are malicious or not using different machine learning algorithms. Eight different machine learning algorithms including support vector machines, random forest, Gaussian Naive Bayes, logistic regression, k-nearest neighbor, decision trees, multilayer perceptrons and XGBoost algorithms were used in the study. Data were obtained from USOM, Alexa, and Phishtank to be used for training and testing purposes. Feature extraction was performed limited by applying various data pre-processing steps to these data. As a result of the research, the accuracy of 99.8% in more than one model has been achieved, and the success of machine learning algorithms in this area has been proven.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Phishing Detection: A Recent Intelligent Machine Learning Comparison based on Models Content and Features
    Abdelhamid, Neda
    Thabtah, Fadi
    Abdel-jaber, Hussein
    2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2017, : 72 - 77
  • [42] Image feature-based electric vehicle detection and classification system using machine learning
    Kim S.
    Kang S.-J.
    Kang, Suk-Ju (sjkang@sogang.ac.kr), 1600, Korean Institute of Electrical Engineers (66): : 1092 - 1099
  • [43] Phishing Website Detection Using Machine Learning Classifiers Optimized by Feature Selection
    Mehanovic, Dzelila
    Kevric, Jasmin
    TRAITEMENT DU SIGNAL, 2020, 37 (04) : 563 - 569
  • [44] A new hybrid ensemble feature selection framework for machine learning-based phishing detection system
    Chiew, Kang Leng
    Tan, Choon Lin
    Wong, KokSheik
    Yong, Kelvin S. C.
    Tiong, Wei King
    INFORMATION SCIENCES, 2019, 484 : 153 - 166
  • [45] Enhancing Arabic Phishing Email Detection: A Hybrid Machine Learning Based on Genetic Algorithm Feature Selection
    Alsuwaylimi, Amjad A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 312 - 325
  • [46] Enhanced Feature Selection Using Genetic Algorithm for Machine-Learning-Based Phishing URL Detection
    Kocyigit, Emre
    Korkmaz, Mehmet
    Sahingoz, Ozgur Koray
    Diri, Banu
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [47] Research on Fatigue Driving Feature Detection Algorithms of drivers based on machine learning
    Hou Zhongwei
    Ou Shuangjiang
    Xu Dengyuan
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2021, 9 (01) : 167 - 172
  • [48] Feature Selection and Intrusion Detection in Cloud Environment based on Machine Learning Algorithms
    Javadpour, Amir
    Abharian, Sanaz Kazemi
    Wang, Guojun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 1417 - 1421
  • [49] A Survey of Feature Selection for Vulnerability Prediction Using Feature-based Machine Learning
    Li, ZhanJun
    Shao, Yan
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 30 - 36
  • [50] A Machine-learning based Unbiased Phishing Detection Approach
    Shirazi, Hossein
    Zweigle, Landon
    Ray, Indrakshi
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS (SECRYPT), VOL 1, 2020, : 423 - 430