URL filtering using machine learning algorithms

被引:0
|
作者
Aljahdalic, Asia Othman [1 ]
Banafee, Shoroq [1 ]
Aljohani, Thana [1 ]
机构
[1] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
来源
INFORMATION SECURITY JOURNAL | 2024年 / 33卷 / 03期
关键词
Classifier; detection; extracted feature; machine learning; URL phishing;
D O I
10.1080/19393555.2023.2193350
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyber-attacks using malicious uniform resource locator (URL) propagation are very common and serious. Statistics indicate that there is a need to research and apply techniques and methods for identifying and preventing malicious URLs. The main objective of this research is to train machine learning models on selected dataset to predict phishing websites based on URL-related features. The accuracy level of each model is measured and compared. Finally, the best performing model will be used to develop a web application that provide internet users with an easy way to check suspicious URLs. We have used five different machine learning models to classify URLs as legitimate or phishing, these models are eXtreme Gradient Boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), Decision Tree, and Random Forest. Finally, we used Voting Classifier to combine the work of Random Forest (RF) algorithm with other two models, Gaussian Naive Bayes, and Logistic Regression, to check if we can increase the accuracy of RF as suggested in the literature, but we found that the accuracy of RF alone was higher than the accuracy of the combined models. This project can be implemented as a browser extension or mobile application to classify suspicious URLs to legitimate or phishing with the use of the saved model.
引用
收藏
页码:193 / 203
页数:11
相关论文
共 50 条
  • [21] Dynamic Packet Filtering Using Machine Learning
    Chebrolu, Chandan Sai
    Lung, Chung-Horng
    Ajila, Samuel A.
    [J]. 2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 206 - 211
  • [22] Machine learning techniques for automated web page classification using URL features
    Devi, M. Indra
    Rajaram, R.
    Selvakuberan, K.
    [J]. ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 116 - 118
  • [23] URL-Based Dynamic Monitoring of Android Malware using Machine Learning
    Somarriba, Oscar
    Urbina, Henry Jaentschke
    [J]. PROCEEDINGS OF THE 2022 IEEE 40TH CENTRAL AMERICA AND PANAMA CONVENTION (CONCAPAN), 2022,
  • [24] Machine Learning Techniques for Detecting Phishing URL Attacks
    Mosa, Diana T.
    Shams, Mahmoud Y.
    Abohany, Amr A.
    El-kenawy, El-Sayed M.
    Thabet, M.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 1271 - 1290
  • [25] Privacy Preserving Machine Learning for Malicious URL Detection
    Shaik, Imtiyazuddin
    Emmadi, Nitesh
    Tupsamudre, Harshal
    Narumanchi, Harika
    Bhattachar, Rajan Mindigal Alasingara
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS, 2021, 1479 : 31 - 41
  • [26] Machine Learning for Multiple Stage Phishing URL Prediction
    Amen, Khalid
    Zohdy, Mohamad
    Mahmoud, Mohammed
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 794 - 800
  • [27] Revolutionizing Machine Learning Algorithms using GPUs
    Sharma, Ritvik
    Vinutha, M.
    Moharir, Minal
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATION SYSTEM AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTIONS (CSITSS), 2016, : 318 - 323
  • [28] Diagnosis of diabetes using machine learning algorithms
    Alaa Khaleel F.
    Al-Bakry A.M.
    [J]. Materials Today: Proceedings, 2023, 80 : 3200 - 3203
  • [29] Mindful Machine Learning Using Machine Learning Algorithms to Predict the Practice of Mindfulness
    Sauer, Sebastian
    Buettner, Ricardo
    Heidenreich, Thomas
    Lemke, Jana
    Berg, Christoph
    Kurz, Christoph
    [J]. EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT, 2018, 34 (01) : 6 - 13
  • [30] Disturbance observer using machine learning algorithms
    Han D.-K.
    Fitri I.R.
    Kim J.-S.
    [J]. Kim, Jung-Su (jungsu@seoultech.ac.kr), 2018, Institute of Control, Robotics and Systems (24) : 386 - 392