URL filtering using machine learning algorithms

Citations: 0
Authors
Aljahdali, Asia Othman [1]
Banafee, Shoroq [1]
Aljohani, Thana [1]
Affiliations
[1] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
Source
INFORMATION SECURITY JOURNAL | 2024 / Vol. 33 / Issue 03
Keywords
Classifier; detection; extracted feature; machine learning; URL phishing
DOI
10.1080/19393555.2023.2193350
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Cyber-attacks that spread through malicious uniform resource locators (URLs) are common and serious. Statistics indicate a need to research and apply techniques for identifying and blocking malicious URLs. The main objective of this research is to train machine learning models on a selected dataset to predict phishing websites from URL-related features. The accuracy of each model is measured and compared. The best-performing model will then be used to develop a web application that provides internet users with an easy way to check suspicious URLs. We used five machine learning models to classify URLs as legitimate or phishing: eXtreme Gradient Boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), Decision Tree, and Random Forest (RF). Finally, we used a Voting Classifier to combine RF with two other models, Gaussian Naive Bayes and Logistic Regression, to check whether this would increase the accuracy of RF as suggested in the literature; we found that the accuracy of RF alone was higher than that of the combined models. This project can be implemented as a browser extension or mobile application that uses the saved model to classify suspicious URLs as legitimate or phishing.
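The abstract does not list the URL-related features or give implementation details, so the following is only a minimal sketch of the two ideas it names: deriving features from a URL string and combining several models' predictions by majority (hard) voting. The feature set and the function names `extract_url_features` and `majority_vote` are illustrative assumptions, not the authors' method.

```python
import re
from collections import Counter
from urllib.parse import urlparse


def extract_url_features(url: str) -> dict:
    """Derive simple lexical features from a URL string.

    The feature choice is an assumption for illustration; the paper's
    actual feature set is not given in the abstract.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_hyphens": host.count("-"),
        "has_at_symbol": "@" in url,
        "uses_https": parsed.scheme == "https",
        # True when the host is written as a dotted IPv4 address.
        "host_is_ipv4": bool(re.fullmatch(r"(\d{1,3}\.){3}\d{1,3}", host)),
        # Number of non-empty path segments.
        "path_depth": len([part for part in parsed.path.split("/") if part]),
    }


def majority_vote(labels: list) -> str:
    """Return the most common label among the individual model predictions."""
    return Counter(labels).most_common(1)[0][0]


if __name__ == "__main__":
    print(extract_url_features("http://192.168.0.1/login@secure/update.php"))
    # Hypothetical predictions from three separately trained models.
    print(majority_vote(["phishing", "legitimate", "phishing"]))
```

In practice the feature dictionaries would be converted to numeric vectors and passed to trained classifiers; the voting helper only shows how hard voting reduces several predicted labels to one.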
Pages: 193-203
Page count: 11