URL filtering using machine learning algorithms

被引：0

作者：

Aljahdalic, Asia Othman ^{[1
]}

Banafee, Shoroq ^{[1
]}

Aljohani, Thana ^{[1
]}

机构：

[1] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia

来源：

INFORMATION SECURITY JOURNAL | 2024年 / 33卷 / 03期

关键词：

Classifier; detection; extracted feature; machine learning; URL phishing;

D O I：

10.1080/19393555.2023.2193350

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cyber-attacks using malicious uniform resource locator (URL) propagation are very common and serious. Statistics indicate that there is a need to research and apply techniques and methods for identifying and preventing malicious URLs. The main objective of this research is to train machine learning models on selected dataset to predict phishing websites based on URL-related features. The accuracy level of each model is measured and compared. Finally, the best performing model will be used to develop a web application that provide internet users with an easy way to check suspicious URLs. We have used five different machine learning models to classify URLs as legitimate or phishing, these models are eXtreme Gradient Boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), Decision Tree, and Random Forest. Finally, we used Voting Classifier to combine the work of Random Forest (RF) algorithm with other two models, Gaussian Naive Bayes, and Logistic Regression, to check if we can increase the accuracy of RF as suggested in the literature, but we found that the accuracy of RF alone was higher than the accuracy of the combined models. This project can be implemented as a browser extension or mobile application to classify suspicious URLs to legitimate or phishing with the use of the saved model.

引用

页码：193 / 203

页数：11

共 50 条

[21] Dynamic Packet Filtering Using Machine Learning
Chebrolu, Chandan Sai
Lung, Chung-Horng
Ajila, Samuel A.
[J]. 2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 206 - 211
[22] Machine learning techniques for automated web page classification using URL features
Devi, M. Indra
Rajaram, R.
Selvakuberan, K.
[J]. ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 116 - 118
[23] URL-Based Dynamic Monitoring of Android Malware using Machine Learning
Somarriba, Oscar
Urbina, Henry Jaentschke
[J]. PROCEEDINGS OF THE 2022 IEEE 40TH CENTRAL AMERICA AND PANAMA CONVENTION (CONCAPAN), 2022,
[24] Machine Learning Techniques for Detecting Phishing URL Attacks
Mosa, Diana T.
Shams, Mahmoud Y.
Abohany, Amr A.
El-kenawy, El-Sayed M.
Thabet, M.
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 1271 - 1290
[25] Privacy Preserving Machine Learning for Malicious URL Detection
Shaik, Imtiyazuddin
Emmadi, Nitesh
Tupsamudre, Harshal
Narumanchi, Harika
Bhattachar, Rajan Mindigal Alasingara
[J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS, 2021, 1479 : 31 - 41
[26] Machine Learning for Multiple Stage Phishing URL Prediction
Amen, Khalid
Zohdy, Mohamad
Mahmoud, Mohammed
[J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 794 - 800
[27] Revolutionizing Machine Learning Algorithms using GPUs
Sharma, Ritvik
Vinutha, M.
Moharir, Minal
[J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATION SYSTEM AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTIONS (CSITSS), 2016, : 318 - 323
[28] Diagnosis of diabetes using machine learning algorithms
Alaa Khaleel F.
Al-Bakry A.M.
[J]. Materials Today: Proceedings, 2023, 80 : 3200 - 3203
[29] Mindful Machine Learning Using Machine Learning Algorithms to Predict the Practice of Mindfulness
Sauer, Sebastian
Buettner, Ricardo
Heidenreich, Thomas
Lemke, Jana
Berg, Christoph
Kurz, Christoph
[J]. EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT, 2018, 34 (01) : 6 - 13
[30] Disturbance observer using machine learning algorithms
Han D.-K.
Fitri I.R.
Kim J.-S.
[J]. Kim, Jung-Su (jungsu@seoultech.ac.kr), 2018, Institute of Control, Robotics and Systems (24) : 386 - 392

← 1 2 3 4 5 →