A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators

被引：20

作者：

Aldakheel, Eman Abdullah ^{[1
]}

Zakariah, Mohammed ^{[2
]}

Gashgari, Ghada Abdalaziz ^{[3
]}

Almarshad, Fahdah A. ^{[4
]}

Alzahrani, Abdullah I. A. ^{[5
]}

机构：

[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11671, Saudi Arabia

[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 12372, Saudi Arabia

[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Ar Rabwah Jeddah 23449, Saudi Arabia

[4] Prince Sattam Bin Abdul Aziz Univ, Coll Comp Engn & Sci, Dept Informat Syst, Al Kharj 11942, Saudi Arabia

[5] Shaqra Univ, Coll Sci & Humanities Al Quwaiiyah, Dept Comp Sci, Shaqra 11961, Saudi Arabia

来源：

SENSORS | 2023年 / 23卷 / 09期

关键词：

phishing detection system; deep learning; convolutional neural network; PhishTank data set; URL analysis; machine-learning;

D O I：

10.3390/s23094403

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Organizations and individuals worldwide are becoming increasingly vulnerable to cyberattacks as phishing continues to grow and the number of phishing websites grows. As a result, improved cyber defense necessitates more effective phishing detection (PD). In this paper, we introduce a novel method for detecting phishing sites with high accuracy. Our approach utilizes a Convolution Neural Network (CNN)-based model for precise classification that effectively distinguishes legitimate websites from phishing websites. We evaluate the performance of our model on the PhishTank dataset, which is a widely used dataset for detecting phishing websites based solely on Uniform Resource Locators (URL) features. Our approach presents a unique contribution to the field of phishing detection by achieving high accuracy rates and outperforming previous state-of-the-art models. Experiment results revealed that our proposed method performs well in terms of accuracy and its false-positive rate. We created a real data set by crawling 10,000 phishing URLs from PhishTank and 10,000 legitimate websites and then ran experiments using standard evaluation metrics on the data sets. This approach is founded on integrated and deep learning (DL). The CNN-based model can distinguish phishing websites from legitimate websites with a high degree of accuracy. When binary-categorical loss and the Adam optimizer are used, the accuracy of the k-nearest neighbors (KNN), Natural Language Processing (NLP), Recurrent Neural Network (RNN), and Random Forest (RF) models is 87%, 97.98%, 97.4% and 94.26%, respectively, in contrast to previous publications. Our model outperformed previous works due to several factors, including the use of more layers and larger training sizes, and the extraction of additional features from the PhishTank dataset. Specifically, our proposed model comprises seven layers, starting with the input layer and progressing to the seventh, which incorporates a layer with pooling, convolutional, linear 1 and 2, and linear six layers as the output layers. These design choices contribute to the high accuracy of our model, which achieved a 98.77% accuracy rate.

引用

页数：27

共 50 条

[21] A new hybrid deep learning-based phishing detection system using MCS-DNN classifier
J. Anitha
M. Kalaiarasu
Neural Computing and Applications, 2022, 34 : 5867 - 5882
[22] Feature-Based Performance Comparison of Machine Learning Algorithms for Phishing Detection through Uniform Resource Locator
Savas, Taki
Savas, Serkan
JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2022, 25 (03): : 1261 - 1270
[23] Deep Learning-Based Efficient Model Development for Phishing Detection Using Random Forest and BLSTM Classifiers
Wang, Shan
Khan, Sulaiman
Xu, Chuyi
Nazir, Shah
Hafeez, Abdul
COMPLEXITY, 2020, 2020
[24] Feature-based performance comparison of machine learning algorithms for phishing detection through uniform resource locator
Savas, Taki
Savas, Serkan
JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2022,
[25] A new hybrid deep learning-based phishing detection system using MCS-DNN classifier
Anitha, J.
Kalaiarasu, M.
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08): : 5867 - 5882
[26] An intelligent cyber security phishing detection system using deep learning techniques
Ala Mughaid
Shadi AlZu’bi
Adnan Hnaif
Salah Taamneh
Asma Alnajjar
Esraa Abu Elsoud
Cluster Computing, 2022, 25 : 3819 - 3828
[27] An ensemble classification method based on machine learning models for malicious Uniform Resource Locators (URL)
Sankaranarayanan, Suresh
Sivachandran, Arvinthan Thevar
Khairuddin, Anis Salwa Mohd
Hasikin, Khairunnisa
Sait, Abdul Rahman Wahab
PLOS ONE, 2024, 19 (05):
[28] A novel deep learning-based intrusion detection system for IoT DDoS security
Hizal, Selman
Cavusoglu, Unal
Akgun, Devrim
INTERNET OF THINGS, 2024, 28
[29] DeepAID: Interpreting and Improving Deep Learning-based Anomaly Detection in Security Applications
Han, Dongqi
Wang, Zhiliang
Chen, Wenqi
Zhong, Ying
Wang, Su
Zhang, Han
Yang, Jiahai
Shi, Xingang
Yin, Xia
CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 3197 - 3217
[30] MDepthNet based phishing attack detection using integrated deep learning methodologies for cyber security enhancement
Yamarthy, Anil Kumar
Koteswararao, Ch
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): : 6377 - 6395

← 1 2 3 4 5 →