A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators

被引：20

作者：

Aldakheel, Eman Abdullah ^{[1
]}

Zakariah, Mohammed ^{[2
]}

Gashgari, Ghada Abdalaziz ^{[3
]}

Almarshad, Fahdah A. ^{[4
]}

Alzahrani, Abdullah I. A. ^{[5
]}

机构：

[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11671, Saudi Arabia

[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 12372, Saudi Arabia

[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Ar Rabwah Jeddah 23449, Saudi Arabia

[4] Prince Sattam Bin Abdul Aziz Univ, Coll Comp Engn & Sci, Dept Informat Syst, Al Kharj 11942, Saudi Arabia

[5] Shaqra Univ, Coll Sci & Humanities Al Quwaiiyah, Dept Comp Sci, Shaqra 11961, Saudi Arabia

来源：

SENSORS | 2023年 / 23卷 / 09期

关键词：

phishing detection system; deep learning; convolutional neural network; PhishTank data set; URL analysis; machine-learning;

D O I：

10.3390/s23094403

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Organizations and individuals worldwide are becoming increasingly vulnerable to cyberattacks as phishing continues to grow and the number of phishing websites grows. As a result, improved cyber defense necessitates more effective phishing detection (PD). In this paper, we introduce a novel method for detecting phishing sites with high accuracy. Our approach utilizes a Convolution Neural Network (CNN)-based model for precise classification that effectively distinguishes legitimate websites from phishing websites. We evaluate the performance of our model on the PhishTank dataset, which is a widely used dataset for detecting phishing websites based solely on Uniform Resource Locators (URL) features. Our approach presents a unique contribution to the field of phishing detection by achieving high accuracy rates and outperforming previous state-of-the-art models. Experiment results revealed that our proposed method performs well in terms of accuracy and its false-positive rate. We created a real data set by crawling 10,000 phishing URLs from PhishTank and 10,000 legitimate websites and then ran experiments using standard evaluation metrics on the data sets. This approach is founded on integrated and deep learning (DL). The CNN-based model can distinguish phishing websites from legitimate websites with a high degree of accuracy. When binary-categorical loss and the Adam optimizer are used, the accuracy of the k-nearest neighbors (KNN), Natural Language Processing (NLP), Recurrent Neural Network (RNN), and Random Forest (RF) models is 87%, 97.98%, 97.4% and 94.26%, respectively, in contrast to previous publications. Our model outperformed previous works due to several factors, including the use of more layers and larger training sizes, and the extraction of additional features from the PhishTank dataset. Specifically, our proposed model comprises seven layers, starting with the input layer and progressing to the seventh, which incorporates a layer with pooling, convolutional, linear 1 and 2, and linear six layers as the output layers. These design choices contribute to the high accuracy of our model, which achieved a 98.77% accuracy rate.

引用

页数：27

共 50 条

[1] A Deep Learning-Based Framework for Phishing Website Detection
Tang, Lizhen
Mahmoud, Qusay H.
IEEE ACCESS, 2022, 10 : 1509 - 1521
[2] Understanding phishers' strategies of mimicking uniform resource locators to leverage phishing attacks: A machine learning approach
Tharani, J. Samantha
Arachchilage, Nalin A. G.
SECURITY AND PRIVACY, 2020, 3 (05)
[3] Classifying and clustering malicious advertisement uniform resource locators using deep learning
Zhang, Xichen
Lashkari, Arash Habibi
Ghorbani, Ali A.
COMPUTATIONAL INTELLIGENCE, 2021, 37 (01) : 511 - 537
[4] Machine learning-based phishing attack detection
Hossain S.
Sarma D.
Chakma R.J.
International Journal of Advanced Computer Science and Applications, 2020, 11 (09): : 378 - 388
[5] Machine Learning-Based Phishing Attack Detection
Hossain, Sohrab
Sarma, Dhiman
Chakma, Rana Joyti
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 378 - 388
[6] An Efficient Spoofing Attack Detection Using Deep Learning-based Physical Layer Security Technique
Mohan, Swethambri
Annadurai, Atchaya
Gunaseelan, K.
DEFENCE SCIENCE JOURNAL, 2024, 74 (04) : 526 - 534
[7] A Hybrid Phishing Detection System Using Deep Learning-based URL and Content Analysis
Korkmaz, Mehmet
Kocyigit, Emre
Sahingoz, Ozgur Koray
Diri, Banu
ELEKTRONIKA IR ELEKTROTECHNIKA, 2022, 28 (05) : 80 - 89
[8] Phishing website detection: How effective are deep learning-based models and hyperparameter optimization
Almousa, May
Zhang, Tianyang
Sarrafzadeh, Abdolhossein
Anwar, Mohd
SECURITY AND PRIVACY, 2022, 5 (06):
[9] Deep Learning-Based Network Security Threat Detection and Defense
Chao, Jinjin
Xie, Tian
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (11) : 669 - 679
[10] Phishing Attacks Detection A Machine Learning-Based Approach
Salahdine, Fatima
El Mrabet, Zakaria
Kaabouch, Naima
2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 250 - 255

← 1 2 3 4 5 →