A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators

被引:20
|
作者
Aldakheel, Eman Abdullah [1 ]
Zakariah, Mohammed [2 ]
Gashgari, Ghada Abdalaziz [3 ]
Almarshad, Fahdah A. [4 ]
Alzahrani, Abdullah I. A. [5 ]
机构
[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11671, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 12372, Saudi Arabia
[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Ar Rabwah Jeddah 23449, Saudi Arabia
[4] Prince Sattam Bin Abdul Aziz Univ, Coll Comp Engn & Sci, Dept Informat Syst, Al Kharj 11942, Saudi Arabia
[5] Shaqra Univ, Coll Sci & Humanities Al Quwaiiyah, Dept Comp Sci, Shaqra 11961, Saudi Arabia
关键词
phishing detection system; deep learning; convolutional neural network; PhishTank data set; URL analysis; machine-learning;
D O I
10.3390/s23094403
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Organizations and individuals worldwide are becoming increasingly vulnerable to cyberattacks as phishing continues to grow and the number of phishing websites grows. As a result, improved cyber defense necessitates more effective phishing detection (PD). In this paper, we introduce a novel method for detecting phishing sites with high accuracy. Our approach utilizes a Convolution Neural Network (CNN)-based model for precise classification that effectively distinguishes legitimate websites from phishing websites. We evaluate the performance of our model on the PhishTank dataset, which is a widely used dataset for detecting phishing websites based solely on Uniform Resource Locators (URL) features. Our approach presents a unique contribution to the field of phishing detection by achieving high accuracy rates and outperforming previous state-of-the-art models. Experiment results revealed that our proposed method performs well in terms of accuracy and its false-positive rate. We created a real data set by crawling 10,000 phishing URLs from PhishTank and 10,000 legitimate websites and then ran experiments using standard evaluation metrics on the data sets. This approach is founded on integrated and deep learning (DL). The CNN-based model can distinguish phishing websites from legitimate websites with a high degree of accuracy. When binary-categorical loss and the Adam optimizer are used, the accuracy of the k-nearest neighbors (KNN), Natural Language Processing (NLP), Recurrent Neural Network (RNN), and Random Forest (RF) models is 87%, 97.98%, 97.4% and 94.26%, respectively, in contrast to previous publications. Our model outperformed previous works due to several factors, including the use of more layers and larger training sizes, and the extraction of additional features from the PhishTank dataset. Specifically, our proposed model comprises seven layers, starting with the input layer and progressing to the seventh, which incorporates a layer with pooling, convolutional, linear 1 and 2, and linear six layers as the output layers. These design choices contribute to the high accuracy of our model, which achieved a 98.77% accuracy rate.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] The application of deep learning-based technique detection model in table tennis teaching and learning
    He, Shunshui
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [32] A Deep Learning for Arabic SMS Phishing Based on URLs Detection
    Alsufyani, Sadeem
    Alajmani, Samah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 388 - 396
  • [33] Deep learning-based fall detection
    Chiang, Jason Wei Hoe
    Zhang, Li
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 891 - 898
  • [34] A Deep Learning-Based Innovative Points Extraction Method
    Yu, Tao
    Wang, Rui
    Zhan, Hongfei
    Lin, Yingjun
    Yu, Junhe
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 130 - 138
  • [35] Phishing uniform resource locator detection using machine learning: A step towards secure system
    Mahajan, Shilpa
    SECURITY AND PRIVACY, 2023, 6 (06)
  • [36] Phishing Webpage Classification via Deep Learning-Based Algorithms: An Empirical Study
    Nguyet Quang Do
    Selamat, Ali
    Krejcar, Ondrej
    Yokoi, Takeru
    Fujita, Hamido
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [37] Optimal Deep Learning-based Cyberattack Detection and Classification Technique on Social Networks
    Albraikan, Amani Abdulrahman
    Hassine, Siwar Ben Haj
    Fati, Suliman Mohamed
    Al-Wesabi, Fahd N.
    Hilal, Anwer Mustafa
    Motwakel, Abdelwahed
    Hamza, Manar Ahmed
    Al Duhayyim, Mesfer
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 907 - 923
  • [38] An enhanced deep learning-based phishing detection mechanism to effectively identify malicious URLs using variational autoencoders
    Prabakaran, Manoj Kumar
    Chandrasekar, Abinaya Devi
    Meenakshi Sundaram, Parvathy
    IET INFORMATION SECURITY, 2023, 17 (03) : 423 - 440
  • [39] A Deep Learning-Based Password Security Evaluation Model
    Hong, Ki Hyeon
    Lee, Byung Mun
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [40] Security Consideration for Deep Learning-Based Image Forensics
    Zhao, Wei
    Yang, Pengpeng
    Ni, Rongrong
    Zhao, Yao
    Wu, Haorui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12): : 3263 - 3266