Deep learning based phishing website identification system using CNN-LSTM classifier

Cited by: 7
Authors
Sapkal, Vinod [1]
Gupta, Praveen [2]
Khan, Aboo Bakar [3]
Affiliations
[1] CSMU Navi Mumbai, Dept Comp Sci & Engn, Panvel, Maharashtra, India
[2] CSMU Navi Mumbai, Dept Comp Sci & Informat Technol, Panvel, Maharashtra, India
[3] CSMU Navi Mumbai, Dept Elect & Elect Engn, Panvel, Maharashtra, India
Source
Keywords
Phishing; CNN; LSTM; NLP; MACHINE;
DOI
10.47974/JIOS-1343
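Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work];
Discipline classification code
1205; 120501;
Abstract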
Phishing is an attack in which a fraudulent website impersonates that of a large organization, typically one that handles money, such as a bank, another financial institution, or an online retailer. Its primary objective is to obtain personally identifiable information from users, such as social security numbers, credit card details, and passwords. As phishing attacks have increased, various techniques have been developed to combat them. One of these is deep learning, whose algorithms can learn from and analyze massive datasets, which makes them useful for identifying and preventing phishing attacks. Because phishing websites are complex, many systems have been developed to detect them. Unfortunately, these systems do not achieve the desired output, and they have a number of other flaws as well. This paper proposes a hybrid deep learning-based phishing detection system that is easy to put into practice. The input dataset is first preprocessed to improve its quality. Clustering and feature selection are then carried out to improve accuracy and reduce processing time. The resulting features are fed into a CNN-LSTM classifier, which labels websites as phishing or legitimate. The proposed hybrid deep learning models combine natural language processing (NLP) features with character embedding, which allows them to reveal high-level connections between characters. On the evaluation metric used, the proposed models perform better than the other models.
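As a rough illustration of the character-level input that a character embedding layer would consume, the sketch below maps a URL string to a fixed-length sequence of integer indices. It is not the authors' implementation: the abstract does not state that URLs are the model input, nor does it give a vocabulary or sequence length, so `VOCAB`, `encode_url`, and `max_len` are hypothetical names and values chosen only for this example.

```python
import string
from typing import List

# Assumed vocabulary (not from the paper): lowercase letters, digits, punctuation.
# Index 0 is reserved for padding and index 1 for unknown characters.
PAD, UNK = 0, 1
ALPHABET = string.ascii_lowercase + string.digits + string.punctuation
VOCAB = {ch: i + 2 for i, ch in enumerate(ALPHABET)}


def encode_url(url: str, max_len: int = 32) -> List[int]:
    """Map a URL to a fixed-length list of integer character indices."""
    # Lowercase, truncate to max_len, and look up each character.
    ids = [VOCAB.get(ch, UNK) for ch in url.lower()[:max_len]]
    # Right-pad with PAD so every sequence has the same length.
    return ids + [PAD] * (max_len - len(ids))


if __name__ == "__main__":
    encoded = encode_url("http://example.com/login")
    print(len(encoded), encoded[:8])
```

In a CNN-LSTM model, index sequences like these would typically pass through an embedding layer, then convolutional layers that pick up local character patterns, then an LSTM that models longer-range order. The exact layer arrangement used in the paper is not given in the abstract.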
Pages: 315-330
Number of pages: 16