Identification of Phishing URLs Using Machine Learning Models

被引:0
|
作者
Vivek, Meghashyam [1 ]
Premjith, Nithin [1 ]
Johnson, Aaron Antonio [1 ]
Maurya, Ashutosh Kumar [1 ]
Jingle, I. Diana Jeba [1 ]
机构
[1] Christ, Bangalore, Karnataka, India
关键词
XGBoost; Phishing; Prediction; Machine learning; Classifier;
D O I
10.1007/978-981-99-9043-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we provide a machine learning-based method for identifying phishing URLs. Sixteen features, including Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label, were extracted from the 600,000 URLs we gathered as a dataset of legitimate and phishing URLs. We then used this dataset to train a variety of machine learning models. These included standalone models such Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models like Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU) and Convolutional Neural Networks (CNN). On evaluation of performance metrics like accuracy, precision, recall, train time and prediction time it was found that XGBoost provides the best performance across all categories.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 50 条
  • [31] Prediction of phishing websites using machine learning
    Mithilesh Kumar Pandey
    Munindra Kumar Singh
    Saurabh Pal
    B. B. Tiwari
    Spatial Information Research, 2023, 31 : 157 - 166
  • [32] Addressing Phishing Threats Using A Metaheuristic Perspective On Machine Learning Classification Models Code
    Hu, Bo
    Zhang, SaiNan
    Journal of Applied Science and Engineering, 2024, 28 (07): : 1503 - 1514
  • [33] A Novel Algorithm to Detect Phishing URLs
    Hawanna, Varsharani Ramdas
    Kulkarni, V. Y.
    Rane, R. A.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 548 - 552
  • [34] An ensemble learning approach for detecting phishing URLs in encrypted TLS traffic
    Cheemaladinne Kondaiah
    Alwyn Roshan Pais
    Routhu Srinivasa Rao
    Telecommunication Systems, 2024, 87 (4) : 1015 - 1031
  • [35] Detection of Phishing Websites by Investigating Their URLs using LSTM Algorithm
    Alanzi, Barah Mohammed
    Uliyan, Diaa Mohammed
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 419 - 428
  • [36] Detecting Phishing Sites Using URLs Collected from Emails
    Wang, Chuan-Sheng
    Hsu, Fu-Hau
    Chen, Shih-Jen
    Hwang, Yan-Ling
    Wu, Min-Hao
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 916 - +
  • [37] Machine learning models for phishing detection from TLS traffic
    Munish Kumar
    Cheemaladinne Kondaiah
    Alwyn Roshan Pais
    Routhu Srinivasa Rao
    Cluster Computing, 2023, 26 : 3263 - 3277
  • [38] Machine learning models for phishing detection from TLS traffic
    Kumar, Munish
    Kondaiah, Cheemaladinne
    Pais, Alwyn Roshan
    Rao, Routhu Srinivasa
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 3263 - 3277
  • [39] Mining Web to Detect Phishing URLs
    Basnet, Ram B.
    Sung, Andrew H.
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 568 - 573
  • [40] Phishing Website Classification and Detection Using Machine Learning
    Kumar, Jitendra
    Santhanavijayan, A.
    Janet, B.
    Rajendran, Balaji
    Bindhumadhava, B. S.
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 473 - 478