Identification of Phishing URLs Using Machine Learning Models

Authors
Vivek, Meghashyam [1]
Premjith, Nithin [1]
Johnson, Aaron Antonio [1]
Maurya, Ashutosh Kumar [1]
Jingle, I. Diana Jeba [1]
Affiliations
[1] Christ, Bangalore, Karnataka, India
Keywords
XGBoost; Phishing; Prediction; Machine learning; Classifier;
DOI
10.1007/978-981-99-9043-6_18
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this study, we present a machine learning-based method for identifying phishing URLs. We gathered a dataset of 600,000 legitimate and phishing URLs and extracted sixteen features from it: Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label. We then used this dataset to train a variety of machine learning models. These included standalone models such as Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models such as Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), and Convolutional Neural Networks (CNN). An evaluation of performance metrics such as accuracy, precision, recall, training time, and prediction time found that XGBoost provides the best performance across all categories.
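As an illustration of the lexical part of the feature set named in the abstract, the following is a minimal sketch of how a few of those features might be computed from a URL string. The abstract does not define the features, so every rule here is an assumption: the function name `extract_lexical_features`, the dictionary keys, the `SHORTENERS` list, and the exact tests (for example, counting hyphens in the host only) are hypothetical and are not taken from the paper.

```python
import ipaddress
from urllib.parse import urlparse

# Hypothetical list of URL-shortening hosts, for illustration only.
SHORTENERS = {"bit.ly", "goo.gl", "tinyurl.com", "t.co", "ow.ly"}


def _is_ip(host: str) -> bool:
    """Return True if the host is a literal IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(host)
        return True
    except ValueError:
        return False


def extract_lexical_features(url: str) -> dict:
    """Compute a few URL-string features (assumed definitions, not the paper's)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""

    # URL depth: number of non-empty path segments.
    depth = len([segment for segment in parsed.path.split("/") if segment])

    # Non-standard double slash: a "//" anywhere after the scheme separator.
    scheme_end = url.find("://")
    rest = url[scheme_end + 3:] if scheme_end != -1 else url

    return {
        "have_ip": int(_is_ip(host)),            # host is an IP address
        "have_at": int("@" in url),              # "@" appears in the URL
        "url_length": len(url),                  # raw character count
        "url_depth": depth,
        "double_slash": int("//" in rest),
        "https_domain": int("https" in host),    # "https" token inside the host
        "shortened_url": int(host in SHORTENERS),
        "hyphen_count": host.count("-"),         # hyphens in the host only
    }
```

Features such as DNS Record, Domain age, iFrame, or Mouse Over would need network lookups or page content and are left out of this sketch. The resulting dictionaries could be collected into a table and passed to any of the classifiers listed in the abstract.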
Pages: 209-219
Number of pages: 11