Detection of phishing websites using an efficient feature-based machine learning framework

Cited by: 111
Authors
Rao, Routhu Srinivasa [1]
Pais, Alwyn Roshan [1]
Affiliation
[1] Natl Inst Technol Karnataka, Informat Secur Res Lab, Surathkal, India
Source
NEURAL COMPUTING & APPLICATIONS | 2019 / Vol. 31 / Issue 08
Keywords
Cyber-attack; Phishing; Anti-phishing; Heuristic technique; Machine learning algorithms; Random Forest; Oblique Random Forest; CLASSIFICATION; ENSEMBLE; MODEL;
DOI
10.1007/s00521-017-3305-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Phishing is a cyber-attack that targets naive online users, tricking them into revealing sensitive information such as usernames, passwords, social security numbers, or credit card numbers. Attackers fool Internet users by disguising a webpage as a trustworthy or legitimate page in order to retrieve personal information. Many anti-phishing solutions have been proposed to date, including blacklist- or whitelist-based, heuristic, and visual similarity-based methods, but online users are still being trapped into revealing sensitive information on phishing websites. In this paper, we propose a novel classification model based on heuristic features extracted from the URL, source code, and third-party services to overcome the disadvantages of existing anti-phishing techniques. Our model was evaluated using eight different machine learning algorithms, of which the Random Forest (RF) algorithm performed best, with an accuracy of 99.31%. The experiments were repeated with different (orthogonal and oblique) random forest classifiers to find the best classifier for phishing website detection. Principal component analysis Random Forest (PCA-RF) performed best among the oblique Random Forests (oRFs), with an accuracy of 99.55%. We also tested our model with and without third-party-based features to determine the effectiveness of third-party services in classifying suspicious websites. Finally, we compared our results with the baseline models CANTINA and CANTINA+. Our proposed technique outperformed these methods and also detected zero-day phishing attacks.
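As a rough illustration of what "heuristic features extracted from the URL" can look like, the sketch below computes a handful of simple URL-based signals using only the Python standard library. The function name `url_features` and the specific features chosen here are assumptions made for illustration; they are not the feature set defined in the paper, and the source-code and third-party features mentioned in the abstract are not covered.

```python
import ipaddress
from urllib.parse import urlparse


def url_features(url: str) -> dict:
    """Compute a few illustrative URL heuristics.

    Hypothetical feature set: chosen for demonstration only,
    not taken from the paper.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""

    # A raw IP address in place of a domain name is a common phishing hint.
    try:
        ipaddress.ip_address(host)
        host_is_ip = True
    except ValueError:
        host_is_ip = False

    return {
        "url_length": len(url),                    # long URLs can hide the real host
        "num_dots_in_host": host.count("."),       # many subdomains may be suspicious
        "num_hyphens_in_host": host.count("-"),    # hyphenated look-alike domains
        "has_at_symbol": "@" in url,               # '@' can obscure the true destination
        "host_is_ip": host_is_ip,
        "uses_https": parsed.scheme == "https",
    }


if __name__ == "__main__":
    for example in ("https://www.example.com/", "http://192.168.0.1/login@secure"):
        print(example, url_features(example))
```

A feature dictionary of this kind could then be turned into a numeric vector and passed to a classifier such as a Random Forest; that training step is omitted here because the abstract does not describe its configuration.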
Pages: 3851 - 3873
Page count: 23