Detection of phishing websites using an efficient feature-based machine learning framework

被引:111
|
作者
Rao, Routhu Srinivasa [1 ]
Pais, Alwyn Roshan [1 ]
机构
[1] Natl Inst Technol Karnataka, Informat Secur Res Lab, Surathkal, India
来源
NEURAL COMPUTING & APPLICATIONS | 2019年 / 31卷 / 08期
关键词
Cyber-attack; Phishing; Anti-phishing; Heuristic technique; Machine learning algorithms; Random Forest; Oblique Random Forest; CLASSIFICATION; ENSEMBLE; MODEL;
D O I
10.1007/s00521-017-3305-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phishing is a cyber-attack which targets naive online users tricking into revealing sensitive information such as username, password, social security number or credit card number etc. Attackers fool the Internet users by masking webpage as a trustworthy or legitimate page to retrieve personal information. There are many anti-phishing solutions such as blacklist or whitelist, heuristic and visual similarity-based methods proposed to date, but online users are still getting trapped into revealing sensitive information in phishing websites. In this paper, we propose a novel classification model, based on heuristic features that are extracted from URL, source code, and third-party services to overcome the disadvantages of existing anti-phishing techniques. Our model has been evaluated using eight different machine learning algorithms and out of which, the Random Forest (RF) algorithm performed the best with an accuracy of 99.31%. The experiments were repeated with different (orthogonal and oblique) random forest classifiers to find the best classifier for the phishing website detection. Principal component analysis Random Forest (PCA-RF) performed the best out of all oblique Random Forests (oRFs) with an accuracy of 99.55%. We have also tested our model with the third-party-based features and without third-party-based features to determine the effectiveness of third-party services in the classification of suspicious websites. We also compared our results with the baseline models (CANTINA and CANTINA+). Our proposed technique outperformed these methods and also detected zero-day phishing attacks.
引用
收藏
页码:3851 / 3873
页数:23
相关论文
共 50 条
  • [21] Phishing detection based on machine learning and feature selection methods
    Almseidin M.
    Abu Zuraiq A.M.
    Al-kasassbeh M.
    Alnidami N.
    International Journal of Interactive Mobile Technologies, 2019, 13 (12) : 71 - 183
  • [22] Feature Selection Approach for Phishing Detection Based on Machine Learning
    Wei, Yi
    Sekiya, Yuji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON APPLIED CYBER SECURITY (ACS) 2021, 2022, 378 : 61 - 70
  • [23] Phishing Hybrid Feature-Based Classifier by Using Recursive Features Subset Selection and Machine Learning Algorithms
    Zuhair, Hiba
    Selamat, Ali
    RECENT TRENDS IN DATA SCIENCE AND SOFT COMPUTING, IRICT 2018, 2019, 843 : 267 - 277
  • [24] Detecting phishing websites using machine learning technique
    Dutta, Ashit Kumar
    PLOS ONE, 2021, 16 (10):
  • [25] Explainable machine learning for phishing feature detection
    Calzarossa, Maria Carla
    Giudici, Paolo
    Zieni, Rasha
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2024, 40 (01) : 362 - 373
  • [26] Sufficiency of Ensemble Machine Learning Methods for Phishing Websites Detection
    Wei, Yi
    Sekiya, Yuji
    IEEE ACCESS, 2022, 10 : 124103 - 124113
  • [27] Comparative analysis of machine learning algorithms in detection of phishing websites
    Kosan, Muhammed Ali
    Yildiz, Oktay
    Karacan, Hacer
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2018, 24 (02): : 276 - 282
  • [28] Design of Efficient Phishing Detection Model using Machine Learning
    Kim, Bong -Hyun
    TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2024, 18 (01): : 37 - 42
  • [29] Efficient detection of phishing websites using multilayer perceptron
    Odeh A.
    Keshta I.
    Abdelfattah E.
    International Journal of Interactive Mobile Technologies, 2020, 14 (11) : 22 - 31
  • [30] Feature-based machine learning for the efficient design of nanophotonic structures
    Ferranti, Francesco
    PHOTONICS AND NANOSTRUCTURES-FUNDAMENTALS AND APPLICATIONS, 2022, 52