The Effect of Feature Selection on Phish Website Detection An Empirical Study on Robust Feature Subset Selection for Effective Classification

被引:0
|
作者
Zuhair, Hiba [1 ,2 ]
Selmat, Ali [3 ,4 ]
Salleh, Mazleena [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Dept Comp Sci, Johor Baharu 81310, Johor, Malaysia
[2] Al Nahrain Univ, Baghdad, Iraq
[3] Univ Teknol Malaysia, UTM IRDA Ctr Excellence, Johor Baharu 81310, Johor, Malaysia
[4] Univ Teknol Malaysia, Fac Comp, Johor Baharu 81310, Johor, Malaysia
关键词
phish website; phishing detection; feature selection; classification model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, limited anti-phishing campaigns have given phishers more possibilities to bypass through their advanced deceptions. Moreover, failure to devise appropriate classification techniques to effectively identify these deceptions has degraded the detection of phishing websites. Consequently, exploiting as new; few; predictive; and effective features as possible has emerged as a key challenge to keep the detection resilient. Thus, some prior works had been carried out to investigate and apply certain selected methods to develop their own classification techniques. However, no study had generally agreed on which feature selection method that could be employed as the best assistant to enhance the classification performance. Hence, this study empirically examined these methods and their effects on classification performance. Furthermore, it recommends some promoting criteria to assess their outcomes and offers contribution on the problem at hand. Hybrid features, low and high dimensional datasets, different feature selection methods, and classification models were examined in this study. As a result, the findings displayed notably improved detection precision with low latency, as well as noteworthy gains in robustness and prediction susceptibilities. Although selecting an ideal feature subset was a challenging task, the findings retrieved from this study had provided the most advantageous feature subset as possible for robust selection and effective classification in the phishing detection domain.
引用
收藏
页码:221 / 232
页数:12
相关论文
共 50 条
  • [1] Feature Selection for Phishing Website Classification
    Shabudin, Shafaizal
    Sani, Nor Samsiah
    Ariffin, Khairul Akram Zainal
    Aliff, Mohd
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 587 - 595
  • [2] Analysis of Classification Model and Feature Subset Selection
    Khan, Muhammad A.
    Mirza, Anwar M.
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (10): : 3325 - 3334
  • [3] Feature Subset Selection for Fuzzy Classification Methods
    Cintra, Marcos E.
    Camargo, Heloisa A.
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND METHODS, PT 1, 2010, 80 : 318 - +
  • [4] Feature subset selection for classification of histological images
    Jelonek, J
    Stefanowski, J
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 1997, 9 (03) : 227 - 239
  • [5] Genetic feature subset selection for gender classification: A comparison study
    Sun, ZH
    Bebis, G
    Yuan, XJ
    Louis, SJ
    [J]. SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 165 - 170
  • [6] Empirical Study of Individual Feature Evaluators and Cutting Criteria for Feature Selection in Classification
    Arauzo-Azofra, Antonio
    Aznarte M, Jose L.
    Benitez, Jose M.
    [J]. 2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 541 - +
  • [7] Towards Feature Subset Selection in Intrusion Detection
    Ahmad, Iftikhar
    Amin, Fazal e
    [J]. 2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 68 - 73
  • [8] Object detection using feature subset selection
    Sun, ZH
    Bebis, G
    Miller, R
    [J]. PATTERN RECOGNITION, 2004, 37 (11) : 2165 - 2176
  • [9] Effective feature selection using feature vector graph for classification
    Zhao, Guodong
    Wu, Yan
    Chen, Fuqiang
    Zhang, Junming
    Bai, Jing
    [J]. NEUROCOMPUTING, 2015, 151 : 376 - 389
  • [10] An Ensemble Associated Feature Subset Selection for Classification Problems
    Phienthrakul, Tanasanee
    [J]. 2015 3RD INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI 2015), 2015, : 63 - 67