A Study of Feature Selection and Dimensionality Reduction Methods for Classification-Based Phishing Detection System

被引:6
|
作者
Singh, Amit [1 ]
Tiwari, Abhishek [2 ]
机构
[1] Indian Comp Emergency Response Team, New Delhi, India
[2] Cent Univ Haryana, Dept Comp Sci & IT, Comp Applicat, Jant, India
关键词
Cyber-Crime; Dimensionality Reduction; Feature Selection; Machine Learning; Phishing;
D O I
10.4018/IJIRR.2021010101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing was introduced in 1996, and now phishing is the biggest cybercrime challenge. Phishing is an abstract way to deceive users over the internet. Purpose of phishers is to extract the sensitive information of the user. Researchers have been working on solutions of phishing problem, but the parallel evolution of cybercrime techniques have made it a tough nut to crack. Recently, machine learning-based solutions are widely adopted to tackle the menace of phishing. This survey paper studies various feature selection method and dimensionality reduction methods and sees how they perform with machine learning-based classifier. The selection of features is vital for developing a good performance machine learning model. This work is comparing three broad categories of feature selection methods, namely filter, wrapper, and embedded feature selection methods, to reduce the dimensionality of data. The effectiveness of these methods has been assessed on several machine learning classifiers using k-fold cross-validation score, accuracy, precision, recall, and time.
引用
收藏
页码:1 / 35
页数:35
相关论文
共 50 条
  • [41] A Classification-Based Selection for Evolutionary Optimization
    Zhang, Jinyuan
    Huang, Jimmy Xiangji
    Hu, Qinmin Vivian
    [J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 328 - 329
  • [42] INDIRECT METHODS FOR DIMENSIONALITY REDUCTION AND CLASSIFICATION
    MEISEL, WS
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1971, SMC1 (04): : 402 - &
  • [43] A new hybrid ensemble feature selection framework for machine learning-based phishing detection system
    Chiew, Kang Leng
    Tan, Choon Lin
    Wong, KokSheik
    Yong, Kelvin S. C.
    Tiong, Wei King
    [J]. INFORMATION SCIENCES, 2019, 484 : 153 - 166
  • [44] Fusion of Dimensionality Reduction Methods: a Case Study in Microarray Classification
    Deegalla, Sampath
    Bostrom, Henrik
    [J]. FUSION: 2009 12TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2009, : 460 - +
  • [45] An Effective Neural Network Phishing Detection Model Based on Optimal Feature Selection
    Zhu, Erzhou
    Ye, Chengcheng
    Liu, Dong
    Liu, Feng
    Wang, Futian
    Li, Xuejun
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 781 - 787
  • [46] A New Ensemble Model for Phishing Detection Based on Hybrid Cumulative Feature Selection
    Prince, Md Sirajum Munir
    Hasan, Asib
    Shah, Faisal Muhammad
    [J]. 11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021), 2021, : 7 - 12
  • [47] Comparison of feature extraction methods in dimensionality reduction
    Wu, Jee-cheng
    Chang, Chiao-Po
    Tsuei, Gwo-Chyang
    [J]. CANADIAN JOURNAL OF REMOTE SENSING, 2010, 36 (06): : 645 - 649
  • [48] A Classification-Based Approach for Implicit Feature Identification
    Zeng, Lingwei
    Li, Fang
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 190 - 202
  • [49] Optimization of Phishing Website Classification Based on Synthetic Minority Oversampling Technique and Feature Selection
    Prayogo, Rizal Dwi
    Karimah, Siti Amatullah
    [J]. 2020 5TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2020), 2020, : 125 - 130
  • [50] A Hybrid Machine Learning based Phishing Web site Detection Technique through Dimensionality Reduction
    Tabassum, Nusrath
    Neha, Farhin Faiza
    Hossain, Md. Shohrab
    Narman, Husnu S.
    [J]. 2021 IEEE INTERNATIONAL BLACK SEA CONFERENCE ON COMMUNICATIONS AND NETWORKING (IEEE BLACKSEACOM), 2021, : 196 - 201