A Study of Feature Selection and Dimensionality Reduction Methods for Classification-Based Phishing Detection System

被引:6
|
作者
Singh, Amit [1 ]
Tiwari, Abhishek [2 ]
机构
[1] Indian Comp Emergency Response Team, New Delhi, India
[2] Cent Univ Haryana, Dept Comp Sci & IT, Comp Applicat, Jant, India
关键词
Cyber-Crime; Dimensionality Reduction; Feature Selection; Machine Learning; Phishing;
D O I
10.4018/IJIRR.2021010101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing was introduced in 1996, and now phishing is the biggest cybercrime challenge. Phishing is an abstract way to deceive users over the internet. Purpose of phishers is to extract the sensitive information of the user. Researchers have been working on solutions of phishing problem, but the parallel evolution of cybercrime techniques have made it a tough nut to crack. Recently, machine learning-based solutions are widely adopted to tackle the menace of phishing. This survey paper studies various feature selection method and dimensionality reduction methods and sees how they perform with machine learning-based classifier. The selection of features is vital for developing a good performance machine learning model. This work is comparing three broad categories of feature selection methods, namely filter, wrapper, and embedded feature selection methods, to reduce the dimensionality of data. The effectiveness of these methods has been assessed on several machine learning classifiers using k-fold cross-validation score, accuracy, precision, recall, and time.
引用
收藏
页码:1 / 35
页数:35
相关论文
共 50 条
  • [1] Investigating the Effect Of Feature Selection and Dimensionality Reduction On Phishing Website Classification Problem
    Singh, Pradeep
    Jain, Niti
    Maini, Ambar
    [J]. 2015 1ST INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2015, : 388 - 393
  • [2] Phishing detection based on machine learning and feature selection methods
    Almseidin, Mohammed
    Abu Zuraiq, AlMaha
    Al-kasassbeh, Mouhammd
    Alnidami, Nidal
    [J]. International Journal of Interactive Mobile Technologies, 2019, 13 (12) : 71 - 183
  • [3] Phishing Webpage Detection using Feature Selection Methods
    Savyanavar, Amit S.
    Dr, Pradnya Sankpal
    Mhala, Nikhil C.
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (05) : 447 - 452
  • [5] Fast hybrid dimensionality reduction method for classification based on feature selection and grouped feature extraction
    Li, Mengmeng
    Wang, Haofeng
    Yang, Lifang
    Liang, You
    Shang, Zhigang
    Wan, Hong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150
  • [6] Feature Selection for Phishing Website Classification
    Shabudin, Shafaizal
    Sani, Nor Samsiah
    Ariffin, Khairul Akram Zainal
    Aliff, Mohd
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 587 - 595
  • [7] Feature dimensionality reduction for myoelectric pattern recognition: A comparison study of feature selection and feature projection methods
    Liu, Jie
    [J]. MEDICAL ENGINEERING & PHYSICS, 2014, 36 (12) : 1716 - 1720
  • [8] Comparative Study of Feature Subset Selection Methods for Dimensionality Reduction on Scientific Data
    Padmaja, D. Lakshmi
    Vishnuvardhan, B.
    [J]. 2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 31 - 34
  • [9] A Comparison of Different Dimensionality Reduction and Feature Selection Methods for Single Trial ERP Detection
    Lan, Tian
    Erdogmus, Deniz
    Black, Lois
    Van Santen, Jan
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 6329 - 6332
  • [10] Algorithmic Feature Selection and Dimensionality Reduction in Signal Classification Tasks
    Zavadil, Jan
    Kus, Vaclav
    Chlada, Milan
    [J]. MATHEMATICAL MODELING IN PHYSICAL SCIENCES, IC-MSQUARE 2023, 2024, 446 : 187 - 193