Investigating the Effect Of Feature Selection and Dimensionality Reduction On Phishing Website Classification Problem

被引:0
|
作者
Singh, Pradeep [1 ]
Jain, Niti [1 ]
Maini, Ambar [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci, Raipur, Madhya Pradesh, India
关键词
Data Mining; phishing website detection; Classification;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Phishing is a term given to the method of gaining unauthorized access to a person's private information like passwords, account or credit card details. It is a deception technique that utilizes social engineering & technology to convince a victim to provide personal information, usually for monetary benefits. Phishing attacks have become frequent and involve the risk of identity theft and financial losses. Detection of phishing website has become very important for online banking and e-commerce users. We proposed an effective model that is based on preprocessing (Feature selection and dimensionality reduction) and classification DataMining algorithms. These algorithms were used to characterize and identify all the factors to classify the phishing website. We implemented five different classification algorithm and four preprocessing techniques to classify a websites legitimate or phishy. We also compared their respective performances in terms of accuracy and AUC.
引用
收藏
页码:388 / 393
页数:6
相关论文
共 50 条
  • [1] Feature Selection for Phishing Website Classification
    Shabudin, Shafaizal
    Sani, Nor Samsiah
    Ariffin, Khairul Akram Zainal
    Aliff, Mohd
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 587 - 595
  • [2] A Study of Feature Selection and Dimensionality Reduction Methods for Classification-Based Phishing Detection System
    Singh, Amit
    Tiwari, Abhishek
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (01) : 1 - 35
  • [3] Optimization of Phishing Website Classification Based on Synthetic Minority Oversampling Technique and Feature Selection
    Prayogo, Rizal Dwi
    Karimah, Siti Amatullah
    [J]. 2020 5TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2020), 2020, : 125 - 130
  • [4] Algorithmic Feature Selection and Dimensionality Reduction in Signal Classification Tasks
    Zavadil, Jan
    Kus, Vaclav
    Chlada, Milan
    [J]. MATHEMATICAL MODELING IN PHYSICAL SCIENCES, IC-MSQUARE 2023, 2024, 446 : 187 - 193
  • [5] Feature selection for dimensionality reduction
    Mladenic, Dunja
    [J]. SUBSPACE, LATENT STRUCTURE AND FEATURE SELECTION, 2006, 3940 : 84 - 102
  • [6] Review of Feature Selection, Dimensionality Reduction and Classification for Chronic Disease Diagnosis
    Alhassan, Afnan M.
    Zainon, Wan Mohd Nazmee Wan
    [J]. IEEE ACCESS, 2021, 9 : 87310 - 87317
  • [7] Fast hybrid dimensionality reduction method for classification based on feature selection and grouped feature extraction
    Li, Mengmeng
    Wang, Haofeng
    Yang, Lifang
    Liang, You
    Shang, Zhigang
    Wan, Hong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150
  • [8] INVESTIGATION OF THE IMPACT OF DIMENSIONALITY REDUCTION AND FEATURE SELECTION ON THE CLASSIFICATION OF HYPERSPECTRAL ENMAP DATA
    Keller, S.
    Braun, A. C.
    Hinz, S.
    Weinmann, M.
    [J]. 2016 8TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2016,
  • [9] Phishing Website Detection Using Machine Learning Classifiers Optimized by Feature Selection
    Mehanovic, Dzelila
    Kevric, Jasmin
    [J]. TRAITEMENT DU SIGNAL, 2020, 37 (04) : 563 - 569
  • [10] Phishing Website Detection: An Improved Accuracy through Feature Selection and Ensemble Learning
    Ubing, Alyssa Anne
    Jasmi, Syukrina Kamilia Binti
    Abdullah, Azween
    Jhanjhi, N. Z.
    Supramaniam, Mahadevan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 252 - 257