On Feature Selection for the Prediction of Phishing Websites

被引:2
|
作者
Fadheel, Wesam [1 ]
Abusharkh, Mohamed [2 ]
Abdel-Qader, Ikhlas [3 ]
机构
[1] Western Michigan Univ, Dept Comp Sci, Kalamazoo, MI 49008 USA
[2] Ferris State Univ, Sch Digital Media, Grand Rapids, MI USA
[3] Western Michigan Univ, Dept Elect & Comp Engn, Kalamazoo, MI 49008 USA
关键词
D O I
10.1109/DASC-PICom-DataCom-CyberSciTec.2017.146
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
with the rise of the big data paradigm, large data sets are being made available for knowledge mining. While this open up possibilities for new insights being gained every day, it also exposes data consumers to an increase in low quality, unreliable, redundant or noisy portions of the data. This would negatively affect the process of harvesting knowledge and recognizing patterns. Therefore, efficient feature selection methods to empower for real-time prediction or classification systems. Feature selection is the process of identifying the most relevant attributes and removing the redundant and irrelevant attributes. In this study, we implemented Kaiser-Meyer-Olkin (KMO) Test as a feature selection method and applied that to a publicly available phishing dataset, namely, the UCI of phishing website. furthermore, we used Logistic Regression and Support Vector Machine as classification methods to validate the feature selection method. Our results show just a slight difference in accuracy between implementation using full dataset features and the proposed much smaller dataset (almost 63% of original features set). This reduction in dimensionality is significant for the real-time systems especially when the accuracy reduction is slight. From there, we present a framework enabling a significant reduction in features. This opens the door for future work under which a wider set of classification algorithms will be tested in order to achieve the dimensionality reduction and an increase in performance accuracy.
引用
收藏
页码:871 / 876
页数:6
相关论文
共 50 条
  • [1] Applying Differential Evolution with Threshold Mechanism for Feature Selection on a Phishing Websites Classification
    Brezocnik, Lucija
    Fister, Iztok, Jr.
    Vrbancic, Grega
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2019, 2019, 1064 : 11 - 18
  • [2] From Phishing Behavior Analysis and Feature Selection to Enhance Prediction Rate in Phishing Detection
    Omar, Asmaa Reda
    Taie, Shereen
    Shaheen, Masoud E.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 1033 - 1044
  • [3] Intelligent feature selection model based on particle swarm optimization to detect phishing websites
    Alsenani, Theyab R. R.
    Ayon, Safial Islam
    Yousuf, Sayeda Mayesha
    Anik, Fahad Bin Kamal
    Chowdhury, Mohammad Ehsan Shahmi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (29) : 44943 - 44975
  • [4] Salp Swarm Optimization Search Based Feature Selection for Enhanced Phishing Websites Detection
    Abu Khurma, Ruba
    Sabri, Khair Eddin
    Castillo, Pedro A.
    Aljarah, Ibrahim
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2021, 2021, 12694 : 146 - 161
  • [5] Intelligent feature selection model based on particle swarm optimization to detect phishing websites
    Theyab R. Alsenani
    Safial Islam Ayon
    Sayeda Mayesha Yousuf
    Fahad Bin Kamal Anik
    Mohammad Ehsan Shahmi Chowdhury
    Multimedia Tools and Applications, 2023, 82 : 44943 - 44975
  • [6] Prediction of phishing websites using machine learning
    Pandey, Mithilesh Kumar
    Singh, Munindra Kumar
    Pal, Saurabh
    Tiwari, B. B.
    SPATIAL INFORMATION RESEARCH, 2023, 31 (02) : 157 - 166
  • [7] Phishing Websites Prediction Using Classification Techniques
    Ibrahim, Dyana Rashid
    Hadi, Ali Hussein
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 133 - 137
  • [8] Prediction of Phishing Websites Using Stacked Ensemble Method and Hybrid Features Selection Method
    Pandey M.K.
    Singh M.K.
    Pal S.
    Tiwari B.B.
    SN Computer Science, 3 (6)
  • [9] Feature Extraction and Classification Phishing Websites Based on URL
    Aydin, Mustafa
    Baykal, Nazife
    2015 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2015, : 769 - 770
  • [10] Prediction of Phishing Websites Using AI Techniques
    Gururaj, H. L.
    Mitra, Prithwijit
    Koner, Soumyadip
    Bal, Sauvik
    Flammini, Francesco
    Janhavi, V
    Kumar, Ravi, V
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2022, 16 (01)