Heuristic nonlinear regression strategy for detecting phishing websites

Cited by: 58
Authors
Babagoli, Mehdi [1]
Aghababa, Mohammad Pourmahmood [2]
Solouk, Vahid [1]
Affiliations
[1] Urmia Univ Technol, Fac Comp Engn, Orumiyeh, Iran
[2] Urmia Univ Technol, Fac Elect Engn, Orumiyeh, Iran
Keywords
Phishing; SVM; Harmony search; Feature selection; Decision tree; Wrapper; Nonlinear regression; HARMONY SEARCH ALGORITHM; CLASSIFICATION;
DOI
10.1007/s00500-018-3084-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a phishing website detection method that combines a meta-heuristic-based nonlinear regression algorithm with a feature selection approach. To validate the proposed method, we used a dataset of 11,055 phishing and legitimate webpages and selected 20 features to be extracted from these websites. Two feature selection methods, decision tree and wrapper, were used to select the best feature subset; the latter achieved a detection accuracy as high as 96.32%. After the feature selection process, two meta-heuristic algorithms were implemented to predict and detect the fraudulent websites: harmony search (HS), deployed on the basis of a nonlinear regression technique, and support vector machine (SVM). The nonlinear regression approach was used to classify the websites, with the parameters of the proposed regression model obtained using the HS algorithm. The proposed HS algorithm uses a dynamic pitch adjustment rate and generates a new harmony. The HS-based nonlinear regression reached accuracy rates of 94.13% and 92.80% for the training and test processes, respectively. The study finds that HS-based nonlinear regression performs better than SVM.
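The idea of fitting a regression model's parameters with harmony search can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the regression model, the pitch adjustment schedule, or any parameter values, so the sigmoid model in `predict`, the linear ramp for the pitch adjustment rate, the synthetic two-feature data, and all names and settings (`hms`, `hmcr`, `par_min`, `par_max`, `bandwidth`, `bounds`) are assumptions chosen for illustration only.

```python
import math
import random


def predict(params, x):
    """Hypothetical nonlinear model: sigmoid of a weighted sum plus a bias (last param)."""
    z = params[-1] + sum(w * xi for w, xi in zip(params, x))
    z = max(-60.0, min(60.0, z))  # clamp to keep exp() in a safe range
    return 1.0 / (1.0 + math.exp(-z))


def mse(params, X, y):
    """Mean squared error between model outputs and 0/1 labels."""
    return sum((predict(params, x) - t) ** 2 for x, t in zip(X, y)) / len(X)


def harmony_search(X, y, n_params, hms=20, hmcr=0.9, par_min=0.1, par_max=0.9,
                   bandwidth=0.2, iterations=1000, bounds=(-5.0, 5.0), seed=0):
    """Minimise mse() over the parameter vector with a basic harmony search."""
    rng = random.Random(seed)
    lo, hi = bounds
    # Harmony memory: hms random candidate parameter vectors and their costs.
    memory = [[rng.uniform(lo, hi) for _ in range(n_params)] for _ in range(hms)]
    costs = [mse(h, X, y) for h in memory]
    for it in range(iterations):
        # Dynamic pitch adjustment rate: an assumed linear ramp from par_min to par_max.
        par = par_min + (par_max - par_min) * it / max(1, iterations - 1)
        new = []
        for j in range(n_params):
            if rng.random() < hmcr:
                value = rng.choice(memory)[j]      # memory consideration
                if rng.random() < par:
                    value += rng.uniform(-bandwidth, bandwidth)  # pitch adjustment
            else:
                value = rng.uniform(lo, hi)        # random selection
            new.append(max(lo, min(hi, value)))
        cost = mse(new, X, y)
        worst = max(range(hms), key=costs.__getitem__)
        if cost < costs[worst]:                    # replace the worst harmony
            memory[worst], costs[worst] = new, cost
    best = min(range(hms), key=costs.__getitem__)
    return memory[best], costs[best]


def make_toy_data(n=200, seed=1):
    """Synthetic stand-in data (not the paper's dataset): label 1 when x0 + x1 > 0."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(n)]
    y = [1.0 if x[0] + x[1] > 0 else 0.0 for x in X]
    return X, y


X, y = make_toy_data()
params, cost = harmony_search(X, y, n_params=3)
accuracy = sum((predict(params, x) >= 0.5) == (t == 1.0) for x, t in zip(X, y)) / len(X)
print(f"mse={cost:.4f} accuracy={accuracy:.3f}")
```

In this sketch a site would be classified as phishing when the model output is at least 0.5; the threshold, like the model itself, is an illustrative choice rather than something stated in the abstract.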
Pages: 4315 - 4327
Number of pages: 13
Related papers
50 in total
  • [21] A Method for Detecting Phishing Websites Based on Tiny-Bert Stacking
    He, Daojing
    Lv, Xin
    Zhu, Shanshan
    Chan, Sammy
    Choo, Kim-Kwang Raymond
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) : 2236 - 2243
  • [22] PhishZip: A New Compression-based Algorithm for Detecting Phishing Websites
    Purwanto, Rizka
    Pal, Arindam
    Blair, Alan
    Jha, Sanjay
    2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
  • [23] Detecting Phishing Websites Based on the Study of the Financial Industry Webserver Logs
    Hu, Jun
    Zhang, Xiangzhu
    Ji, Yuchun
    Yan, Hanbing
    Ding, Li
    Li, Jia
    Meng, Huiming
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 325 - 328
  • [24] Detecting Phishing Web sites: A Heuristic URL-Based Approach
    Luong Anh Tuan Nguyen
    Ba Lam To
    Huu Khuong Nguyen
    Minh Hoang Nguyen
    2013 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2013, : 597 - 602
  • [25] PaSOFuAC: Particle Swarm Optimization Based Fuzzy Associative Classifier for Detecting Phishing Websites
    Priya, S.
    Selvakumar, S.
    Velusamy, R. Leela
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 125 (01) : 755 - 784
  • [26] Datasets for phishing websites detection
    Vrbancic, Grega
    Fister, Iztok, Jr.
    Podgorelec, Vili
    DATA IN BRIEF, 2020, 33
  • [27] PaSOFuAC: Particle Swarm Optimization Based Fuzzy Associative Classifier for Detecting Phishing Websites
    S. Priya
    S. Selvakumar
    R. Leela Velusamy
    Wireless Personal Communications, 2022, 125 : 755 - 784
  • [28] CCrFS: Combine Correlation Features Selection for Detecting Phishing Websites Using Machine Learning
    Moedjahedy, Jimmy
    Setyanto, Arief
    Alarfaj, Fawaz Khaled
    Alreshoodi, Mohammed
    FUTURE INTERNET, 2022, 14 (08)
  • [29] Detecting Phishing Websites Using an Efficient Feature-based Machine Learning Framework
    Sundaram, K. Mohana
    Sasikumar, R.
    Meghana, Atthipalli Sai
    Anuja, Arava
    Praneetha, Chandolu
    REVISTA GEINTEC-GESTAO INOVACAO E TECNOLOGIAS, 2021, 11 (02) : 2106 - 2112
  • [30] PhiKitA: Phishing Kit Attacks Dataset for Phishing Websites Identification
    Castano, Felipe
    Fernandez, Eduardo Fidalgo
    Alaiz-Rodriguez, Rocio
    Alegre, Enrique
    IEEE ACCESS, 2023, 11 : 40779 - 40789