Enhanced intrusion detection model based on principal component analysis and variable ensemble machine learning algorithm

被引:1
|
作者
John, Ayuba [1 ]
Bin Isnin, Ismail Fauzi [2 ]
Madni, Syed Hamid Hussain [3 ]
Muchtar, Farkhana Binti [2 ]
机构
[1] Fed Univ Dutse, Fac Comp, Dutse, Jigawa State, Nigeria
[2] Univ Teknol Malaysia UTM, Fac Comp, Johor Baharu, Malaysia
[3] Univ Southampton, Sch Elect & Comp Sc, Johor Baharu, Malaysia
来源
关键词
Network security; Intrusion detection system; Classification; Detection; and Machine Learning Algorithm; PERFORMANCE; PREDICTION; STACKING; SYSTEMS;
D O I
10.1016/j.iswa.2024.200442
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The intrusion detection system (IDS) model, which can identify the presence of intruders in the network and take some predefined action for safe data transit across the network, is advantageous in achieving security in both simple and advanced network systems. Several IDS models have various security problems, such as low detection accuracy and high false alarms, which can be caused by the network traffic dataset's excessive dimensionality and class imbalance in the creation of IDS models. Principal Component Analysis (PCA) has proven to be a helpful feature selection technique for dimensionality reduction. As a result, because it is a linear transformation, it has challenges capturing non-linear relationships between feature properties in the network traffic datasets. This paper proposes a variable ensemble machine learning method to solve the problem and achieve a low variance model with high accuracy and low false alarm. First, PCA is combined with the AdaBoost ensemble machine learning algorithm, which acts as stagewise additive modelling to compensate for PCA's deficiency in feature selection in network traffic by minimizing the exponential loss function. Secondly, PCA is used for feature selection, and a LogitBoost classifier algorithm can be used for multiclass classification and acts as an additive tree regression to compensate for the PCA's weakness by minimizing the Logistic Loss to provide an optimal classifier output. Finally, the low variance ability of RandomForest, which employs the bagging approach, is applied to eliminate overfittings. The experiments of the IDS model developed from the proposed methods were evaluated on the WSN-DS, NSL-KDD, and UNSW-N15 datasets. The performance of the methods, PCA with AdaBoost, on the WSN-DS dataset has an accuracy score of 92.3 %, an 89.0 % accuracy score on the NSL-KDD dataset, and a 67.9 % accuracy score on UNSW-N15, which is the least accurate score. PCA and RandomForest surpassed them by scoring 100 % accuracy on all three datasets. PCA and Bagging have an accuracy score of 99.8 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 93.4 % on the UNSW-N15 dataset. In comparison, PCA and LogitBoost have an accuracy score of 98.9 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 88.7 % on the UNSW-N15 dataset.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Network Intrusion Detection Based on Kernel Principal Component Analysis and Extreme Learning Machine
    Zhou, Yuan
    Yu, Le
    Liu, Mingshan
    Zhang, Yuanyuan
    Li, Helin
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 860 - 864
  • [2] An ensemble system for machine learning IoT intrusion detection based on enhanced artificial hummingbird algorithm
    Shi, Leyi
    Yang, Qihang
    Gao, Luhan
    Ge, Haowei
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [3] Research on Network Intrusion Detection Based on Incremental Extreme Learning Machine and Adaptive Principal Component Analysis
    Gao, Jianlei
    Chai, Senchun
    Zhang, Baihai
    Xia, Yuanqing
    ENERGIES, 2019, 12 (07)
  • [4] Comparative Study of Principal Component Analysis Based Intrusion Detection Approach Using Machine Learning Algorithms
    Chabathula, Krupa Joel
    Jaidhar, C. D.
    Kumara, Ajay M. A.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [5] An Adaptive Ensemble Machine Learning Model for Intrusion Detection
    Gao, Xianwei
    Shan, Chun
    Hu, Changzhen
    Niu, Zequn
    Liu, Zhen
    IEEE ACCESS, 2019, 7 : 82512 - 82521
  • [6] A Naive Bayesian network intrusion detection algorithm based on Principal Component Analysis
    Han, Xiaoyan
    Xu, Liancheng
    Ren, Min
    Gu, Weiping
    2015 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2015, : 325 - 328
  • [7] Improved Ant Colony Optimization and Machine Learning Based Ensemble Intrusion Detection Model
    Vanitha, S.
    Balasubramanie, P.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 849 - 864
  • [8] Research of Intrusion Detection based on Ensemble Learning Model
    Li Yang
    Li Jianlin
    Yue Songjie
    Wang Zhi
    INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS II, PTS 1-3, 2013, 336-338 : 2376 - 2380
  • [9] Multiple classification algorithm based on ensemble learning for intrusion detection
    Liu, Fulai
    Yue, Jiaqi
    Hu, Zhongyi
    Du, Ruiyan
    WIRELESS NETWORKS, 2025, 31 (03) : 2143 - 2154
  • [10] Ensemble of Machine Learning Algorithms for Intrusion Detection
    Chou, Te-Shun
    Fan, Jeffrey
    Fan, Sharon
    Makki, Kia
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3976 - +