Enhanced intrusion detection model based on principal component analysis and variable ensemble machine learning algorithm

被引:1
|
作者
John, Ayuba [1 ]
Bin Isnin, Ismail Fauzi [2 ]
Madni, Syed Hamid Hussain [3 ]
Muchtar, Farkhana Binti [2 ]
机构
[1] Fed Univ Dutse, Fac Comp, Dutse, Jigawa State, Nigeria
[2] Univ Teknol Malaysia UTM, Fac Comp, Johor Baharu, Malaysia
[3] Univ Southampton, Sch Elect & Comp Sc, Johor Baharu, Malaysia
来源
关键词
Network security; Intrusion detection system; Classification; Detection; and Machine Learning Algorithm; PERFORMANCE; PREDICTION; STACKING; SYSTEMS;
D O I
10.1016/j.iswa.2024.200442
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The intrusion detection system (IDS) model, which can identify the presence of intruders in the network and take some predefined action for safe data transit across the network, is advantageous in achieving security in both simple and advanced network systems. Several IDS models have various security problems, such as low detection accuracy and high false alarms, which can be caused by the network traffic dataset's excessive dimensionality and class imbalance in the creation of IDS models. Principal Component Analysis (PCA) has proven to be a helpful feature selection technique for dimensionality reduction. As a result, because it is a linear transformation, it has challenges capturing non-linear relationships between feature properties in the network traffic datasets. This paper proposes a variable ensemble machine learning method to solve the problem and achieve a low variance model with high accuracy and low false alarm. First, PCA is combined with the AdaBoost ensemble machine learning algorithm, which acts as stagewise additive modelling to compensate for PCA's deficiency in feature selection in network traffic by minimizing the exponential loss function. Secondly, PCA is used for feature selection, and a LogitBoost classifier algorithm can be used for multiclass classification and acts as an additive tree regression to compensate for the PCA's weakness by minimizing the Logistic Loss to provide an optimal classifier output. Finally, the low variance ability of RandomForest, which employs the bagging approach, is applied to eliminate overfittings. The experiments of the IDS model developed from the proposed methods were evaluated on the WSN-DS, NSL-KDD, and UNSW-N15 datasets. The performance of the methods, PCA with AdaBoost, on the WSN-DS dataset has an accuracy score of 92.3 %, an 89.0 % accuracy score on the NSL-KDD dataset, and a 67.9 % accuracy score on UNSW-N15, which is the least accurate score. PCA and RandomForest surpassed them by scoring 100 % accuracy on all three datasets. PCA and Bagging have an accuracy score of 99.8 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 93.4 % on the UNSW-N15 dataset. In comparison, PCA and LogitBoost have an accuracy score of 98.9 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 88.7 % on the UNSW-N15 dataset.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] A novel ensemble learning-based model for network intrusion detection
    Thockchom, Ngamba
    Singh, Moirangthem Marjit
    Nandi, Utpal
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5693 - 5714
  • [32] RSSI-based Floor Localization Using Principal Component Analysis and Ensemble Extreme Learning Machine Technique
    Qi, Guowen
    Jin, Yi
    Yan, Jun
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [33] Video Jitter Detection Algorithm Based on Principal Component Analysis
    Xie, Songhua
    Nie, Hui
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 370 - 373
  • [34] A robust anomaly detection algorithm based on principal component analysis
    Huang, Yingkun
    Jin, Weidong
    Yu, Zhibin
    Li, Bing
    INTELLIGENT DATA ANALYSIS, 2021, 25 (02) : 249 - 263
  • [35] Ensemble kernel principal component analysis for fault detection
    Gan, L.-Z. (lzh_box@163.com), 1691, Northeast University (28):
  • [36] Enforcement of the principal component analysis–extreme learning machine algorithm by linear discriminant analysis
    A. Castaño
    F. Fernández-Navarro
    Annalisa Riccardi
    C. Hervás-Martínez
    Neural Computing and Applications, 2016, 27 : 1749 - 1760
  • [37] Algorithm for classifying arrhythmia using extreme learning machine and principal component analysis
    Kim, Jinkwon
    Shin, Hangsik
    Lee, Yonwook
    Lee, Myourigho
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 3257 - +
  • [38] Research on Network Intrusion Detection Based on SMOTE Algorithm and Machine Learning
    Zhang Y.
    Zhang T.
    Chen J.
    Wang Y.
    Zou Q.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2019, 39 (12): : 1258 - 1262
  • [39] Consensus hybrid ensemble machine learning for intrusion detection with AI
    Ahmed, Usman
    Jiangbin, Zheng
    Khan, Sheharyar
    Sadiq, Muhammad Tariq
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2025, 235
  • [40] A Robust Intrusion Detection System using Ensemble Machine Learning
    Divakar, Subham
    Priyadarshini, Rojalina
    Mishra, Brojo Kishore
    PROCEEDINGS OF 2020 6TH IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE 2020), 2020, : 348 - 351