Enhanced intrusion detection model based on principal component analysis and variable ensemble machine learning algorithm

被引:1
|
作者
John, Ayuba [1 ]
Bin Isnin, Ismail Fauzi [2 ]
Madni, Syed Hamid Hussain [3 ]
Muchtar, Farkhana Binti [2 ]
机构
[1] Fed Univ Dutse, Fac Comp, Dutse, Jigawa State, Nigeria
[2] Univ Teknol Malaysia UTM, Fac Comp, Johor Baharu, Malaysia
[3] Univ Southampton, Sch Elect & Comp Sc, Johor Baharu, Malaysia
来源
关键词
Network security; Intrusion detection system; Classification; Detection; and Machine Learning Algorithm; PERFORMANCE; PREDICTION; STACKING; SYSTEMS;
D O I
10.1016/j.iswa.2024.200442
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The intrusion detection system (IDS) model, which can identify the presence of intruders in the network and take some predefined action for safe data transit across the network, is advantageous in achieving security in both simple and advanced network systems. Several IDS models have various security problems, such as low detection accuracy and high false alarms, which can be caused by the network traffic dataset's excessive dimensionality and class imbalance in the creation of IDS models. Principal Component Analysis (PCA) has proven to be a helpful feature selection technique for dimensionality reduction. As a result, because it is a linear transformation, it has challenges capturing non-linear relationships between feature properties in the network traffic datasets. This paper proposes a variable ensemble machine learning method to solve the problem and achieve a low variance model with high accuracy and low false alarm. First, PCA is combined with the AdaBoost ensemble machine learning algorithm, which acts as stagewise additive modelling to compensate for PCA's deficiency in feature selection in network traffic by minimizing the exponential loss function. Secondly, PCA is used for feature selection, and a LogitBoost classifier algorithm can be used for multiclass classification and acts as an additive tree regression to compensate for the PCA's weakness by minimizing the Logistic Loss to provide an optimal classifier output. Finally, the low variance ability of RandomForest, which employs the bagging approach, is applied to eliminate overfittings. The experiments of the IDS model developed from the proposed methods were evaluated on the WSN-DS, NSL-KDD, and UNSW-N15 datasets. The performance of the methods, PCA with AdaBoost, on the WSN-DS dataset has an accuracy score of 92.3 %, an 89.0 % accuracy score on the NSL-KDD dataset, and a 67.9 % accuracy score on UNSW-N15, which is the least accurate score. PCA and RandomForest surpassed them by scoring 100 % accuracy on all three datasets. PCA and Bagging have an accuracy score of 99.8 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 93.4 % on the UNSW-N15 dataset. In comparison, PCA and LogitBoost have an accuracy score of 98.9 % on the WSN-DS dataset, 100 % on the NSL-KDD dataset, and 88.7 % on the UNSW-N15 dataset.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] A Network Intrusion Detection System Using Ensemble Machine Learning
    Kiflay, Aklil Zenebe
    Tsokanos, Athanasios
    Kirner, Raimund
    2021 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2021,
  • [42] Enhanced intrusion detection framework for securing IoT network using principal component analysis and CNN
    Mazid, Abdul
    Kirmani, Sheeraz
    Abid, Manaullah
    INFORMATION SECURITY JOURNAL, 2024,
  • [43] Network Intrusion Detection System Using Principal Component Analysis Algorithm and Decision Tree Classifier
    Osho, Oyeyemi
    Hong, Sungbum
    Kwembe, Tor A.
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 273 - 279
  • [44] Network Intrusion Detection and Comparative Analysis Using Ensemble Machine Learning and Feature Selection
    Das, Saikat
    Saha, Sajal
    Priyoti, Annita Tahsin
    Roy, Etee Kawna
    Sheldon, Frederick T. T.
    Haque, Anwar
    Shiva, Sajjan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4821 - 4833
  • [45] Anomaly-based Network Intrusion Detection using Ensemble Machine Learning Approach
    Das, Abhijit
    Pramod
    Sunitha, B. S.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (02) : 635 - 645
  • [46] Analysis of Machine Learning Techniques Based Intrusion Detection Systems
    Sharma, Rupam Kr.
    Kalita, Hemanta Kumar
    Borah, Parashjyoti
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 485 - 493
  • [47] Sustainable Ensemble Learning Driving Intrusion Detection Model
    Li, Xinghua
    Zhu, Mengyao
    Yang, Laurence T.
    Xu, Mengfan
    Ma, Zhuo
    Zhong, Cheng
    Li, Hui
    Xiang, Yang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (04) : 1591 - 1604
  • [48] Stock Index Prediction Based on Principal Component Analysis and Machine Learning
    Zhu, Shitao
    Zhao, Ming
    Wei, Shengqing
    An, Simeng
    2020 INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2020), 2020, : 246 - 249
  • [49] Intrusion detection model using machine learning algorithm on Big Data environment
    Othman, Suad Mohammed
    Ba-Alwi, Fadl Mutaher
    Alsohybe, Nabeel T.
    Al-Hashida, Amal Y.
    JOURNAL OF BIG DATA, 2018, 5 (01)
  • [50] AN ADAPTIVE LEARNING ALGORITHM FOR PRINCIPAL COMPONENT ANALYSIS
    CHEN, LH
    CHANG, SY
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (05): : 1255 - 1263