Evaluation of Machine Learning Algorithms in Network-Based Intrusion Detection Using Progressive Dataset

被引:3
|
作者
Chua, Tuan-Hong [1 ]
Salam, Iftekhar [1 ]
机构
[1] Xiamen Univ Malaysia, Sch Comp & Data Sci, Sepang 43900, Malaysia
来源
SYMMETRY-BASEL | 2023年 / 15卷 / 06期
关键词
intrusion detection; machine learning; deep learning; cybersecurity; DETECTION SYSTEM;
D O I
10.3390/sym15061251
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cybersecurity has become one of the focuses of organisations. The number of cyberattacks keeps increasing as Internet usage continues to grow. As new types of cyberattacks continue to emerge, researchers focus on developing machine learning (ML)-based intrusion detection systems (IDS) to detect zero-day attacks. They usually remove some or all attack samples from the training dataset and only include them in the testing dataset when evaluating the performance. This method may detect unknown attacks; however, it does not reflect the long-term performance of the IDS as it only shows the changes in the type of attacks. In this work, we focused on evaluating the long-term performance of ML-based IDS. To achieve this goal, we proposed evaluating the ML-based IDS using a dataset created later than the training dataset. The proposed method can better assess the long-term performance as the testing dataset reflects the changes in the attack type and network infrastructure changes over time. We have implemented six of the most popular ML models, including decision tree (DT), random forest (RF), support vector machine (SVM), naive Bayes (NB), artificial neural network (ANN), and deep neural network (DNN). These models are trained and tested with a pair of datasets with symmetrical classes. Our experiments using the CIC-IDS2017 and the CSE-CIC-IDS2018 datasets show that SVM and ANN are most resistant to overfitting. Our experiments also indicate that DT and RF suffer the most from overfitting, although they perform well on the training dataset. On the other hand, our experiments using the LUFlow dataset have shown that all models can perform well when the difference between the training and testing datasets is small.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Toward a Reliable Evaluation of Machine Learning Schemes for Network-Based Intrusion Detection
    Viegas E.K.
    Santin A.O.
    Tedeschi P.
    IEEE Internet of Things Magazine, 2023, 6 (02): : 70 - 75
  • [2] Network Intrusion Detection Using Machine Learning Anomaly Detection Algorithms
    Hanifi, Khadija
    Bank, Hasan
    Karsligil, M. Elif
    Yavuz, A. Gokhan
    Guvensan, M. Amac
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [3] Network intrusion detection using oversampling technique and machine learning algorithms
    Ahmed, Hafiza Anisa
    Hameed, Anum
    Bawany, Narmeen Zakaria
    PEERJ COMPUTER SCIENCE, 2022, 8 : 1 - 19
  • [4] Enhancing Network Intrusion Detection Model Using Machine Learning Algorithms
    Awad, Nancy Awadallah
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (01): : 979 - 990
  • [5] Network intrusion detection using oversampling technique and machine learning algorithms
    Ahmed H.A.
    Hameed A.
    Bawany N.Z.
    PeerJ Computer Science, 2022, 8
  • [6] Evaluation of Tree-Based Machine Learning Algorithms for Network Intrusion Detection in the Internet of Things
    Essa, Mohamed Saied
    Guirguis, Shawkat Kamal
    IT PROFESSIONAL, 2023, 25 (05) : 45 - 56
  • [7] Comparative Evaluation of Machine Learning Algorithms for Network Intrusion Detection and Attack Classification
    Leon, Miguel
    Markovic, Tijana
    Punnekkat, Sasikumar
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] Toward feasible machine learning model updates in network-based intrusion detection
    Horchulhack, Pedro
    Viegas, Eduardo K.
    Santin, Altair O.
    COMPUTER NETWORKS, 2022, 202
  • [9] Machine Learning Techniques for Network-based Intrusion Detection System: A Survey Paper
    Ahmed, Lubna Ali Hassan
    Hamad, Yahia Abdalla Mohamed
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1024 - +
  • [10] Evaluation of Machine Learning Algorithms for Intrusion Detection System
    Almseidin, Mohammad
    Alzubi, Maen
    Kovacs, Szilveszter
    Alkasassbeh, Mouhammd
    2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY), 2017, : 277 - 282