The Effect of Dataset Imbalance on the Performance of SCADA Intrusion Detection Systems

Cited by: 22
Authors
Balla, Asaad [1]
Habaebi, Mohamed Hadi [1]
Elsheikh, Elfatih A. A. [2]
Islam, Md. Rafiqul [1]
Suliman, F. M. [2]
Affiliations
[1] Int Islamic Univ Malaysia, Dept Elect & Comp Engn, Kuala Lumpur 53100, Malaysia
[2] King Khalid Univ, Coll Engn, Dept Elect Engn, Abha 61421, Saudi Arabia
Keywords
IDS; ICS; SCADA; imbalanced datasets; cyber security;
DOI
10.3390/s23020758
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Integrating IoT devices into SCADA systems has made data collection and transmission more efficient. This enhancement comes with significant security challenges, because it exposes traditionally isolated systems to the public internet. Effective and highly reliable security devices, such as intrusion detection systems (IDSs) and intrusion prevention systems (IPSs), are therefore critical. Many studies have used deep learning algorithms to design an efficient IDS; however, the fundamental issue of imbalanced datasets has not been fully addressed. In our research, we examined the impact of data imbalance on developing an effective SCADA-based IDS. To investigate the impact of various data balancing techniques, including random sampling, one-sided selection (OSS), near-miss, SMOTE, and ADASYN, we chose two unbalanced datasets: the Morris power dataset and the CICIDS2017 dataset. For binary classification, convolutional neural networks were coupled with long short-term memory (CNN-LSTM). The system's effectiveness was evaluated using metrics derived from the confusion matrix, such as accuracy, precision, detection rate, and F1-score. Four experiments on the two datasets demonstrate the impact of the data imbalance. This research aims to help security researchers understand imbalanced datasets and their impact on DL SCADA-IDS.
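The confusion-matrix metrics named in the abstract can be illustrated with a minimal sketch. The function name `binary_metrics` and all counts below are hypothetical and are not taken from the paper; the sketch assumes "detection rate" means recall on the attack class, which is the usual convention in IDS work. It shows why accuracy alone can mislead on an imbalanced dataset.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, detection rate (recall), and F1-score
    from binary confusion-matrix counts, with attack as the positive class."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    detection_rate = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + detection_rate
    f1 = 2 * precision * detection_rate / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "detection_rate": detection_rate,
        "f1": f1,
    }


# Hypothetical imbalanced test set: 990 normal records, 10 attack records.
# A classifier that labels every record "normal" misses all attacks,
# yet its accuracy still looks high.
majority_only = binary_metrics(tp=0, fp=0, fn=10, tn=990)

# Hypothetical classifier that catches 8 of the 10 attacks with 2 false alarms.
balanced_model = binary_metrics(tp=8, fp=2, fn=2, tn=988)

for name, result in (("majority_only", majority_only),
                     ("balanced_model", balanced_model)):
    print(name, {k: round(v, 3) for k, v in result.items()})
```

In this toy case the majority-only classifier scores high on accuracy while its detection rate and F1-score are zero, which is the kind of gap that motivates reporting precision, detection rate, and F1-score alongside accuracy.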
Pages: 12