Improving Intrusion Detection Through Training Data Augmentation

被引:5
|
作者
Otokwala, Uneneibotejit [1 ]
Petrovski, Andrei [1 ]
Kalutarage, Harsha [1 ]
机构
[1] Robert Gordon Univ, Sch Comp, Aberdeen, Scotland
关键词
Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection;
D O I
10.1109/SIN54109.2021.9699293
中图分类号
学科分类号
摘要
Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the minority classes. Upon the completion of the oversampling, the independent classes were combined to form an augmented training data for model fitting. Using performance metrics such as accuracy, recall (sensitivity) and true positives (specificity), the models trained using the augmented datasets show an improvement in performance metrics over the original dataset. Similarly, in a binary class dataset, SAC performed optimally and the combination of SAC and ROSE model shows an improvement in overall accuracy, sensitivity and specificity when compared with the performance of the Random Forest model on the original dataset, ROSE and SMOTE augmented datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Intrusion Detection for Smart Home Security Based on Data Augmentation with Edge Computing
    Yuan, Danni
    Ota, Kaoru
    Dong, Mianxiong
    Zhu, Xiaoyan
    Wu, Tao
    Zhang, Linjie
    Ma, Jianfeng
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [22] Intrusion Detection System After Data Augmentation Schemes Based on the VAE and CVAE
    Liu, Chang
    Antypenko, Ruslan
    Sushko, Iryna
    Zakharchenko, Oksana
    IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (02) : 1000 - 1010
  • [23] Enhancing Intrusion Detection Systems Using a Deep Learning and Data Augmentation Approach
    Mohammad, Rasheed
    Saeed, Faisal
    Almazroi, Abdulwahab Ali
    Alsubaei, Faisal S.
    Almazroi, Abdulaleem Ali
    SYSTEMS, 2024, 12 (03):
  • [24] On IoT intrusion detection based on data augmentation for enhancing learning on unbalanced samples
    Zhang, Ying
    Liu, Qiang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 133 : 213 - 227
  • [25] Improving Active Learning Performance through the Use of Data Augmentation
    Fonseca, Joao
    Bacao, Fernando
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [26] Efficient Training Data Extraction Framework for Intrusion Detection Systems
    Makiou, Abdelhamid
    Serhrouchni, Ahmed
    2015 6TH INTERNATIONAL CONFERENCE ON THE NETWORK OF THE FUTURE (NOF), 2015,
  • [27] Decomposing Training Data to Improve Network Intrusion Detection Performance
    Saia, Roberto
    Podda, Alessandro Sebastian
    Fenu, Gianni
    Balia, Riccardo
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1:, 2021, : 241 - 248
  • [28] Improving disk failure detection accuracy via data augmentation
    Wang, Wang
    Tang, Xuehai
    Zhou, Biyu
    Xiao, Wenjie
    Han, Jizhong
    Hu, Songlin
    2022 IEEE/ACM 30TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2022,
  • [29] Data Augmentation Method for Improving Vehicle Detection and Recognition Performance
    Chen, Xiu-Zhi
    Cheng, Chen-Pu
    Chen, Yen-Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 419 - 420
  • [30] Data augmentation for training deep regression for in vitro cell detection
    Debeir, Olivier
    Decaestecker, Christine
    2019 FIFTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2019, : 1 - 3