Improving Intrusion Detection Through Training Data Augmentation

被引:5
|
作者
Otokwala, Uneneibotejit [1 ]
Petrovski, Andrei [1 ]
Kalutarage, Harsha [1 ]
机构
[1] Robert Gordon Univ, Sch Comp, Aberdeen, Scotland
关键词
Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection;
D O I
10.1109/SIN54109.2021.9699293
中图分类号
学科分类号
摘要
Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the minority classes. Upon the completion of the oversampling, the independent classes were combined to form an augmented training data for model fitting. Using performance metrics such as accuracy, recall (sensitivity) and true positives (specificity), the models trained using the augmented datasets show an improvement in performance metrics over the original dataset. Similarly, in a binary class dataset, SAC performed optimally and the combination of SAC and ROSE model shows an improvement in overall accuracy, sensitivity and specificity when compared with the performance of the Random Forest model on the original dataset, ROSE and SMOTE augmented datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Improving Deep Learning Parkinson's Disease Detection Through Data Augmentation Training
    Taleb, Catherine
    Likforman-Sulem, Laurence
    Mokbel, Chafic
    PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 1144 : 79 - 93
  • [2] Improving Intrusion Detection through Merging Heterogeneous IP Data
    Zhu, Wenjie
    Wang, Qiang
    PROCEEDING OF THE IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2012, : 122 - 125
  • [3] Intrusion Detection Model Updates Through GAN Data Augmentation and Transfer Learning
    Horchulhack, Pedro
    Viegas, Eduardo K.
    Santin, Altair O.
    Geremias, Jhonatan
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2668 - 2673
  • [4] Data Augmentation for Intrusion Detection and Classification in Cloud Networks
    Chkirbene, Zina
    Ben Abdallah, Habib
    Hassine, Kawther
    Hamila, Ridha
    Erbad, Aiman
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 831 - 836
  • [5] Intrusion detection through behavioral data
    Gunetti, D
    Ruffo, G
    ADVANCES IN INTELLIGENT DATA ANALYSIS, PROCEEDINGS, 1999, 1642 : 383 - 394
  • [6] Intrusion detection using noisy training data
    Park, Y
    Lee, J
    Cho, Y
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 1, 2004, 3043 : 547 - 556
  • [7] Data augmentation using generative models for track intrusion detection
    Lee, Soohyung
    Kim, Beomseong
    Lee, Heesung
    SCIENCE PROGRESS, 2023, 106 (04)
  • [8] Improving the effectiveness of intrusion detection systems for hierarchical data
    Yahalom, Ran
    Steren, Alon
    Nameri, Yonatan
    Roytman, Maxim
    Porgador, Angel
    Elovici, Yuval
    KNOWLEDGE-BASED SYSTEMS, 2019, 168 : 59 - 69
  • [9] Improving Data Quality of Proxy Logs for Intrusion Detection
    Sha, Hongzhou
    Liu, Tingwen
    Qin, Peng
    Sun, Yong
    Liu, Qingyun
    RESEARCH IN ATTACKS, INTRUSIONS, AND DEFENSES, 2013, 8145 : 454 - +
  • [10] Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation
    Staliunaite, Ieva
    Gorinski, Philip John
    Iacobacci, Ignacio
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13834 - 13842