Improving Intrusion Detection Through Training Data Augmentation

Times cited: 5
Authors
Otokwala, Uneneibotejit [1]
Petrovski, Andrei [1]
Kalutarage, Harsha [1]
Affiliations
[1] Robert Gordon Univ, Sch Comp, Aberdeen, Scotland
Keywords
Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection
DOI
10.1109/SIN54109.2021.9699293
Abstract
Class imbalance is a common problem in security datasets, and several strategies, such as class resampling and cost-sensitive training, have been proposed to address it. In this paper, we propose a data augmentation strategy that oversamples the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets by class label and then generate synthetic data from each subset. The synthetic data are used to oversample the minority classes. Once oversampling is complete, the individual classes are combined to form an augmented training set for model fitting. Measured by accuracy, recall (sensitivity) and specificity (true negative rate), the models trained on the augmented datasets show an improvement over those trained on the original dataset. Similarly, on a binary-class dataset, SAC performed optimally, and the combination of SAC and ROSE shows an improvement in overall accuracy, sensitivity and specificity compared with the Random Forest model trained on the original dataset and on the ROSE- and SMOTE-augmented datasets.
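The three steps named in the abstract (split by class label, generate synthetic rows per subset, recombine) can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not say how the synthetic data are generated, so the generator here (resampling minority rows with replacement and adding small Gaussian jitter) is a stand-in, and the function name `sort_augment_combine` and the parameter `noise_scale` are hypothetical.

```python
import numpy as np


def sort_augment_combine(X, y, rng=None, noise_scale=0.05):
    """Balance classes by per-class synthetic oversampling (illustrative only).

    The generator is an ASSUMPTION: rows are resampled with replacement and
    jittered with Gaussian noise scaled by each feature's per-class spread.
    """
    rng = np.random.default_rng(rng)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)

    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()  # bring every class up to the majority size

    X_parts, y_parts = [], []
    for label, count in zip(classes, counts):
        # Sort: isolate the rows belonging to one class label.
        subset = X[y == label]
        X_parts.append(subset)
        y_parts.append(np.full(count, label))

        # Augment: generate synthetic rows only for under-represented classes.
        deficit = target - count
        if deficit > 0:
            picks = subset[rng.integers(0, count, size=deficit)]
            jitter = rng.normal(0.0, 1.0, size=picks.shape)
            synthetic = picks + jitter * noise_scale * subset.std(axis=0)
            X_parts.append(synthetic)
            y_parts.append(np.full(deficit, label))

    # Combine: merge all classes and shuffle into one training set.
    X_aug = np.vstack(X_parts)
    y_aug = np.concatenate(y_parts)
    order = rng.permutation(len(y_aug))
    return X_aug[order], y_aug[order]
```

Called on a training set with 90 rows of one class and 10 of another, the function returns 180 rows with 90 per class. Only the training split should be augmented; the evaluation data should keep its original class distribution so that sensitivity and specificity remain meaningful.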
Pages: 8