Improving Intrusion Detection Through Training Data Augmentation

被引:5
|
作者
Otokwala, Uneneibotejit [1 ]
Petrovski, Andrei [1 ]
Kalutarage, Harsha [1 ]
机构
[1] Robert Gordon Univ, Sch Comp, Aberdeen, Scotland
关键词
Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection;
D O I
10.1109/SIN54109.2021.9699293
中图分类号
学科分类号
摘要
Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the minority classes. Upon the completion of the oversampling, the independent classes were combined to form an augmented training data for model fitting. Using performance metrics such as accuracy, recall (sensitivity) and true positives (specificity), the models trained using the augmented datasets show an improvement in performance metrics over the original dataset. Similarly, in a binary class dataset, SAC performed optimally and the combination of SAC and ROSE model shows an improvement in overall accuracy, sensitivity and specificity when compared with the performance of the Random Forest model on the original dataset, ROSE and SMOTE augmented datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Improving Network Intrusion Detection through Soft Computing and Natural Immunology
    Shahrestani, Seyed A.
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 87 - +
  • [32] Towards Improving the Intrusion Detection through ELM (Extreme Learning Machine)
    Ahmad, Iftikhar
    Alsemmeari, Rayan Atteah
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (02): : 1097 - 1111
  • [33] Improving Intrusion Detection Confidence Through a Moving Target Defense Strategy
    dos Santos, Roger R.
    Viegas, Eduardo K.
    Santin, Altair O.
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [34] Improving the Reliability of Network Intrusion Detection Systems Through Dataset Integration
    Magan-Carrion, Roberto
    Urda, Daniel
    Diaz-Cano, Ignacio
    Dorronsoro, Bernabe
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (04) : 1717 - 1732
  • [35] Improving intrusion detection radar
    Foley, E
    Harman, K
    Cheal, J
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2002, 17 (08) : 22 - 27
  • [36] A STOCHASTIC APPROXIMATION APPROACH FOR IMPROVING INTRUSION DETECTION DATA FUSION STRUCTURES
    Manousakis, K.
    Sterne, D.
    Ivanic, N.
    Lawler, G.
    McAuley, A.
    2008 IEEE MILITARY COMMUNICATIONS CONFERENCE: MILCOM 2008, VOLS 1-7, 2008, : 959 - 965
  • [37] CAN Intrusion Detection System Based on Data Augmentation and Improved Bi-LSTM
    Zhao, Haihang
    Cheng, Anyu
    Wang, Yi
    Wang, Shanshan
    Wang, Hongrong
    2024 IEEE THE 20TH ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS 2024, 2024, : 198 - 202
  • [38] VAE-WACGAN: An Improved Data Augmentation Method Based on VAEGAN for Intrusion Detection
    Tian, Wuxin
    Shen, Yanping
    Guo, Na
    Yuan, Jing
    Yang, Yanqing
    SENSORS, 2024, 24 (18)
  • [39] Improving Social Bot Detection Through Aid and Training
    Kenny, Ryan
    Fischhoff, Baruch
    Davis, Alex
    Canfield, Casey
    HUMAN FACTORS, 2024, 66 (10) : 2323 - 2344
  • [40] Improving DRS-to-Text Generation Through Delexicalization and Data Augmentation
    Amin, Muhammad Saad
    Anselma, Luca
    Mazzei, Alessandro
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 121 - 136