Improving Intrusion Detection Through Training Data Augmentation

被引：5

作者：

Otokwala, Uneneibotejit ^{[1
]}

Petrovski, Andrei ^{[1
]}

Kalutarage, Harsha ^{[1
]}

机构：

[1] Robert Gordon Univ, Sch Comp, Aberdeen, Scotland

来源：

2021 14TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS (SIN 2021) | 2021年

关键词：

Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection;

D O I：

10.1109/SIN54109.2021.9699293

中图分类号：

学科分类号：

摘要：

Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the minority classes. Upon the completion of the oversampling, the independent classes were combined to form an augmented training data for model fitting. Using performance metrics such as accuracy, recall (sensitivity) and true positives (specificity), the models trained using the augmented datasets show an improvement in performance metrics over the original dataset. Similarly, in a binary class dataset, SAC performed optimally and the combination of SAC and ROSE model shows an improvement in overall accuracy, sensitivity and specificity when compared with the performance of the Random Forest model on the original dataset, ROSE and SMOTE augmented datasets.

引用

页数：8

共 50 条

[31] Improving Network Intrusion Detection through Soft Computing and Natural Immunology
Shahrestani, Seyed A.
PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 87 - +
[32] Towards Improving the Intrusion Detection through ELM (Extreme Learning Machine)
Ahmad, Iftikhar
Alsemmeari, Rayan Atteah
CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (02): : 1097 - 1111
[33] Improving Intrusion Detection Confidence Through a Moving Target Defense Strategy
dos Santos, Roger R.
Viegas, Eduardo K.
Santin, Altair O.
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[34] Improving the Reliability of Network Intrusion Detection Systems Through Dataset Integration
Magan-Carrion, Roberto
Urda, Daniel
Diaz-Cano, Ignacio
Dorronsoro, Bernabe
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (04) : 1717 - 1732
[35] Improving intrusion detection radar
Foley, E
Harman, K
Cheal, J
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2002, 17 (08) : 22 - 27
[36] A STOCHASTIC APPROXIMATION APPROACH FOR IMPROVING INTRUSION DETECTION DATA FUSION STRUCTURES
Manousakis, K.
Sterne, D.
Ivanic, N.
Lawler, G.
McAuley, A.
2008 IEEE MILITARY COMMUNICATIONS CONFERENCE: MILCOM 2008, VOLS 1-7, 2008, : 959 - 965
[37] CAN Intrusion Detection System Based on Data Augmentation and Improved Bi-LSTM
Zhao, Haihang
Cheng, Anyu
Wang, Yi
Wang, Shanshan
Wang, Hongrong
2024 IEEE THE 20TH ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS 2024, 2024, : 198 - 202
[38] VAE-WACGAN: An Improved Data Augmentation Method Based on VAEGAN for Intrusion Detection
Tian, Wuxin
Shen, Yanping
Guo, Na
Yuan, Jing
Yang, Yanqing
SENSORS, 2024, 24 (18)
[39] Improving Social Bot Detection Through Aid and Training
Kenny, Ryan
Fischhoff, Baruch
Davis, Alex
Canfield, Casey
HUMAN FACTORS, 2024, 66 (10) : 2323 - 2344
[40] Improving DRS-to-Text Generation Through Delexicalization and Data Augmentation
Amin, Muhammad Saad
Anselma, Luca
Mazzei, Alessandro
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 121 - 136

← 1 2 3 4 5 →