Data Augmentation with Generative Models for Improved Malware Detection: A Comparative Study

被引：0

作者：

Burks, Roland, III ^{[1
]}

Islam, Kazi Aminul ^{[2
]}

Li, Jiang ^{[2
]}

Lu, Yan ^{[3
]}

机构：

[1] Samford Univ, Dept MCS, Birmingham, AL 35229 USA

[2] Old Dominion Univ, Dept ECE, Norfolk, VA USA

[3] Old Dominion Univ, Dept CMSE, Norfolk, VA USA

来源：

2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON) | 2019年

关键词：

Variational Autoencoders; Generative Adversarial Networks; Deep Residual Networks; Deep Learning; CNN;

D O I：

10.1109/uemcon47517.2019.8993085

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generative Models have been very accommodating when it comes to generating artificial data. Two of the most popular and promising models are the Generative Adversarial Network (GAN) and Variational Autoencoder (VAE) models. They both play critical roles in classification problems by generating synthetic data to train classifier more accurately. Malware detection is the process of determining whether or not software is malicious on the host's system and diagnosing what type of attack it is. Without adequate amount of training data, it makes malware detection less efficient. In this paper, we compare the two generative models to generate synthetic training data to boost the Residual Network (ResNet-18) classifier for malware detection. Experiment results show that adding synthetic malware samples generated by VAE to the training data improved the accuracy of ResNet-18 by 2% as it compared to 6% by GAN.

引用

页码：660 / 665

页数：6

共 50 条

[1] Easy Data Augmentation for Improved Malware Detection: A Comparative Study
Bae, Jangseong
Lee, Changki
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 214 - 218
[2] Using Generative Adversarial Networks for Data Augmentation in Android Malware Detection
Chen, Yi-Ming
Yang, Chun-Hsien
Chen, Guo-Chung
2021 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2021,
[3] Data augmentation using generative models for track intrusion detection
Lee, Soohyung
Kim, Beomseong
Lee, Heesung
SCIENCE PROGRESS, 2023, 106 (04)
[4] Data Augmentation with Improved Generative Adversarial Networks
Shi, Hongjiang
Wang, Lu
Ding, Guangtai
Yang, Fenglei
Li, Xiaoqiang
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 73 - 78
[5] Data Augmentation for Opcode Sequence Based Malware Detection
McLaughlin, Niall
del Rincon, Jesus Martinez
2022 CYBER RESEARCH CONFERENCE - IRELAND (CYBER-RCI), 2022, : 28 - 35
[6] Marvolo: Programmatic Data Augmentation for Deep Malware Detection
Wong, Mike
Raff, Edward
Holt, James
Netravali, Ravi
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 270 - 285
[7] FabricGAN: an enhanced generative adversarial network for data augmentation and improved fabric defect detection
Xu, Yiqin
Zhi, Chao
Wang, Shuai
Chen, Jianglong
Sun, Runjun
Dong, Zijing
Yu, Lingjie
TEXTILE RESEARCH JOURNAL, 2024, 94 (15-16) : 1771 - 1785
[8] A Comparative Study of Engraved-Digit Data Augmentation by Generative Adversarial Networks
Abdulraheem, Abdulkabir
Jung, Im Y.
SUSTAINABILITY, 2022, 14 (19)
[9] Data augmentation with generative models improves detection of Non-B DNA structures
Cherednichenko, Oleksandr
Poptsova, Maria
Computers in Biology and Medicine, 2025, 184
[10] Generative Malware Outbreak Detection
Park, Sean
Gondal, Iqbal
Kamruzzaman, Joarder
Oliver, Jon
2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2019, : 1149 - 1154

← 1 2 3 4 5 →