Data Augmentation with Generative Models for Improved Malware Detection: A Comparative Study

Citations: 0
Authors
Burks, Roland, III [1]
Islam, Kazi Aminul [2]
Li, Jiang [2]
Lu, Yan [3]
Affiliations
[1] Samford Univ, Dept MCS, Birmingham, AL 35229 USA
[2] Old Dominion Univ, Dept ECE, Norfolk, VA USA
[3] Old Dominion Univ, Dept CMSE, Norfolk, VA USA
Keywords
Variational Autoencoders; Generative Adversarial Networks; Deep Residual Networks; Deep Learning; CNN
DOI
10.1109/uemcon47517.2019.8993085
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Generative models are well suited to producing artificial data. Two of the most popular and promising are the Generative Adversarial Network (GAN) and the Variational Autoencoder (VAE). Both can play a critical role in classification problems by generating synthetic data that lets a classifier be trained more accurately. Malware detection is the process of determining whether software on a host system is malicious and, if so, diagnosing the type of attack. Without an adequate amount of training data, malware detection becomes less efficient. In this paper, we compare the two generative models as sources of synthetic training data for boosting a Residual Network (ResNet-18) classifier for malware detection. Experimental results show that adding synthetic malware samples generated by the VAE to the training data improved the accuracy of ResNet-18 by 2%, compared with 6% for the GAN.
Pages: 660-665
Page count: 6