Deep Adversarial Data Augmentation for Extremely Low Data Regimes

被引:38
|
作者
Zhang, Xiaofeng [1 ,2 ]
Wang, Zhangyang [3 ]
Liu, Dong [2 ]
Lin, Qifeng [1 ]
Ling, Qing [1 ,4 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China
[3] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[4] Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510006, Peoples R China
关键词
Classification; extremely low data regime; GAN; data augmentation; object detection;
D O I
10.1109/TCSVT.2020.2967419
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning has revolutionized the performance of classification and object detection, but meanwhile demands sufficient labeled data for training. Given insufficient data, while many techniques have been developed to help combat overfitting, the challenge remains if one tries to train deep networks, especially in the ill-posed extremely low data regimes: only a small set of labeled data are available, and nothing - including unlabeled data - else. Such regimes arise from practical situations where not only data labeling but also data collection itself is expensive. We propose a deep adversarial data augmentation (DADA) technique to address the problem, in which we elaborately formulate data augmentation as a problem of training a class-conditional and supervised generative adversarial network (GAN). Specifically, a new discriminator loss is proposed to fit the goal of data augmentation, through which both real and augmented samples are enforced to contribute to and be consistent in finding the decision boundaries. Tailored training techniques are developed accordingly. To quantitatively validate its effectiveness, we first perform extensive simulations to show that DADA substantially outperforms both traditional data augmentation and a few GAN-based options. We then extend experiments to three real-world small labeled classification datasets where existing data augmentation and/or transfer learning strategies are either less effective or infeasible. We also demonstrate that DADA to can be extended to the detection task. We improve the pedestrian synthesis work by substitute for our discriminator and training scheme. Validation experiment shows that DADA can improve the detection mean average precision (mAP) compared with some traditional data augmentation techniques in object detection. Source code is available at https://github.com/SchafferZhang/DADA.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 50 条
  • [1] DADA: DEEP ADVERSARIAL DATA AUGMENTATION FOR EXTREMELY LOW DATA REGIME CLASSIFICATION
    Zhang, Xiaofeng
    Wang, Zhangyang
    Liu, Dong
    Ling, Qing
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2807 - 2811
  • [2] Data augmentation and pre-trained networks for extremely low data regimes unsupervised visual inspection
    Gutierrez, Pierre
    Cordier, Antoine
    Caldeira, Thais
    Sautory, Theophile
    [J]. AUTOMATED VISUAL INSPECTION AND MACHINE VISION IV, 2021, 11787
  • [3] Deep Adversarial Data Augmentation for Fabric Defect Classification With Scarce Defect Data
    Lu, Bingyu
    Zhang, Meng
    Huang, Biqing
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [4] Deep Adversarial Data Augmentation for Fabric Defect Classification With Scarce Defect Data
    Lu, Bingyu
    Zhang, Meng
    Huang, Biqing
    [J]. IEEE Transactions on Instrumentation and Measurement, 2022, 71
  • [5] Evasion Generative Adversarial Network for Low Data Regimes
    Randhawa R.H.
    Aslam N.
    Alauthman M.
    Rafiq H.
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1076 - 1088
  • [6] A deep data augmentation framework based on generative adversarial networks
    Qiping Wang
    Ling Luo
    Haoran Xie
    Yanghui Rao
    Raymond Y.K. Lau
    Detian Zhang
    [J]. Multimedia Tools and Applications, 2022, 81 : 42871 - 42887
  • [7] A deep data augmentation framework based on generative adversarial networks
    Wang, Qiping
    Luo, Ling
    Xie, Haoran
    Rao, Yanghui
    Lau, Raymond Y. K.
    Zhang, Detian
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42871 - 42887
  • [8] EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation
    Li, Wei
    Chen, Jinlin
    Cao, Jiannong
    Ma, Chao
    Wang, Jia
    Cui, Xiaohui
    Chen, Ping
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3208 - 3218
  • [9] Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes
    Zhang, Yun
    Wang, Ling
    Wang, Xinqiao
    Zhang, Chengyun
    Ge, Jiamin
    Tang, Jing
    Su, An
    Duan, Hongliang
    [J]. ORGANIC CHEMISTRY FRONTIERS, 2021, 8 (07) : 1415 - 1423
  • [10] Rethinking data augmentation for adversarial robustness
    Eghbal-zadeh, Hamid
    Zellinger, Werner
    Pintor, Maura
    Grosse, Kathrin
    Koutini, Khaled
    Moser, Bernhard A.
    Biggio, Battista
    Widmer, Gerhard
    [J]. INFORMATION SCIENCES, 2024, 654