Deep Adversarial Data Augmentation for Extremely Low Data Regimes

被引：38

作者：

Zhang, Xiaofeng ^{[1
,2
]}

Wang, Zhangyang ^{[3
]}

Liu, Dong ^{[2
]}

Lin, Qifeng ^{[1
]}

Ling, Qing ^{[1
,4
]}

机构：

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China

[3] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA

[4] Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510006, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2021年 / 31卷 / 01期

关键词：

Classification; extremely low data regime; GAN; data augmentation; object detection;

D O I：

10.1109/TCSVT.2020.2967419

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep learning has revolutionized the performance of classification and object detection, but meanwhile demands sufficient labeled data for training. Given insufficient data, while many techniques have been developed to help combat overfitting, the challenge remains if one tries to train deep networks, especially in the ill-posed extremely low data regimes: only a small set of labeled data are available, and nothing - including unlabeled data - else. Such regimes arise from practical situations where not only data labeling but also data collection itself is expensive. We propose a deep adversarial data augmentation (DADA) technique to address the problem, in which we elaborately formulate data augmentation as a problem of training a class-conditional and supervised generative adversarial network (GAN). Specifically, a new discriminator loss is proposed to fit the goal of data augmentation, through which both real and augmented samples are enforced to contribute to and be consistent in finding the decision boundaries. Tailored training techniques are developed accordingly. To quantitatively validate its effectiveness, we first perform extensive simulations to show that DADA substantially outperforms both traditional data augmentation and a few GAN-based options. We then extend experiments to three real-world small labeled classification datasets where existing data augmentation and/or transfer learning strategies are either less effective or infeasible. We also demonstrate that DADA to can be extended to the detection task. We improve the pedestrian synthesis work by substitute for our discriminator and training scheme. Validation experiment shows that DADA can improve the detection mean average precision (mAP) compared with some traditional data augmentation techniques in object detection. Source code is available at https://github.com/SchafferZhang/DADA.

引用

页码：15 / 28

页数：14

共 50 条

[1] DADA: DEEP ADVERSARIAL DATA AUGMENTATION FOR EXTREMELY LOW DATA REGIME CLASSIFICATION
Zhang, Xiaofeng
Wang, Zhangyang
Liu, Dong
Ling, Qing
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2807 - 2811
[2] Data augmentation and pre-trained networks for extremely low data regimes unsupervised visual inspection
Gutierrez, Pierre
Cordier, Antoine
Caldeira, Thais
Sautory, Theophile
[J]. AUTOMATED VISUAL INSPECTION AND MACHINE VISION IV, 2021, 11787
[3] Deep Adversarial Data Augmentation for Fabric Defect Classification With Scarce Defect Data
Lu, Bingyu
Zhang, Meng
Huang, Biqing
[J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[4] Deep Adversarial Data Augmentation for Fabric Defect Classification With Scarce Defect Data
Lu, Bingyu
Zhang, Meng
Huang, Biqing
[J]. IEEE Transactions on Instrumentation and Measurement, 2022, 71
[5] Evasion Generative Adversarial Network for Low Data Regimes
Randhawa R.H.
Aslam N.
Alauthman M.
Rafiq H.
[J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1076 - 1088
[6] A deep data augmentation framework based on generative adversarial networks
Qiping Wang
Ling Luo
Haoran Xie
Yanghui Rao
Raymond Y.K. Lau
Detian Zhang
[J]. Multimedia Tools and Applications, 2022, 81 : 42871 - 42887
[7] A deep data augmentation framework based on generative adversarial networks
Wang, Qiping
Luo, Ling
Xie, Haoran
Rao, Yanghui
Lau, Raymond Y. K.
Zhang, Detian
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42871 - 42887
[8] EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation
Li, Wei
Chen, Jinlin
Cao, Jiannong
Ma, Chao
Wang, Jia
Cui, Xiaohui
Chen, Ping
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3208 - 3218
[9] Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes
Zhang, Yun
Wang, Ling
Wang, Xinqiao
Zhang, Chengyun
Ge, Jiamin
Tang, Jing
Su, An
Duan, Hongliang
[J]. ORGANIC CHEMISTRY FRONTIERS, 2021, 8 (07) : 1415 - 1423
[10] Rethinking data augmentation for adversarial robustness
Eghbal-zadeh, Hamid
Zellinger, Werner
Pintor, Maura
Grosse, Kathrin
Koutini, Khaled
Moser, Bernhard A.
Biggio, Battista
Widmer, Gerhard
[J]. INFORMATION SCIENCES, 2024, 654

← 1 2 3 4 5 →