Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study

被引:2
|
作者
Yu, Muzhou [1 ]
Zhang, Linfeng [2 ]
Ma, Kaisheng [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
data augmentation; model compression;
D O I
10.1109/IJCNN54540.2023.10191736
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The excellent performance of deep neural networks is usually accompanied by a large number of parameters and computations, which have limited their usage on the resource-limited edge devices. To address this issue, abundant methods such as pruning, quantization and knowledge distillation have been proposed to compress neural networks and achieved significant breakthroughs. However, most of these compression methods focus on the architecture or the training method of neural networks but ignore the influence from data augmentation. In this paper, we revisit the usage of data augmentation in model compression and give a comprehensive study on the relation between model sizes and their optimal data augmentation policy. To sum up, we mainly have the following three observations: (A) Models in different sizes prefer data augmentation with different magnitudes. Hence, in iterative pruning, data augmentation with varying magnitudes leads to better performance than data augmentation with a consistent magnitude. (B) Data augmentation with a high magnitude may significantly improve the performance of large models but harm the performance of small models. Fortunately, small models can still benefit from strong data augmentations by firstly learning them with "additional parameters" and then discard these "additional parameters" during inference. (C) The prediction of a pre-trained large model can be utilized to measure the difficulty of data augmentation. Thus it can be utilized as a criterion to design better data augmentation policies. We hope this paper may promote more research on the usage of data augmentation in model compression.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Revisiting data augmentation for subspace clustering
    Abdolali, Maryam
    Gillis, Nicolas
    Knowledge-Based Systems, 2022, 258
  • [2] Revisiting data augmentation for subspace clustering
    Abdolali, Maryam
    Gillis, Nicolas
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [3] An Empirical Study on Comprehensive Model of Brand Communication
    Sha, Zhenquan
    Long, Chengzhi
    Wang, Xiaoyu
    EIGHTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, 2009, : 629 - 634
  • [4] A comprehensive survey for generative data augmentation
    Chen, Yunhao
    Yan, Zihui
    Zhu, Yunjie
    NEUROCOMPUTING, 2024, 600
  • [5] An Empirical Study on Learning Models and Data Augmentation for IoT Anomaly Detection
    Khorasgani, Alireza Toghiani
    Shirani, Paria
    Majumdar, Suryadipta
    2024 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY, CNS 2024, 2024,
  • [6] Face Recognition Fairness Assessment based on Data Augmentation: An Empirical Study
    Tian, Fangyuan
    Liu, Wenhong
    Zhao, Shuang
    Liu, Jiawei
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 315 - 318
  • [7] Data Augmentation for Building Footprint Segmentation in SAR Images: An Empirical Study
    Wangiyana, Sandhi
    Samczynski, Piotr
    Gromek, Artur
    REMOTE SENSING, 2022, 14 (09)
  • [8] Investigating the effectiveness of data augmentation from similarity and diversity: An empirical study
    Yang, Suorong
    Guo, Suhan
    Zhao, Jian
    Shen, Furao
    PATTERN RECOGNITION, 2024, 148
  • [9] A Comprehensive Empirical Study of Data Privacy, Trust, and Consumer Autonomy
    Kesan, Jay P.
    Hayes, Carol M.
    Bashir, Masooda N.
    INDIANA LAW JOURNAL, 2016, 91 (02) : 267 - 352
  • [10] Empirical Evaluation of Variational Autoencoders for Data Augmentation
    Jorge, Javier
    Vieco, Jesus
    Paredes, Roberto
    Andreu Sanchez, Joan
    Miguel Benedi, Jose
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 96 - 104