Automatic Channel Pruning with Hyper-parameter Search and Dynamic Masking

被引:3
|
作者
Li, Baopu [1 ]
Fan, Yanwen [2 ]
Pan, Zhihong [1 ]
Bian, Yuchen [3 ]
Zhang, Gang [2 ]
机构
[1] Baidu USA LLC, Sunnyvale, CA 94089 USA
[2] Baidu Inc, VIS Dept, Beijing, Peoples R China
[3] Baidu Res, Beijing, Peoples R China
关键词
Model compression; Network pruning; Auto ML;
D O I
10.1145/3474085.3475370
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern deep neural network models tend to be large and computationally intensive. One typical solution to this issue is model pruning. However, most current model pruning algorithms depend on hand crafted rules or need to input the pruning ratio beforehand. To overcome this problem, we propose a learning based automatic channel pruning algorithm for deep neural network, which is inspired by recent automatic machine learning (Auto ML). A two objectives' pruning problem that aims for the weights and the remaining channels for each layer is first formulated. An alternative optimization approach is then proposed to derive the channel numbers and weights simultaneously. In the process of pruning, we utilize a searchable hyper-parameter, remaining ratio, to denote the number of channels in each convolution layer, and then a dynamic masking process is proposed to describe the corresponding channel evolution. To adjust the trade-off between accuracy of a model and the pruning ratio of floating point operations, a new loss function is further introduced. Extensive experimental results on benchmark datasets demonstrate that our scheme achieves competitive results for neural network pruning.
引用
收藏
页码:2121 / 2129
页数:9
相关论文
共 50 条
  • [21] Hyper-parameter Optimization for Latent Spaces
    Veloso, Bruno
    Caroprese, Luciano
    Konig, Matthias
    Teixeira, Sonia
    Manco, Giuseppe
    Hoos, Holger H.
    Gama, Joao
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 249 - 264
  • [22] Federated learning with hyper-parameter optimization
    Kundroo, Majid
    Kim, Taehong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [23] Ensemble Adaptation Networks with low-cost unsupervised hyper-parameter search
    Haotian Zhang
    Shifei Ding
    Weikuan Jia
    Pattern Analysis and Applications, 2020, 23 : 1215 - 1224
  • [24] Revisiting Hyper-Parameter Tuning for Search-Based Test Data Generation
    Zamani, Shayan
    Hemmati, Hadi
    SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2019, 2019, 11664 : 137 - 152
  • [25] Research about pruning hyper-parameter optimization method based on transfer learning in geographic information system
    Zhang X.
    Li Y.
    Li Z.
    Arabian Journal of Geosciences, 2021, 14 (5)
  • [26] Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion
    Hu, Yi-Qi
    Yu, Yang
    Tu, Wei-Wei
    Yang, Qiang
    Chen, Yuqiang
    Dai, Wenyuan
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3846 - 3853
  • [27] ULTRASOUND IMAGING LV TRACKING WITH ADAPTIVE WINDOW SIZE AND AUTOMATIC HYPER-PARAMETER ESTIMATION
    Nascimento, Jacinto
    Sanches, Joao
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 553 - 556
  • [28] A pragmatic approach for hyper-parameter tuning in search-based test case generation
    Zamani, Shayan
    Hemmati, Hadi
    EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (06)
  • [29] AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
    Yin, Yichun
    Chen, Cheng
    Shang, Lifeng
    Jiang, Xin
    Chen, Xiao
    Liu, Qun
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5146 - 5157
  • [30] A pragmatic approach for hyper-parameter tuning in search-based test case generation
    Shayan Zamani
    Hadi Hemmati
    Empirical Software Engineering, 2021, 26