Heuristic-based automatic pruning of deep neural networks

被引:8
|
作者
Choudhary, Tejalal [1 ]
Mishra, Vipul [1 ]
Goswami, Anurag [1 ]
Sarangapani, Jagannathan [2 ]
机构
[1] Bennett Univ, Greater Noida 201310, India
[2] Missouri Univ Sci & Technol, Rolla, MO 65409 USA
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 06期
关键词
Deep neural network; Efficient inference; Convolutional neural network; Model compression and acceleration; Filter pruning;
D O I
10.1007/s00521-021-06679-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of a deep neural network (deep NN) is dependent upon a significant number of weight parameters that need to be trained which is a computational bottleneck. The growing trend of deeper architectures poses a restriction on the training and inference scheme on resource-constrained devices. Pruning is an important method for removing the deep NN's unimportant parameters and making their deployment easier on resource-constrained devices for practical applications. In this paper, we proposed a heuristics-based novel filter pruning method to automatically identify and prune the unimportant filters and make the inference process faster on devices with limited resource availability. The selection of the unimportant filters is made by a novel pruning estimator (c). The proposed method is tested on various convolutional architectures AlexNet, VGG16, ResNet34, and datasets CIFAR10, CIFAR100, and ImageNet. The experimental results on a large-scale ImageNet dataset show that the FLOPs of the VGG16 can be reduced up to 77.47%, achieving approximate to 5x inference speedup. The FLOPs of a more popular ResNet34 model are reduced by 41.94% while retaining competitive performance compared to other state-of-the-art methods.
引用
收藏
页码:4889 / 4903
页数:15
相关论文
共 50 条
  • [31] Automatic Creation of Heuristic-Based Truck Movement Paths for Construction Equipment Control
    Kim, Sung-Keun
    Jang, Jung-Woo
    Na, Wongi S.
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [32] Generalized Gradient Flow Based Saliency for Pruning Deep Convolutional Neural Networks
    Xinyu Liu
    Baopu Li
    Zhen Chen
    Yixuan Yuan
    [J]. International Journal of Computer Vision, 2023, 131 : 3121 - 3135
  • [33] Activation-Based Weight Significance Criterion for Pruning Deep Neural Networks
    Dong, Jiayu
    Zheng, Huicheng
    Lian, Lina
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT II, 2017, 10667 : 62 - 73
  • [34] Generalized Gradient Flow Based Saliency for Pruning Deep Convolutional Neural Networks
    Liu, Xinyu
    Li, Baopu
    Chen, Zhen
    Yuan, Yixuan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (12) : 3121 - 3135
  • [35] A FRAMEWORK FOR PRUNING DEEP NEURAL NETWORKS USING ENERGY-BASED MODELS
    Salehinejad, Hojjat
    Valaee, Shahrokh
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3920 - 3924
  • [36] An optimal-score-based filter pruning for deep convolutional neural networks
    Sawant, Shrutika S.
    Bauer, J.
    Erick, F. X.
    Ingaleshwar, Subodh
    Holzer, N.
    Ramming, A.
    Lang, E. W.
    Goetz, Th
    [J]. APPLIED INTELLIGENCE, 2022, 52 (15) : 17557 - 17579
  • [37] An optimal-score-based filter pruning for deep convolutional neural networks
    Shrutika S. Sawant
    J. Bauer
    F. X. Erick
    Subodh Ingaleshwar
    N. Holzer
    A. Ramming
    E. W. Lang
    Th. Götz
    [J]. Applied Intelligence, 2022, 52 : 17557 - 17579
  • [38] Task dependent deep LDA pruning of neural networks
    Tian, Qing
    Arbel, Tal
    Clark, James J.
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 203
  • [39] Anonymous Model Pruning for Compressing Deep Neural Networks
    Zhang, Lechun
    Chen, Guangyao
    Shi, Yemin
    Zhang, Quan
    Tan, Mingkui
    Wang, Yaowei
    Tian, Yonghong
    Huang, Tiejun
    [J]. THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 161 - 164
  • [40] A New Pruning Method to Train Deep Neural Networks
    Guo, Haonan
    Ren, Xudie
    Li, Shenghong
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 767 - 775