Heuristic-based automatic pruning of deep neural networks

被引:0
|
作者
Tejalal Choudhary
Vipul Mishra
Anurag Goswami
Jagannathan Sarangapani
机构
[1] Bennett University,
[2] Missouri University of Science and Technology,undefined
来源
关键词
Deep neural network; Efficient inference; Convolutional neural network; Model compression and acceleration; Filter pruning;
D O I
暂无
中图分类号
学科分类号
摘要
The performance of a deep neural network (deep NN) is dependent upon a significant number of weight parameters that need to be trained which is a computational bottleneck. The growing trend of deeper architectures poses a restriction on the training and inference scheme on resource-constrained devices. Pruning is an important method for removing the deep NN’s unimportant parameters and making their deployment easier on resource-constrained devices for practical applications. In this paper, we proposed a heuristics-based novel filter pruning method to automatically identify and prune the unimportant filters and make the inference process faster on devices with limited resource availability. The selection of the unimportant filters is made by a novel pruning estimator (γ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma$$\end{document}). The proposed method is tested on various convolutional architectures AlexNet, VGG16, ResNet34, and datasets CIFAR10, CIFAR100, and ImageNet. The experimental results on a large-scale ImageNet dataset show that the FLOPs of the VGG16 can be reduced up to 77.47%, achieving ≈5x\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\approx ~5x$$\end{document} inference speedup. The FLOPs of a more popular ResNet34 model are reduced by 41.94% while retaining competitive performance compared to other state-of-the-art methods.
引用
收藏
页码:4889 / 4903
页数:14
相关论文
共 50 条
  • [21] EDropout: Energy-Based Dropout and Pruning of Deep Neural Networks
    Salehinejad, Hojjat
    Valaee, Shahrokh
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5279 - 5292
  • [22] Neuroplasticity-Based Pruning Method for Deep Convolutional Neural Networks
    Camacho, Jose David
    Villasenor, Carlos
    Lopez-Franco, Carlos
    Arana-Daniel, Nancy
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [23] SFP: Similarity-based filter pruning for deep neural networks
    Li, Guoqing
    Li, Rengang
    Li, Tuo
    Shen, Chaoyao
    Zou, Xiaofeng
    Wang, Jiuyang
    Wang, Changhong
    Li, Nanjun
    [J]. INFORMATION SCIENCES, 2025, 689
  • [24] Multi-label classifier for protein sequence using heuristic-based deep convolution neural network
    Vikas Chauhan
    Aruna Tiwari
    Niranjan Joshi
    Sahaj Khandelwal
    [J]. Applied Intelligence, 2022, 52 : 2820 - 2837
  • [26] Automatic crop disease recognition by improved abnormality segmentation along with heuristic-based concatenated deep learning model
    Farooqui, Nafees Akhter
    Mishra, Amit Kumar
    Mehra, Ritika
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2022, 16 (02): : 407 - 429
  • [27] Analysis of Heuristic-based MAC protocols for ad hoc Networks
    Oliveira, Rodolfo
    Bernardo, Luis
    Luis, Miguel
    [J]. 2011 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2011, : 191 - 196
  • [28] EACP: An effective automatic channel pruning for neural networks
    Liu, Yajun
    Wu, Dakui
    Zhou, Wenju
    Fan, Kefeng
    Zhou, Zhiheng
    [J]. NEUROCOMPUTING, 2023, 526 : 131 - 142
  • [29] A Heuristic-Based Wormhole Routing Algorithm for Hypercube Multicomputer Networks
    Mostafa I. Abd-El-Barr
    Mohammad M. Nadeem
    Khalid Al-Tawil
    [J]. Cluster Computing, 2001, 4 (3) : 253 - 262
  • [30] Generalized Gradient Flow Based Saliency for Pruning Deep Convolutional Neural Networks
    Xinyu Liu
    Baopu Li
    Zhen Chen
    Yixuan Yuan
    [J]. International Journal of Computer Vision, 2023, 131 : 3121 - 3135