An efficient pruning scheme of deep neural networks for Internet of Things applications

Cited by: 6
Authors
Qi, Chen [1]
Shen, Shibo [1]
Li, Rongpeng [1]
Zhao, Zhifeng [2]
Liu, Qing [3]
Liang, Jing [3]
Zhang, Honggang [1]
Affiliations
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
[3] Huawei Technol Co Ltd, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep neural networks; Deep learning; Internet of Things; Resource-limited edge computing; Pruning; Efficiency; IOT;
DOI
10.1186/s13634-021-00744-4
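Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract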
Deep neural networks (DNNs) have been rapidly deployed to realize functionalities such as sensing, imaging, classification, and recognition. However, their computationally intensive nature makes them difficult to apply on resource-limited Internet of Things (IoT) devices. In this paper, we propose a novel pruning-based paradigm that aims to reduce the computational cost of DNNs by uncovering a more compact structure and learning the effective weights therein, without compromising the expressive capability of DNNs. In particular, our algorithm achieves efficient end-to-end training that directly transforms a redundant neural network into a compact one with a specifically targeted compression rate. We comprehensively evaluate our approach on various representative benchmark datasets and compare it with typical advanced convolutional neural network (CNN) architectures. The experimental results verify the superior performance and robust effectiveness of our scheme. For example, when pruning VGG on CIFAR-10, our proposed scheme reduces its FLOPs (floating-point operations) and number of parameters by 76.2% and 94.1%, respectively, while still maintaining satisfactory accuracy. In summary, our scheme could facilitate the integration of DNNs into the common machine-learning-based IoT framework and the establishment of distributed training of neural networks in both cloud and edge.
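To make the idea of pruning to a targeted compression rate concrete, here is a minimal sketch. The abstract does not state the pruning criterion or the training procedure, so this is not the authors' scheme: it assumes plain global magnitude pruning, a common baseline, applied once to fixed weights. The function names `prune_to_target` and `reduction_factor` are hypothetical.

```python
import numpy as np


def prune_to_target(weights, target_rate):
    """Zero the smallest-magnitude entries across all layers so that
    roughly `target_rate` of the weights are removed.

    Assumption: global magnitude pruning, NOT the paper's method.
    """
    flat = np.concatenate([np.abs(w).ravel() for w in weights])
    k = int(round(target_rate * flat.size))  # number of weights to remove
    if k == 0:
        return [w.copy() for w in weights]
    # The k-th smallest magnitude serves as the global threshold.
    threshold = np.partition(flat, k - 1)[k - 1]
    # Keep only entries strictly above the threshold; zero the rest.
    return [np.where(np.abs(w) > threshold, w, 0.0) for w in weights]


def reduction_factor(removed_fraction):
    """Convert a removed fraction (e.g. 0.941) into a compression factor."""
    return 1.0 / (1.0 - removed_fraction)
```

Under this reading, the figures quoted in the abstract correspond to roughly a 4.2x reduction in FLOPs (`reduction_factor(0.762)`) and roughly a 17x reduction in parameter count (`reduction_factor(0.941)`).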
Pages: 21
Related papers
50 in total
  • [1] An efficient pruning scheme of deep neural networks for Internet of Things applications
    Chen Qi
    Shibo Shen
    Rongpeng Li
    Zhifeng Zhao
    Qing Liu
    Jing Liang
    Honggang Zhang
    [J]. EURASIP Journal on Advances in Signal Processing, 2021
  • [2] Compacting Deep Neural Networks for Internet of Things: Methods and Applications
    Zhang, Ke
    Ying, Hanbo
    Dai, Hong-Ning
    Li, Lin
    Peng, Yuanyuan
    Guo, Keyi
    Yu, Hongfang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15): : 11935 - 11959
  • [3] Trained Rank Pruning for Efficient Deep Neural Networks
    Xu, Yuhui
    Li, Yuxi
    Zhang, Shuai
    Wen, Wei
    Wang, Botao
    Dai, Wenrui
    Qi, Yingyong
    Chen, Yiran
    Lin, Weiyao
    Xiong, Hongkai
    [J]. FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 14 - 17
  • [4] Holistic Filter Pruning for Efficient Deep Neural Networks
    Enderich, Lukas
    Timm, Fabian
    Burgard, Wolfram
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2595 - 2604
  • [5] Energy Efficient Data Transmission Scheme for Internet of Things Applications
    Banerjee, Puja
    Barnwal, Rajesh P.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2022, 81 (06): : 633 - 642
  • [6] TRP: Trained Rank Pruning for Efficient Deep Neural Networks
    Xu, Yuhui
    Li, Yuxi
    Zhang, Shuai
    Wen, Wei
    Wang, Botao
    Qi, Yingyong
    Chen, Yiran
    Lin, Weiyao
    Xiong, Hongkai
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 977 - 983
  • [7] Distributed Deep Convolutional Neural Networks for the Internet-of-Things
    Disabato, Simone
    Roveri, Manuel
    Alippi, Cesare
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (08) : 1239 - 1252
  • [8] Internet of Things (IoT) Discovery Using Deep Neural Networks
    Lo, Ephraim
    Kohl, JoHannah
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 795 - 803
  • [9] Efficient and Verifiable Encryption Scheme in Lightweight Narrowband Internet of Things Applications
    Qian, Hanjia
    Wang, Yihuai
    Peng, Tao
    Chen, Cheng
    Luo, Xizhao
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (05): : 1112 - 1122
  • [10] Efficient Distributed Inference of Deep Neural Networks via Restructuring and Pruning
    Abdi, Afshin
    Rashidi, Saeed
    Fekri, Faramarz
    Krishna, Tushar
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6640 - 6648