An efficient pruning scheme of deep neural networks for Internet of Things applications

被引：6

作者：

Qi, Chen ^{[1
]}

Shen, Shibo ^{[1
]}

Li, Rongpeng ^{[1
]}

Zhao, Zhifeng ^{[2
]}

Liu, Qing ^{[3
]}

Liang, Jing ^{[3
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China

[2] Zhejiang Lab, Hangzhou, Peoples R China

[3] Huawei Technol Co Ltd, Shanghai, Peoples R China

来源：

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2021年 / 2021卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Deep neural networks; Deep learning; Internet of Things; Resource-limited edge computing; Pruning; Efficiency; IOT;

D O I：

10.1186/s13634-021-00744-4

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Nowadays, deep neural networks (DNNs) have been rapidly deployed to realize a number of functionalities like sensing, imaging, classification, recognition, etc. However, the computational-intensive requirement of DNNs makes it difficult to be applicable for resource-limited Internet of Things (IoT) devices. In this paper, we propose a novel pruning-based paradigm that aims to reduce the computational cost of DNNs, by uncovering a more compact structure and learning the effective weights therein, on the basis of not compromising the expressive capability of DNNs. In particular, our algorithm can achieve efficient end-to-end training that transfers a redundant neural network to a compact one with a specifically targeted compression rate directly. We comprehensively evaluate our approach on various representative benchmark datasets and compared with typical advanced convolutional neural network (CNN) architectures. The experimental results verify the superior performance and robust effectiveness of our scheme. For example, when pruning VGG on CIFAR-10, our proposed scheme is able to significantly reduce its FLOPs (floating-point operations) and number of parameters with a proportion of 76.2% and 94.1%, respectively, while still maintaining a satisfactory accuracy. To sum up, our scheme could facilitate the integration of DNNs into the common machine-learning-based IoT framework and establish distributed training of neural networks in both cloud and edge.

引用

页数：21

共 50 条

[1] An efficient pruning scheme of deep neural networks for Internet of Things applications
Chen Qi
Shibo Shen
Rongpeng Li
Zhifeng Zhao
Qing Liu
Jing Liang
Honggang Zhang
[J]. EURASIP Journal on Advances in Signal Processing, 2021
[2] Compacting Deep Neural Networks for Internet of Things: Methods and Applications
Zhang, Ke
Ying, Hanbo
Dai, Hong-Ning
Li, Lin
Peng, Yuanyuan
Guo, Keyi
Yu, Hongfang
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15): : 11935 - 11959
[3] Trained Rank Pruning for Efficient Deep Neural Networks
Xu, Yuhui
Li, Yuxi
Zhang, Shuai
Wen, Wei
Wang, Botao
Dai, Wenrui
Qi, Yingyong
Chen, Yiran
Lin, Weiyao
Xiong, Hongkai
[J]. FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 14 - 17
[4] Holistic Filter Pruning for Efficient Deep Neural Networks
Enderich, Lukas
Timm, Fabian
Burgard, Wolfram
[J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2595 - 2604
[5] Energy Efficient Data Transmission Scheme for Internet of Things Applications
Banerjee, Puja
Barnwal, Rajesh P.
[J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2022, 81 (06): : 633 - 642
[6] TRP: Trained Rank Pruning for Efficient Deep Neural Networks
Xu, Yuhui
Li, Yuxi
Zhang, Shuai
Wen, Wei
Wang, Botao
Qi, Yingyong
Chen, Yiran
Lin, Weiyao
Xiong, Hongkai
[J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 977 - 983
[7] Distributed Deep Convolutional Neural Networks for the Internet-of-Things
Disabato, Simone
Roveri, Manuel
Alippi, Cesare
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (08) : 1239 - 1252
[8] Internet of Things (IoT) Discovery Using Deep Neural Networks
Lo, Ephraim
Kohl, JoHannah
[J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 795 - 803
[9] Efficient and Verifiable Encryption Scheme in Lightweight Narrowband Internet of Things Applications
Qian, Hanjia
Wang, Yihuai
Peng, Tao
Chen, Cheng
Luo, Xizhao
[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (05): : 1112 - 1122
[10] Efficient Distributed Inference of Deep Neural Networks via Restructuring and Pruning
Abdi, Afshin
Rashidi, Saeed
Fekri, Faramarz
Krishna, Tushar
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6640 - 6648

← 1 2 3 4 5 →