An efficient pruning scheme of deep neural networks for Internet of Things applications

被引:6
|
作者
Qi, Chen [1 ]
Shen, Shibo [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [2 ]
Liu, Qing [3 ]
Liang, Jing [3 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
[3] Huawei Technol Co Ltd, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural networks; Deep learning; Internet of Things; Resource-limited edge computing; Pruning; Efficiency; IOT;
D O I
10.1186/s13634-021-00744-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, deep neural networks (DNNs) have been rapidly deployed to realize a number of functionalities like sensing, imaging, classification, recognition, etc. However, the computational-intensive requirement of DNNs makes it difficult to be applicable for resource-limited Internet of Things (IoT) devices. In this paper, we propose a novel pruning-based paradigm that aims to reduce the computational cost of DNNs, by uncovering a more compact structure and learning the effective weights therein, on the basis of not compromising the expressive capability of DNNs. In particular, our algorithm can achieve efficient end-to-end training that transfers a redundant neural network to a compact one with a specifically targeted compression rate directly. We comprehensively evaluate our approach on various representative benchmark datasets and compared with typical advanced convolutional neural network (CNN) architectures. The experimental results verify the superior performance and robust effectiveness of our scheme. For example, when pruning VGG on CIFAR-10, our proposed scheme is able to significantly reduce its FLOPs (floating-point operations) and number of parameters with a proportion of 76.2% and 94.1%, respectively, while still maintaining a satisfactory accuracy. To sum up, our scheme could facilitate the integration of DNNs into the common machine-learning-based IoT framework and establish distributed training of neural networks in both cloud and edge.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Pruning deep convolutional neural networks for efficient edge computing in condition assessment of infrastructures
    Wu, Rih-Teng
    Singla, Ankush
    Jahanshahi, Mohammad R.
    Bertino, Elisa
    Ko, Bong Jun
    Verma, Dinesh
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2019, 34 (09) : 774 - 789
  • [42] A Novel Clustering-Based Filter Pruning Method for Efficient Deep Neural Networks
    Wei, Xiaohui
    Shen, Xiaoxian
    Zhou, Changbao
    Yue, Hengshan
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 245 - 258
  • [43] Automated Pruning of Neural Networks for Mobile Applications
    Glinserer, Andreas
    Lechner, Martin
    Wendt, Alexander
    [J]. 2021 IEEE 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2021,
  • [44] DEEP LEARNING BASED METHOD FOR PRUNING DEEP NEURAL NETWORKS
    Li, Lianqiang
    Zhu, Jie
    Sun, Ming-Ting
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 312 - 317
  • [45] A secure and efficient certificateless signature scheme for Internet of Things
    Xiang, Dengmei
    Li, Xuelian
    Gao, Juntao
    Zhang, Xiachuan
    [J]. AD HOC NETWORKS, 2022, 124
  • [46] DeepPet: A Pet Animal Tracking System in Internet of Things using Deep Neural Networks
    Hammam, Ahmed Ali
    Soliman, Mona M.
    Hassanein, Aboul Ella
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 38 - 43
  • [47] Multi-fidelity deep neural networks for adaptive inference in the internet of multimedia things
    Leroux, Sam
    Bohez, Steven
    De Coninck, Elias
    Van Molle, Pieter
    Vankeirsbilck, Bert
    Verbelen, Tim
    Simoens, Pieter
    Dhoedt, Bart
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 355 - 360
  • [48] Brain MRI analysis using deep neural network for medical of internet things applications
    Masood, Momina
    Maham, Rabbia
    Javed, Ali
    Tariq, Usman
    Khan, Muhammad Attique
    Kadry, Seifedine
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [49] A Distributed Efficient Blockchain Oracle Scheme for Internet of Things
    Xian, Youquan
    Zhou, Lianghaojie
    Jiang, Jianyong
    Wang, Boyi
    Huo, Hao
    Liu, Peng
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107B (09) : 573 - 582
  • [50] EDAS: Efficient Data Aggregation Scheme for Internet of Things
    Ogundoyin, Sunday Oyinlola
    Awoyemi, Sunday Oladele
    [J]. JOURNAL OF APPLIED SECURITY RESEARCH, 2018, 13 (03) : 347 - 375