Compressed Superposition of Neural Networks for Deep Learning in Edge Computing

被引:6
|
作者
Zeman, Marko [1 ]
Osipov, Evgeny [2 ]
Bosnic, Zoran [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
[2] Lulea Univ Technol, Dept Comp Sci Elect & Space Engn, Lulea, Sweden
关键词
D O I
10.1109/IJCNN52387.2021.9533602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates a combination of the two recently proposed techniques: superposition of multiple neural networks into one and neural network compression. We show that these two techniques can be successfully combined to deliver a great potential for trimming down deep convolutional neural networks. The work can be relevant in the context of implementing deep learning on low-end computing devices as it enables neural networks to fit edge devices with constrained computational resources (e.g. sensors, mobile devices, controllers). We study the trade-offs between the model compression rate and the accuracy of the superimposed tasks and present a CNN pipeline where the fully connected layers are isolated from the convolutional layers and serve as a general purpose neural processing unit for several CNN models. We show how deep models can be highly compressed with a limited accuracy degradation when additional compression is performed within the superposition principle.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Random sketch learning for deep neural networks in edge computing
    Li, Bin
    Chen, Peijun
    Liu, Hongfu
    Guo, Weisi
    Cao, Xianbin
    Du, Junzhao
    Zhao, Chenglin
    Zhang, Jun
    NATURE COMPUTATIONAL SCIENCE, 2021, 1 (03): : 221 - 228
  • [2] Random sketch learning for deep neural networks in edge computing
    Bin Li
    Peijun Chen
    Hongfu Liu
    Weisi Guo
    Xianbin Cao
    Junzhao Du
    Chenglin Zhao
    Jun Zhang
    Nature Computational Science, 2021, 1 : 221 - 228
  • [3] Efficient Deep Neural Networks for Edge Computing
    Alnemari, Mohammed
    Bagherzadeh, Nader
    2019 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING (IEEE EDGE), 2019, : 1 - 7
  • [4] BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split Computing
    Matsubara, Yoshitomo
    Callegaro, Davide
    Singh, Sameer
    Levorato, Marco
    Restuccia, Francesco
    2022 IEEE 23RD INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2022), 2022, : 337 - 346
  • [5] The Case for Adaptive Deep Neural Networks in Edge Computing
    McNamee, Francis
    Dustdar, Schahram
    Kilpatrick, Peter
    Shi, Weisong
    Spence, Ivor
    Varghese, Blesson
    2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 43 - 52
  • [6] Quantization of Deep Neural Networks for Accurate Edge Computing
    Chen, Wentao
    Qiu, Hailong
    Zhuang, Jian
    Zhang, Chutong
    Hu, Yu
    Lu, Qing
    Wang, Tianchen
    Shi, Yiyu
    Huang, Meiping
    Xu, Xiaowe
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 17 (04)
  • [7] Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing
    Mills, Jed
    Hu, Jia
    Min, Geyong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (03) : 630 - 641
  • [8] Targeted and Automatic Deep Neural Networks Optimization for Edge Computing
    Giovannesi, Luca
    Mattia, Gabriele Proietti
    Beraldi, Roberto
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 5, AINA 2024, 2024, 203 : 57 - 68
  • [9] Foothill: A Quasiconvex Regularization for Edge Computing of Deep Neural Networks
    Belbahri, Mouloud
    Sari, Eyyub
    Darabi, Sajad
    Nia, Vahid Partovi
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 3 - 14
  • [10] OptDNN: Automatic deep neural networks optimizer for edge computing
    Giovannesi, Luca
    Mattia, Gabriele Proietti
    Beraldi, Roberto
    SOFTWARE IMPACTS, 2024, 20