Rate-Accuracy Optimization of Deep Convolutional Neural Network Models

被引:0
|
作者
Filini, Alessandro [1 ]
Ascenso, Joao [2 ]
Leonardi, Riccardo [1 ]
机构
[1] Univ Brescia, Dipartimento Ingn Informaz, Brescia, Italy
[2] Inst Super Tecn, Inst Telecomunicacoes, Lisbon, Portugal
关键词
D O I
10.1109/ISM.2017.121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, deep learning has enjoyed a great deal of success for computer vision problems due to its capability to model highly complex tasks, such as image classification, object detection, face recognition, among many others. Although these neural networks are nowadays very powerful, there is a huge amount of parameters (i.e. the model) that need to be learned and require considerable storage space and bandwidth during transmission. This paper addresses the problems of storage and transmission of large deep learning models by proposing a compression solution that is independent of the model being trained as well as the data used for training. An efficient compression framework for the parameters of a neural network, more precisely the weights that interconnect. the different neurons, which consume a significant amount of resources (memory, storage and bandwidth) is proposed. Several quantization strategies are considered as well as a statistical models 14 the different layers of a neural network, which are exploited by an arithmetic coding engine. Experimental results show that up to 92% bitrate savings can he obtained with minimal impact in terms of image classification accuracy.
引用
收藏
页码:91 / 98
页数:8
相关论文
共 50 条
  • [1] RATE-ACCURACY TRADE-OFF IN VIDEO CLASSIFICATION WITH DEEP CONVOLUTIONAL NEURAL NETWORKS
    Abbas, Alhabib
    Jubran, Mohammad
    Chadha, Aaron
    Andreopoulos, Yiannis
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 793 - 797
  • [2] Rate-Accuracy Trade-Off in Video Classification With Deep Convolutional Neural Networks
    Jubran, Mohammad
    Abbas, Alhabib
    Chadha, Aaron
    Andreopoulos, Yiannis
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 145 - 154
  • [3] RATE-ACCURACY OPTIMIZATION OF BINARY DESCRIPTORS
    Redondi, Alessandro
    Baroffio, Luca
    Ascenso, Joao
    Cesana, Matteo
    Tagliasacchi, Marco
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2910 - 2914
  • [4] RATE-ACCURACY OPTIMIZATION IN VISUAL WIRELESS SENSOR NETWORKS
    Redondi, A.
    Cesana, M.
    Tagliasacchi, M.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1105 - 1108
  • [5] Development of Deep Convolutional Neural Network for Structural Topology Optimization
    Seo, Junhyeon
    Kapania, Rakesh K.
    AIAA JOURNAL, 2023, 61 (03) : 1366 - 1379
  • [6] Development of Deep Convolutional Neural Network for Structural Topology Optimization
    Seo, Junhyeon
    Kapania, Rakesh K.
    AIAA Journal, 2023, 61 (03): : 1366 - 1379
  • [7] Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
    He, Zhezhi
    Gong, Boqing
    Fan, Deliang
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 913 - 921
  • [8] Deep convolutional neural network models for the diagnosis of thyroid cancer
    Ha, Eun Ju
    Baek, Jung Hwan
    Na, Dong Gyu
    LANCET ONCOLOGY, 2019, 20 (03): : E130 - E130
  • [9] Deep Convolutional Neural Network
    Zhou, Yu
    Fang, Rui
    Liu, Peng
    Liu, Kai
    2019 PROCEEDINGS OF THE CONFERENCE ON CONTROL AND ITS APPLICATIONS, CT, 2019, : 46 - 51
  • [10] Optimization of deep convolutional neural network for large scale image retrieval
    Bai, Cong
    Huang, Ling
    Pan, Xiang
    Zheng, Jianwei
    Chen, Shengyong
    NEUROCOMPUTING, 2018, 303 : 60 - 67