Deep Neural Network Quantization via Layer-Wise Optimization Using Limited Training Data

Cited by: 0
Authors
Chen, Shangyu [1]
Wang, Wenya [1]
Pan, Sinno Jialin [1]
Affiliations
[1] Nanyang Technological University, Singapore
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The advancement of deep models poses great challenges to real-world deployment because of the limited computational ability and storage space on edge devices. To solve this problem, existing works have made progress in pruning or quantizing deep models. However, most existing methods rely heavily on a supervised training process to achieve satisfactory performance, which requires a large amount of labeled training data and may not be practical for real deployment. In this paper, we propose a novel layer-wise quantization method for deep neural networks that requires only limited training data (1% of the original dataset). Specifically, we formulate parameter quantization for each layer as a discrete optimization problem and solve it using the Alternating Direction Method of Multipliers (ADMM), which gives an efficient closed-form solution. We prove that the final performance drop after quantization is bounded by a linear combination of the reconstruction errors incurred at each layer. Based on this theorem, we propose an algorithm that quantizes a deep neural network layer by layer, with an additional weight update step to minimize the final error. Extensive experiments on benchmark deep models demonstrate the effectiveness of the proposed method using 1% of the CIFAR10 and ImageNet datasets. Code is available at: https://github.com/csyhhu/L-DNQ
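The layer-wise formulation in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the repository linked above for that); it is a minimal NumPy example under stated assumptions: a single linear layer, a ternary codebook {-a, 0, +a} with a learned scale a, an output-reconstruction objective ||X W_fp - X W||^2, and scaled-form ADMM. The function names `project_ternary` and `admm_quantize_layer`, the penalty `rho`, and the iteration counts are all hypothetical choices made for illustration.

```python
import numpy as np


def project_ternary(v, n_iter=10):
    """Approximately project v onto {-a, 0, +a} by alternating
    between the ternary codes q and the scalar scale a."""
    a = np.abs(v).mean() + 1e-12  # assumed initial scale
    q = np.zeros_like(v)
    for _ in range(n_iter):
        # Nearest code in {-1, 0, 1} for the current scale.
        q = np.clip(np.round(v / a), -1, 1)
        denom = np.sum(q * q)
        if denom == 0:
            break
        # Least-squares scale for the current codes.
        a = np.sum(v * q) / denom
    return a * q


def admm_quantize_layer(x, w_fp, rho=1.0, n_iter=50):
    """Quantize one layer's weights w_fp so that x @ w stays close
    to x @ w_fp, using a small calibration batch x."""
    h = x.T @ x
    lhs = 2.0 * h + rho * np.eye(h.shape[0])
    rhs_const = 2.0 * h @ w_fp
    g = project_ternary(w_fp)   # discrete copy of the weights
    u = np.zeros_like(w_fp)     # scaled dual variable
    for _ in range(n_iter):
        # Continuous step: closed-form solve of a ridge-like system.
        w = np.linalg.solve(lhs, rhs_const + rho * (g - u))
        # Discrete step: projection onto the ternary set.
        g = project_ternary(w + u)
        # Dual update.
        u = u + w - g
    return g


rng = np.random.default_rng(0)
x = rng.normal(size=(64, 8))     # stand-in for the limited calibration data
w_fp = rng.normal(size=(8, 4))   # stand-in for full-precision layer weights
w_q = admm_quantize_layer(x, w_fp)
print(np.unique(w_q).size)       # number of distinct weight levels
```

In a full pipeline of the kind the abstract describes, a routine like this would be applied to each layer in turn, feeding every layer the activations produced by the already-quantized layers before it; the additional weight update step mentioned in the abstract is not shown here.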
Pages: 3329-3336
Page count: 8