Flexible Quantization for Efficient Convolutional Neural Networks

Cited by: 1
Authors
Zacchigna, Federico Giordano [1]
Lew, Sergio [2, 3]
Lutenberg, Ariel [1, 3]
Affiliations
[1] Univ Buenos Aires, Fac Ingn FIUBA, Lab Sistemas Embebidos LSE, C1063ACV, Buenos Aires, Argentina
[2] Univ Buenos Aires, Fac Ingn FIUBA, Inst Ingn Biomed IIBM, C1063ACV, Buenos Aires, Argentina
[3] Consejo Nacl Invest Cient & Tecn CONICET, C1425FQB, Buenos Aires, Argentina
Keywords
CNN; quantization; uniform; non-uniform; mixed-precision; FPGA; ASIC; edge devices; embedded systems
DOI
10.3390/electronics13101923
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
This work focuses on the efficient quantization of convolutional neural networks (CNNs). Specifically, we introduce non-uniform uniform quantization (NUUQ), a novel quantization methodology that combines the benefits of non-uniform quantization, such as high compression levels, with the advantages of uniform quantization, which enables an efficient implementation in fixed-point hardware. NUUQ is based on decoupling the quantization levels from the number of bits. This decoupling allows for a trade-off between the spatial and temporal complexity of the implementation, which can be leveraged to further reduce the spatial complexity of the CNN without a significant performance loss. Additionally, we explore different quantization configurations and address typical use cases. The NUUQ algorithm achieves compression levels equivalent to 2 bits without an accuracy loss, and even levels equivalent to ~1.58 bits with a performance loss of only ~0.6%.
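As a rough illustration of what decoupling the quantization levels from the number of bits can mean, the sketch below computes the information-theoretic bit width of a set of levels as log2 of the level count. Reading ~1.58 bits as three levels (log2 3 ≈ 1.585) is an assumption made here, not something the abstract states. The helper names `equivalent_bits` and `quantize_to_levels`, and the example level set drawn from an integer grid, are hypothetical and are not the authors' NUUQ algorithm.

```python
import math


def equivalent_bits(num_levels: int) -> float:
    """Bits needed to index num_levels distinct quantization levels."""
    if num_levels < 1:
        raise ValueError("num_levels must be at least 1")
    return math.log2(num_levels)


def quantize_to_levels(values, levels):
    """Map each value to the nearest entry in levels (nearest-neighbour rounding)."""
    return [min(levels, key=lambda q: abs(q - v)) for v in values]


# Hypothetical level set: unevenly spaced, but every level sits on a
# uniform integer grid, so fixed-point arithmetic still applies.
levels = [-4, 0, 3]

print(round(equivalent_bits(len(levels)), 3))  # three levels
print(equivalent_bits(4))                      # four levels
print(quantize_to_levels([0.9, -0.2, -3.1], levels))
```

The level count, not a fixed bit width, sets the compression figure here, which is why a non-integer value such as ~1.58 bits is possible at all.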
Pages: 16