Bit-width Adaptive Accelerator Design for Convolution Neural Network

被引:1
|
作者
Guo, Jianxin [1 ]
Yin, Shouyi [1 ]
Ouyang, Peng [1 ]
Tu, Fengbin [1 ]
Tang, Shibin [1 ]
Liu, Leibo [1 ]
Wei, Shaojun [1 ]
机构
[1] Tsinghua Univ, Inst Microelect, Beijing 100084, Peoples R China
关键词
D O I
10.1109/ISCAS.2018.8351666
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks (CNNs) have achieved great success in many applications. Recently, various FPGA-based accelerators have been proposed to improve the performance of CNNs. However, current most FPGA-based methods only use the same bit-width selection for all CNN layers which lead to very low resource utilization and difficulty in further performance improvement. In this paper, we propose a bit-width adaptive accelerator design approach which can adapt to the CNN layers with various bit-width requirements in a same network. We construct multiple different bit-width convolutional processors to compute the CNN layers in parallel way. We partition the FPGA DSP resources and use our optimization approach to find the optimal resource allocation. On a Xilinx Virtex-7 FPGA, our design approach achieves higher throughput than the state-of-the-art FPGA-based CNN accelerators from 5.48x to 7.25x and by 6.20x on average, when we evaluate the convolutional layers of AlexNet and deeper VGG CNNs.
引用
收藏
页数:5
相关论文
共 50 条
  • [11] Residual Quantization for Low Bit-Width Neural Networks
    Li, Zefan
    Ni, Bingbing
    Yang, Xiaokang
    Zhang, Wenjun
    Gao, Wen
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 214 - 227
  • [12] Combinatorial optimization for low bit-width neural networks
    Zhou, Han
    Ashrafi, Aida
    Blaschko, Matthew B.
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2246 - 2252
  • [13] Smilodon: An Efficient Accelerator for Low Bit-Width CNNs with Task Partitioning
    Chen, Qinyu
    Fu, Yuxiang
    Cheng, Kaifeng
    Song, Wenqing
    Lu, Zhonghai
    Li, Li
    Zhang, Chuan
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [14] Design of Approximate FFT with Bit-width Selection Algorithms
    Liao, Qicong
    Liu, Weiqiang
    Qiao, Fei
    Wang, Chenghua
    Lombardi, Fabrizio
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [15] AdaQAT: Adaptive Bit-Width Quantization-Aware Training
    Gernigon, Cedric
    Filip, Silviu-Ioan
    Sentieys, Olivier
    Coggiola, Clement
    Bruno, Mickael
    [J]. 2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 442 - 446
  • [16] Arbitrary Bit-width Network: A Joint Layer-Wise Quantization and Adaptive Inference Approach
    Tang, Chen
    Zhai, Haoyu
    Ouyang, Kai
    Wang, Zhi
    Zhu, Yifei
    Zhu, Wenwu
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2899 - 2908
  • [17] Cell Division: Weight Bit-Width Reduction Technique for Convolutional Neural Network Hardware Accelerators
    Park, Hanmin
    Choi, Kiyoung
    [J]. 24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 286 - 291
  • [18] Effects of Data's Bit-width on Digital Logic Design
    Chang, Sheng
    Cai, Heng
    Wang, Hao
    He, Jin
    Huang, Qijun
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1670 - +
  • [19] Reducing energy dissipation of frame memory by adaptive bit-width compression
    Moshnyaga, VG
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (08) : 713 - 718
  • [20] Accelerating Low Bit-Width Convolutional Neural Networks With Embedded FPGA
    Jiao, Li
    Luo, Cheng
    Cao, Wei
    Zhou, Xuegong
    Wang, Lingli
    [J]. 2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,