Smilodon: An Efficient Accelerator for Low Bit-Width CNNs with Task Partitioning

被引:1
|
作者
Chen, Qinyu [1 ]
Fu, Yuxiang [1 ]
Cheng, Kaifeng [1 ]
Song, Wenqing [1 ]
Lu, Zhonghai [2 ]
Li, Li [1 ]
Zhang, Chuan [3 ,4 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Jiangsu, Peoples R China
[2] KTH Royal Inst Technol, Stockholm, Sweden
[3] Southeast Univ, Lab Efficient Architectures Digital Commun & Sign, Nanjing, Jiangsu, Peoples R China
[4] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing, Jiangsu, Peoples R China
关键词
Low bit-width CNNs; 3D systolic-like array; task partitioning; parallel streaming architecture;
D O I
10.1109/iscas.2019.8702547
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNNs) have been widely applied in various fields such as image and video recognition, recommender systems, and natural language processing. However, the massive size and intensive computation loads prevent its feasible deployment in practice, especially on the embedded systems. As a highly competitive candidate, low bit-width CNNs are proposed to enable efficient implementation. In this paper, we propose Smilodon, a scalable, efficient accelerator for low bit-width CNNs based on a parallel streaming architecture, optimized with a task partitioning strategy. We also present the 3D systolic-like computing arrays fitting for convolutional layers. Our design is implemented on Zynq XC7ZO20 FPGA, which can satisfy the needs of real-time with a frame rate of 1, 622 FPS throughput, while consuming 2.1 Watt. To the best of our knowledge, our accelerator is superior to the state-of-the-art works in the tradeoff among throughput, power efficiency, and area efficiency.
引用
收藏
页数:5
相关论文
共 44 条
  • [1] An Efficient Streaming Accelerator for Low Bit-Width Convolutional Neural Networks
    Chen, Qinyu
    Fu, Yuxiang
    Song, Wenqing
    Cheng, Kaifeng
    Lu, Zhonghai
    Zhang, Chuan
    Li, Li
    [J]. ELECTRONICS, 2019, 8 (04)
  • [2] An Energy-Efficient Accelerator for Hybrid Bit-width DNNs
    Liu, Bo
    Ruan, Xing
    Xia, Mengwen
    Gong, Yu
    Yang, Jinjiang
    Ge, Wei
    Yang, Jun
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3306 - 3313
  • [3] Bit-width Adaptive Accelerator Design for Convolution Neural Network
    Guo, Jianxin
    Yin, Shouyi
    Ouyang, Peng
    Tu, Fengbin
    Tang, Shibin
    Liu, Leibo
    Wei, Shaojun
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [4] Bit-width Reduction and Customized Register for Low Cost Convolutional Neural Network Accelerator
    Choi, Kyungrak
    Choi, Woong
    Shin, Kyungho
    Park, Jongsun
    [J]. 2017 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2017,
  • [5] Bit-width Based Resource Partitioning for CNN Acceleration on FPGA
    Guo, Jianxin
    Yin, Shouyi
    Ouyang, Peng
    Liu, Leibo
    Wei, Shaojun
    [J]. 2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, : 31 - 31
  • [6] Residual Quantization for Low Bit-Width Neural Networks
    Li, Zefan
    Ni, Bingbing
    Yang, Xiaokang
    Zhang, Wenjun
    Gao, Wen
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 214 - 227
  • [7] Low Bit-Width Convolutional Neural Network on RRAM
    Cai, Yi
    Tang, Tianqi
    Xia, Lixue
    Li, Boxun
    Wang, Yu
    Yang, Huazhong
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (07) : 1414 - 1427
  • [8] Combinatorial optimization for low bit-width neural networks
    Zhou, Han
    Ashrafi, Aida
    Blaschko, Matthew B.
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2246 - 2252
  • [9] An Efficient Combined Bit-Width Reducing Method for Ising Models
    Yachi, Yuta
    Tawada, Masashi
    Togawa, Nozomu
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (04) : 495 - 508
  • [10] Low Cost Convolutional Neural Network Accelerator Based on Bi-Directional Filtering and Bit-Width Reduction
    Choi, Woong
    Choi, Kyungrak
    Park, Jongsun
    [J]. IEEE ACCESS, 2018, 6 : 14734 - 14746