Exploiting Variable Precision Computation Array for Scalable Neural Network Accelerators

Cited by: 0
Authors
Yang, Shaofei [1]
Liu, Longjun [1]
Li, Baoting [1]
Sun, Hongbin [1]
Zheng, Nanning [1]
Affiliations
[1] Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, Xi'an 710049, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Deep Neural Networks; Accelerator; Energy Efficiency Computing Array; Dynamic Quantization; Resiliency
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a flexible Variable Precision Computation Array (VPCA) component for different accelerators. It leverages a sparsification scheme for activations and a low-bit serial-parallel combined computation unit to improve the efficiency and resiliency of accelerators. The VPCA can dynamically decompose activations/weights of varying width (from 32 bits down to 3 bits in different accelerators) into 2-bit serial computation units, while the 2-bit computing units can be combined in parallel for high throughput. We propose an on-the-fly compressing and calculating strategy, SLE-CLC (single lane encoding, cross lane calculation), which further improves the performance of 2-bit parallel computing. Experimental results on image classification datasets show that VPCA outperforms DaDianNao, Stripes, and Loom-2bit by 4.67x, 2.42x, and 1.52x, respectively, on convolution layers without additional overhead.
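The idea of decomposing an operand into 2-bit serial steps can be illustrated with a small software sketch. This is not the authors' hardware design or their SLE-CLC strategy; it only shows, under simplifying assumptions (unsigned activations, signed integer weights, hypothetical function names `split_2bit` and `serial_dot`), how a dot product can be accumulated two activation bits at a time while all lanes are processed in parallel within each step.

```python
def split_2bit(value, width):
    """Split an unsigned `width`-bit integer into 2-bit digits, least significant first.

    Hypothetical helper for illustration; an odd width is padded up to the
    next multiple of two bits.
    """
    if value < 0 or value >= (1 << width):
        raise ValueError("value does not fit in the given width")
    n_digits = (width + 1) // 2
    return [(value >> (2 * i)) & 0b11 for i in range(n_digits)]


def serial_dot(activations, weights, width):
    """Dot product computed by consuming the activations 2 bits per step.

    Each step multiplies one 2-bit slice of every activation by its weight
    (the "parallel" part across lanes), then shifts the partial sum into
    place and accumulates it (the "serial" part across slices).
    """
    n_digits = (width + 1) // 2
    digits = [split_2bit(a, width) for a in activations]
    acc = 0
    for step in range(n_digits):
        partial = sum(d[step] * w for d, w in zip(digits, weights))
        acc += partial << (2 * step)  # weight of this slice is 4**step
    return acc


if __name__ == "__main__":
    acts = [5, 200, 17]
    wts = [3, -2, 7]
    # The serial result should match the ordinary dot product.
    print(serial_dot(acts, wts, width=8), sum(a * w for a, w in zip(acts, wts)))
```

In this sketch the number of serial steps scales with the operand width: an 8-bit activation takes four steps, while a 3-bit activation takes only two, which is the intuition behind trading precision for throughput.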
Pages: 315-319
Page count: 5