Exploiting Variable Precision Computation Array for Scalable Neural Network Accelerators

被引：0

作者：

Yang, Shaofei ^{[1
]}

Liu, Longjun ^{[1
]}

Li, Baoting ^{[1
]}

Sun, Hongbin ^{[1
]}

Zheng, Nanning ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China

来源：

2020 2ND IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

Deep Neural Networks; Accelerator; Energy Efficiency Computing Array; Dynamic Quantization; Resiliency;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a flexible Variable Precision Computation Array (VPCA) component for different accelerators, which leverages a sparsification scheme for activations and a low bits serial-parallel combination computation unit for improving the efficiency and resiliency of accelerators. The VPCA can dynamically decompose the width of activation/weights (from 32bit to 3bit in different accelerators) into 2-bits serial computation units while the 2bits computing units can be combined in parallel computing for high throughput. We propose an on-the-fly compressing and calculating strategy SLE-CLC (single lane encoding, cross lane calculation), which could further improve performance of 2-bit parallel computing. The experiments results on image classification datasets show VPCA can outperforms DaDianNao, Stripes, Loom-2bit by 4.67x, 2.42x, 1.52x without other overhead on convolution layers.

引用

页码：315 / 319

页数：5

共 50 条

[11] Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators
Lee, Jeong-Jun
Li, Peng
2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 57 - 64
[12] Exploiting and Enhancing Computation Latency Variability for High-Performance Time-Domain Computing-in-Memory Neural Network Accelerators
Wang, Chia-Chun
Lo, Yun-Chen
Wu, Jun-Shen
Tsai, Yu-Chih
Chang, Chia-Cheng
Hsu, Tsen-Wei
Chu, Min-Wei
Lai, Chuan-Yao
Liu, Ren-Shuo
2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 515 - 522
[13] A variable discretization precision rough logic neural network
Zhang Dong-bo
Wang Yao-nan
PROCEEDINGS OF 2006 CHINESE CONTROL AND DECISION CONFERENCE, 2006, : 253 - 258
[14] Refresh Triggered Computation: Improving the Energy Efficiency of Convolutional Neural Network Accelerators
Jafri, Syed M. A. H.
Hassan, Hasan
Hemani, Ahmed
Mutlu, Onur
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (01)
[15] A RRAM-based Coarse Grain Reconfigurable Array for Neural Network Accelerators
Chen, Zhengyu
Zhou, Hai
Gu, Jie
2018 IEEE SOI-3D-SUBTHRESHOLD MICROELECTRONICS TECHNOLOGY UNIFIED CONFERENCE (S3S), 2018,
[16] Tetris: Re-architecting Convolutional Neural Network Computation for Machine Learning Accelerators
Lu, Hang
Wei, Xin
Lin, Ning
Yan, Guihai
Li, Xiao-Wei
2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
[17] Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication
Lee, Sunwoo
Jha, Dipendra
Agrawal, Ankit
Choudhary, Alok
Liao, Wei-keng
2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2017, : 183 - 192
[18] Dynamic Precision-Scalable Thermal Mapping Algorithm for Three Dimensional Systolic-Array Based Neural Network Accelerator
Lin, Shu-Yen
Tsai, Chun-Kuan
Kao, Wen-Chun
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 757 - 769
[19] A survey of neural network accelerators
Li, Zhen
Wang, Yuqing
Zhi, Tian
Chen, Tianshi
FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (05) : 746 - 761
[20] A survey of neural network accelerators
Zhen Li
Yuqing Wang
Tian Zhi
Tianshi Chen
Frontiers of Computer Science, 2017, 11 : 746 - 761

← 1 2 3 4 5 →