A Quantized Training Framework for Robust and Accurate ReRAM-based Neural Network Accelerators

Cited by: 7

Authors:

Zhang, Chenguang [1]

Zhou, Pingqiang [1]

Affiliations:

[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China

Source:

2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021

Keywords:

ReRAM; Neural Network; Variation; Robust; Quantize;

DOI:

10.1145/3394885.3431528

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Neural networks (NNs), especially deep neural networks (DNNs), have achieved great success in many fields. The ReRAM crossbar is a promising candidate for accelerating neural networks because it natively performs matrix-vector multiplication (MVM). However, ReRAM crossbars suffer from high conductance variation caused by many non-ideal effects, which greatly degrades inference accuracy. Recent works use uniform quantization to improve tolerance to conductance variation, but these methods still suffer high accuracy loss under large variation. In this paper, we first analyze the impact of quantization and conductance variation on accuracy. Then, based on two observations, we propose a quantized training framework that enhances the robustness and accuracy of the neural network running on the accelerator by introducing a smart non-uniform quantizer. The framework consists of a robust trainable quantizer and a corresponding training method; it requires no extra hardware overhead and is compatible with a standard neural network training procedure. Experimental results show that the proposed method improves inference accuracy by 10% to 30% under large variation compared with uniform quantization.
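The interaction between uniform quantization and conductance variation described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's method and does not implement its non-uniform quantizer; it only shows a uniform quantizer followed by an assumed multiplicative log-normal variation model applied to a simulated crossbar MVM. The function names, the variation model, and all parameter values (4 bits, sigma = 0.5, matrix sizes) are hypothetical choices made for illustration.

```python
import numpy as np

def uniform_quantize(w, bits):
    """Map weights to 2**bits evenly spaced levels spanning [w.min(), w.max()]."""
    levels = 2 ** bits
    lo, hi = w.min(), w.max()
    step = (hi - lo) / (levels - 1)
    return lo + np.round((w - lo) / step) * step

def apply_variation(w, sigma, rng):
    """Assumed variation model: multiplicative log-normal noise per stored weight."""
    return w * np.exp(rng.normal(0.0, sigma, size=w.shape))

rng = np.random.default_rng(0)
W = rng.normal(0.0, 1.0, size=(64, 128))   # hypothetical layer weights
x = rng.normal(0.0, 1.0, size=128)         # hypothetical input vector

y_ideal = W @ x                            # MVM with ideal, full-precision weights
W_q = uniform_quantize(W, bits=4)          # weights as programmed into the crossbar

def relative_error(y):
    """Relative L2 error of an MVM result against the ideal output."""
    return np.linalg.norm(y - y_ideal) / np.linalg.norm(y_ideal)

err_quant = relative_error(W_q @ x)                             # quantization only
err_noisy = relative_error(apply_variation(W_q, 0.5, rng) @ x)  # plus variation

print(f"quantization only: {err_quant:.3f}")
print(f"with variation:    {err_noisy:.3f}")
```

Sweeping `sigma` and `bits` in this sketch gives a rough feel for how output error grows with variation at a fixed number of quantization levels, which is the regime the abstract says uniform quantization handles poorly.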


Pages: 43-48

Page count: 6

