Reliability-Aware Training and Performance Modeling for Processing-In-Memory Systems

Cited by: 1

Authors:

Sun, Hanbo [1]

Zhu, Zhenhua [1]

Cai, Yi [1]

Zeng, Shulin [1]

Qiu, Kaizhong [1]

Wang, Yu [1]

Yang, Huazhong [1]

Affiliation:

[1] Tsinghua Univ, BNRist, Dept EE, Beijing, Peoples R China

Source:

2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021

Funding:

National Natural Science Foundation of China

Keywords:

DOI:

10.1145/3394885.3431633

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Memristor-based Processing-In-Memory (PIM) systems offer an alternative way to boost the computing energy efficiency of Convolutional Neural Network (CNN) based algorithms. However, the high interface cost of Analog-to-Digital Converters (ADCs) and the limited size of memristor crossbars make it challenging to map CNN models onto PIM systems with both high accuracy and high energy efficiency. In addition, simulating the performance of large-scale PIM systems takes a long time, which leads to unacceptable development time for such systems. To address these problems, we propose a reliability-aware training framework and a behavior-level modeling tool (MNSIM 2.0) for PIM accelerators. The proposed reliability-aware training framework, which contains network splitting/merging analysis and a PIM-based non-uniform activation quantization scheme, improves energy efficiency by reducing the ADC resolution requirements in memristor crossbars. Moreover, MNSIM 2.0 provides a general modeling method for PIM architecture design and computation data flow; it can evaluate both accuracy and hardware performance within a short time. Experiments based on MNSIM 2.0 show that the reliability-aware training framework can improve the energy efficiency of PIM accelerators by 3.4x with little accuracy loss. The equivalent energy efficiency is 9.02 TOPS/W, roughly 2.6x to 4.2x that of existing work. We also evaluate further case studies with MNSIM 2.0, which help us balance the trade-off between accuracy and hardware performance.


Pages: 847-852

Page count: 6
