Reliability-Aware Training and Performance Modeling for Processing-In-Memory Systems

被引:1
|
作者
Sun, Hanbo [1 ]
Zhu, Zhenhua [1 ]
Cai, Yi [1 ]
Zeng, Shulin [1 ]
Qiu, Kaizhong [1 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
机构
[1] Tsinghua Univ, BNRist, Dept EE, Beijing, Peoples R China
来源
2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3394885.3431633
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memristor based Processing-In-Memory (PIM) systems give alternative solutions to boost the computing energy efficiency of Convolutional Neural Network (CNN) based algorithms. However, Analog-to-Digital Converters' (ADCs) high interface costs and the limited size of the memristor crossbars make it challenging to map CNN models onto PIM systems with both high accuracy and high energy efficiency. Besides, it takes a long time to simulate the performance of large-scale PIM systems, resulting in unacceptable development time for the PIM system. To address these problems, we propose a reliability-aware training framework and a behavior-level modeling tool (MNSIM 2.0) for PIM accelerators. The proposed reliability-aware training framework, containing network splitting/merging analysis and a PIM-based non-uniform activation quantization scheme, can improve the energy efficiency by reducing the ADC resolution requirements in memristor crossbars. Moreover, MNSIM 2.0 provides a general modeling method for PIM architecture design and computation data flow; it can evaluate both accuracy and hardware performance within a short time. Experiments based on MNSIM 2.0 show that the reliability-aware training framework can improve 3.4x energy efficiency of PIM accelerators with little accuracy loss. The equivalent energy efficiency is 9.02 TOPS/W, nearly 2.6 similar to 4.2x compared with the existing work. We also evaluate more case studies of MNSIM 2.0, which help us balance the trade-off between accuracy and hardware performance.
引用
收藏
页码:847 / 852
页数:6
相关论文
共 50 条
  • [21] Reliability-Aware Design Strategies for Stateful Logic-in-Memory Architectures
    Zanotti, Tommaso
    Puglisi, Francesco Maria
    Pavan, Paolo
    IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2020, 20 (02) : 278 - 285
  • [22] Active Memory Cube: A processing-in-memory architecture for exascale systems
    Nair, R.
    Antao, S. F.
    Bertolli, C.
    Bose, P.
    Brunheroto, J. R.
    Chen, T.
    Cher, C. -Y.
    Costa, C. H. A.
    Doi, J.
    Evangelinos, C.
    Fleischer, B. M.
    Fox, T. W.
    Gallo, D. S.
    Grinberg, L.
    Gunnels, J. A.
    Jacob, A. C.
    Jacob, P.
    Jacobson, H. M.
    Karkhanis, T.
    Kim, C.
    Moreno, J. H.
    O'Brien, J. K.
    Ohmacht, M.
    Park, Y.
    Prener, D. A.
    Rosenburg, B. S.
    Ryu, K. D.
    Sallenave, O.
    Serrano, M. J.
    Siegl, P. D. M.
    Sugavanam, K.
    Sura, Z.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2015, 59 (2-3)
  • [23] A framework for reliability-aware design exploration on MPSoC based systems
    Jia Huang
    Andreas Raabe
    Kai Huang
    Christian Buckl
    Alois Knoll
    Design Automation for Embedded Systems, 2012, 16 : 189 - 220
  • [24] Reliability-aware scheduling strategy for heterogeneous distributed computing systems
    Tang, Xiaoyong
    Li, Kenli
    Li, Renfa
    Veeravalli, Bharadwaj
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 941 - 952
  • [25] Reliability-Aware Energy Management in Mixed-Criticality Systems
    Taherin, Amir
    Salehi, Mohammad
    Ejlali, Alireza
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2018, 3 (03): : 195 - 208
  • [26] Accelerating CNN Training With Concurrent Execution of GPU and Processing-in-Memory
    Choi, Jungwoo
    Lee, Hyuk-Jae
    Sohn, Kyomin
    Yu, Hak-Soo
    Rhee, Chae Eun
    IEEE ACCESS, 2024, 12 : 160190 - 160204
  • [27] RADAR: A Skew-Resistant and Hotness-Aware Ordered Index Design for Processing-in-Memory Systems
    Hua, Yifan
    Zheng, Shengan
    Kong, Weihan
    Zhou, Cong
    Huang, Kaixin
    Ma, Ruoyan
    Huang, Linpeng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (09) : 1598 - 1614
  • [28] TransPimLib: Efficient Transcendental Functions for Processing-in-Memory Systems
    Item, Maurus
    Gomez-Luna, Juan
    Guo, Yuxin
    Oliveira, Geraldo F.
    Sadrosadati, Mohammad
    Mutlu, Onur
    2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS, 2023, : 235 - 247
  • [29] GSPN-Based Reliability-Aware Performance Evaluation of IoT Services
    Li, Songyuan
    Huang, Jiwei
    2017 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC), 2017, : 483 - 486
  • [30] Reliability-aware task scheduling for energy efficiency on heterogeneous multiprocessor systems
    Zexi Deng
    Dunqian Cao
    Hong Shen
    Zihan Yan
    Huimin Huang
    The Journal of Supercomputing, 2021, 77 : 11643 - 11681