Reliability-Aware Training and Performance Modeling for Processing-In-Memory Systems

被引:1
|
作者
Sun, Hanbo [1 ]
Zhu, Zhenhua [1 ]
Cai, Yi [1 ]
Zeng, Shulin [1 ]
Qiu, Kaizhong [1 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
机构
[1] Tsinghua Univ, BNRist, Dept EE, Beijing, Peoples R China
来源
2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3394885.3431633
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memristor based Processing-In-Memory (PIM) systems give alternative solutions to boost the computing energy efficiency of Convolutional Neural Network (CNN) based algorithms. However, Analog-to-Digital Converters' (ADCs) high interface costs and the limited size of the memristor crossbars make it challenging to map CNN models onto PIM systems with both high accuracy and high energy efficiency. Besides, it takes a long time to simulate the performance of large-scale PIM systems, resulting in unacceptable development time for the PIM system. To address these problems, we propose a reliability-aware training framework and a behavior-level modeling tool (MNSIM 2.0) for PIM accelerators. The proposed reliability-aware training framework, containing network splitting/merging analysis and a PIM-based non-uniform activation quantization scheme, can improve the energy efficiency by reducing the ADC resolution requirements in memristor crossbars. Moreover, MNSIM 2.0 provides a general modeling method for PIM architecture design and computation data flow; it can evaluate both accuracy and hardware performance within a short time. Experiments based on MNSIM 2.0 show that the reliability-aware training framework can improve 3.4x energy efficiency of PIM accelerators with little accuracy loss. The equivalent energy efficiency is 9.02 TOPS/W, nearly 2.6 similar to 4.2x compared with the existing work. We also evaluate more case studies of MNSIM 2.0, which help us balance the trade-off between accuracy and hardware performance.
引用
收藏
页码:847 / 852
页数:6
相关论文
共 50 条
  • [31] Reliability-aware task scheduling for energy efficiency on heterogeneous multiprocessor systems
    Deng, Zexi
    Cao, Dunqian
    Shen, Hong
    Yan, Zihan
    Huang, Huimin
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (10): : 11643 - 11681
  • [32] Energy-Efficient Reliability-Aware Scheduling Algorithm on Heterogeneous Systems
    Tang, Xiaoyong
    Tan, Weizhen
    SCIENTIFIC PROGRAMMING, 2016, 2016
  • [33] Reliability-Aware SPICE Compatible Compact Modeling of IGZO Inverters on a Flexible Substrate
    Kim, Je-Hyuk
    Seo, Youngjin
    Jang, Jun Tae
    Park, Shinyoung
    Kang, Dongyeon
    Park, Jaewon
    Han, Moonsup
    Kim, Changwook
    Park, Dong-Wook
    Kim, Dae Hwan
    APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [34] Live Demonstration for Input-Sparsity-Aware RRAM Processing-in-Memory Chip
    Wang, Junjie
    Liu, Shuang
    Pan, Ruicheng
    Yan, Shiqin
    Liu, Yihe
    Liu, Yang
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [35] PIMSAB: A Processing-In-Memory System with Spatially-Aware Communication and Bit-Serial-Aware Computation
    Ma, Siyuan
    Mhatre, Kaustubh
    Weng, Jian
    Hanindhito, Bagus
    Wang, Zhengrong
    Nowatzki, Tony
    John, Lizy
    Arora, Aman
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 21 (04)
  • [36] Aggressive Performance Improvement on Processing-in-Memory Devices by Adopting Hugepages
    Santos, Paulo Cesar
    Forlin, Bruno E.
    Alves, Marco A. Z.
    Carro, Luigi
    2022 IEEE 33RD INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2022, : 60 - 63
  • [37] Optimization of OLAP In-Memory Database Management Systems with Processing-In-Memory Architecture
    Hosseinzadeh, Shima
    Parvaresh, Amirhossein
    Fey, Dietmar
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2023, 2023, 13949 : 264 - 278
  • [38] NPC: A Non-Conflicting Processing-in-Memory Controller in DDR Memory Systems
    Lee, Seungyong
    Lee, Sanghyun
    Seo, Minseok
    Park, Chunmyung
    Shin, Woojae
    Lee, Hyuk-Jae
    Kim, Hyun
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (03) : 1025 - 1039
  • [39] Reliability-aware low energy scheduling in real time systems with shared resources
    Zhang, Yi-wen
    Zhang, Hui-zhen
    Wang, Cheng
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 52 : 312 - 324
  • [40] Reliability-Aware Task Processing and Offloading for Data-Intensive Applications in Edge computing
    Liang, Jingyu
    Ma, Bowen
    Feng, Zihan
    Huang, Jiwei
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (04): : 4668 - 4680