Reliability-Aware Training and Performance Modeling for Processing-In-Memory Systems

被引:1
|
作者
Sun, Hanbo [1 ]
Zhu, Zhenhua [1 ]
Cai, Yi [1 ]
Zeng, Shulin [1 ]
Qiu, Kaizhong [1 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
机构
[1] Tsinghua Univ, BNRist, Dept EE, Beijing, Peoples R China
来源
2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3394885.3431633
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memristor based Processing-In-Memory (PIM) systems give alternative solutions to boost the computing energy efficiency of Convolutional Neural Network (CNN) based algorithms. However, Analog-to-Digital Converters' (ADCs) high interface costs and the limited size of the memristor crossbars make it challenging to map CNN models onto PIM systems with both high accuracy and high energy efficiency. Besides, it takes a long time to simulate the performance of large-scale PIM systems, resulting in unacceptable development time for the PIM system. To address these problems, we propose a reliability-aware training framework and a behavior-level modeling tool (MNSIM 2.0) for PIM accelerators. The proposed reliability-aware training framework, containing network splitting/merging analysis and a PIM-based non-uniform activation quantization scheme, can improve the energy efficiency by reducing the ADC resolution requirements in memristor crossbars. Moreover, MNSIM 2.0 provides a general modeling method for PIM architecture design and computation data flow; it can evaluate both accuracy and hardware performance within a short time. Experiments based on MNSIM 2.0 show that the reliability-aware training framework can improve 3.4x energy efficiency of PIM accelerators with little accuracy loss. The equivalent energy efficiency is 9.02 TOPS/W, nearly 2.6 similar to 4.2x compared with the existing work. We also evaluate more case studies of MNSIM 2.0, which help us balance the trade-off between accuracy and hardware performance.
引用
收藏
页码:847 / 852
页数:6
相关论文
共 50 条
  • [41] Reliability-aware performance model for optimal GPU-enabled cluster environment
    Supada Laosooksathit
    Raja Nassar
    Chokchai Leangsuksun
    Mihaela Paun
    The Journal of Supercomputing, 2014, 68 : 1630 - 1651
  • [42] Soft and Hard Reliability-Aware Scheduling for Multicore Embedded Systems with Energy Harvesting
    Xiang, Yi
    Pasricha, Sudeep
    IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2015, 1 (04): : 220 - 235
  • [43] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Wang, Shuli
    Li, Kenli
    Mei, Jing
    Xiao, Guoqing
    Li, Keqin
    JOURNAL OF GRID COMPUTING, 2017, 15 (01) : 23 - 39
  • [44] PM3: Power Modeling and Power Management for Processing-in-Memory
    Zhang, Chao
    Meng, Tong
    Sun, Guangyu
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 558 - 570
  • [45] A Case Study of Processing-in-Memory in off-the-Shelf Systems
    Nider, Joel
    Mustard, Craig
    Zoltan, Andrada
    Ramsden, John
    Liu, Larry
    Grossbard, Jacob
    Dashti, Mohammad
    Jodin, Romaric
    Ghiti, Alexandre
    Chauzi, Jordi
    Fedorova, Alexandra
    PROCEEDINGS OF THE 2021 USENIX ANNUAL TECHNICAL CONFERENCE, 2021, : 855 - 862
  • [46] Global Reliability-Aware Power Management for Multiprocessor Real-Time Systems
    Qi, Xuan
    Zhu, Dakai
    Aydin, Hakan
    16TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA 2010), 2010, : 183 - 192
  • [47] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Shuli Wang
    Kenli Li
    Jing Mei
    Guoqing Xiao
    Keqin Li
    Journal of Grid Computing, 2017, 15 : 23 - 39
  • [48] Reliability-aware techno-economic assessment of floating solar power systems
    Goswami, Anik
    Aizpurua, Jose I.
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 40
  • [49] Reliability-aware performance model for optimal GPU-enabled cluster environment
    Laosooksathit, Supada
    Nassar, Raja
    Leangsuksun, Chokchai
    Paun, Mihaela
    JOURNAL OF SUPERCOMPUTING, 2014, 68 (03): : 1630 - 1651
  • [50] Processing-in-memory (PIM)-based Manycore Architecture for Training Graph Neural Networks
    Pande, Partha P.
    2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,