PIMFlow: Compiler and Runtime Support for CNN Models on Processing-in-Memory DRAM

被引:0
|
作者
Shin, Yongwon [1 ]
Park, Juseong [2 ]
Cho, Sungjun [2 ]
Sung, Hyojin [1 ,2 ]
机构
[1] POSTECH, Grad Sch AI, Pohang, South Korea
[2] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
基金
新加坡国家研究基金会;
关键词
Processing-in-memory; CNN models;
D O I
10.1145/3579990.3580009
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-in-Memory (PIM) has evolved over decades into a feasible solution to addressing the exacerbating performance bottleneck with main memory by placing computational logic in or near memory. Recent proposals from DRAM manufacturers highlighted the HW constraint-aware design of PIM-enabled DRAM with specialized MAC logic, providing an order of magnitude speedup for memory-intensive operations in DL models. Although the main target for PIM acceleration did not initially include convolutional neural networks due to their high compute intensity, recent CNN models are increasingly adopting computationally lightweight implementation. Motivated by the potential for the software stack to enable CNN models on DRAM-PIM hardware without invasive changes, we propose PIMFlow, an end-to-end compiler and runtime support, to accelerate CNN models on a PIM-enabled GPU memory. PIMFlow transforms model graphs to create inter-node parallelism across GPU and PIM, explores possible task- and data-parallel execution scenarios for optimal execution time, and provides a code-generating back-end and execution engine for DRAM-PIM. PIMFlow achieves up to 82% end-to-end speedup and reduces energy consumption by 26% on average for CNN model inferences.
引用
收藏
页码:249 / 262
页数:14
相关论文
共 22 条
  • [1] Runtime Support for Accelerating CNN Models on Digital DRAM Processing-in-Memory Hardware
    Shin, Yongwon
    Park, Juseong
    Hong, Jeongmin
    Sung, Hyojin
    IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (02) : 33 - 36
  • [2] A Compiler for Automatic Selection of Suitable Processing-in-Memory Instructions
    Ahmed, Hameeza
    Santos, Paulo C.
    Lima, Joao P. C.
    Moura, Rafael F.
    Alves, Marco A. Z.
    Beck, Antonio C. S.
    Carro, Luigi
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 564 - 569
  • [3] Extending the ONNX Runtime Framework for the Processing-in-Memory Execution
    Kim, Seok Young
    Lee, Jaewook
    Kim, Chang Hyun
    Lee, Won Jun
    Kim, Seon Wook
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [4] Compiler and runtime support for efficient software transactional memory
    Adl-Tabatabai, Ali-Reza
    Lewis, Brian T.
    Menon, Vijay
    Murphy, Brian R.
    Saha, Bratin
    Shpeisman, Tatiana
    ACM SIGPLAN NOTICES, 2006, 41 (06) : 26 - 37
  • [5] A General Offloading Approach for Near-DRAM Processing-In-Memory Architectures
    Chen, Dan
    Jin, Hai
    Zheng, Long
    Huang, Yu
    Yao, Pengcheng
    Gui, Chuangyi
    Wang, Qinggang
    Liu, Haifeng
    He, Haiheng
    Liao, Xiaofei
    Zheng, Ran
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 246 - 257
  • [6] Language Runtime Support for NVM/DRAM Hybrid Main Memory
    Nakagawa, Gaku
    Oikawa, Shuichi
    2014 IEEE COOL CHIPS XVII, 2014,
  • [7] Accelerating CNN Training With Concurrent Execution of GPU and Processing-in-Memory
    Choi, Jungwoo
    Lee, Hyuk-Jae
    Sohn, Kyomin
    Yu, Hak-Soo
    Rhee, Chae Eun
    IEEE Access, 2024, 12 : 160190 - 160204
  • [8] Compiler and runtime support for shared memory parallelization of data mining algorithms
    Li, XG
    Jin, RM
    Agrawal, G
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2005, 2481 : 265 - 279
  • [9] CAIRO: A Compiler-Assisted Technique for Enabling Instruction-Level Offloading of Processing-In-Memory
    Hadidi, Ramyad
    Nai, Lifeng
    Kim, Hyojong
    Kim, Hyesoon
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (04)
  • [10] Runtime Row/Column Activation Pruning for ReRAM-based Processing-in-Memory DNN Accelerators
    Jiang, Xikun
    Shen, Zhaoyan
    Sun, Siqing
    Yin, Ping
    Jia, Zhiping
    Ju, Lei
    Zhang, Zhiyong
    Yu, Dongxiao
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,