A Heterogeneous PIM Hardware-Software Co-Design for Energy-Efficient Graph Processing

被引:34
|
作者
Huang, Yu [1 ]
Zheng, Long [1 ]
Yao, Pengcheng [1 ]
Zhao, Jieshan [1 ]
Liao, Xiaofei [1 ]
Jin, Hai [1 ]
Xue, Jingling [2 ]
机构
[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan, Peoples R China
[2] UNSW Sydney, Sydney, NSW, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
accelerator; graph processing; heterogeneous architecture; processing-in-memory; PERFORMANCE;
D O I
10.1109/IPDPS47924.2020.00076
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-In-Memory (PIM) is an emerging technology that addresses the memory bottleneck of graph processing. In general, analog memristor-based PIM promises high parallelism provided that the underlying matrix-structured crossbar can be fully utilized while digital CMOS-based PIM has a faster single-edge execution but its parallelism can be low. In this paper, we observe that there is no absolute winner between these two representative PIM technologies for graph applications, which often exhibit irregular workloads. To reap the best of both worlds, we introduce a new heterogeneous PIM hardware, called Hetraph, to facilitate energy-efficient graph processing. Hetraph incorporates memristor-based analog computation units (for high-parallelism computing) and CMOS-based digital computation cores (for efficient computing) on the same logic layer of a 3D die-stacked memory device. To maximize the hardware utilization, our software design offers a hardware heterogeneity-aware execution model and a workload offloading mechanism. For performance speedups, such a hardware-software co-design outperforms the state-of-the-art by 7.54x (CPU), 1.56x (GPU), 4.13x (memristor-based PIM) and 3.05x (CMOS-based PIM), on average. For energy savings, Hetraph reduces the energy consumption by 57.58x (CPU), 19.93x (GPU), 14.02x (memristor-based PIM) and 10.48x (CMOS-based PIM), on average.
引用
收藏
页码:684 / 695
页数:12
相关论文
共 50 条
  • [31] Speed optimization of AES algorithm with Hardware-Software Co-design
    Dixit, Preeti
    Zalke, Jitendra
    Admane, Sharmik
    2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 793 - 798
  • [32] HSCoNAS: Hardware-Software Co-Design of Efficient DNNs via Neural Architecture Search
    Luo, Xiangzhong
    Liu, Di
    Huai, Shuo
    Liu, Weichen
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 418 - 421
  • [33] Hardware-software co-design of an automatic fingerprint acquisition system
    Fons, M
    Fons, F
    Canyellas, N
    Cantó, E
    López, M
    ISIE 2005: Proceedings of the IEEE International Symposium on Industrial Electronics 2005, Vols 1- 4, 2005, : 1123 - 1128
  • [34] APPROXIMATE COMPUTING: Unlocking Efficiency with Hardware-Software Co-Design
    Ceze, Luis
    Sampson, Adrian
    GETMOBILE-MOBILE COMPUTING & COMMUNICATIONS REVIEW, 2016, 20 (03) : 12 - 16
  • [35] Hardware-Software Co-design for BLDC Motor Speed Controller Design
    Alecsa, Bogdan
    Onea, Alexandru
    ADVANCED MATERIALS RESEARCH II, PTS 1 AND 2, 2012, 463-464 : 1256 - +
  • [36] HAAC: A Hardware-Software Co-Design to Accelerate Garbled Circuits
    Mo, Jianqiao
    Gopinath, Jayanth
    Reagen, Brandon
    PROCEEDINGS OF THE 2023 THE 50TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, ISCA 2023, 2023, : 128 - 140
  • [37] A Novel Hardware-Software Co-Design and Implementation of the HOG Algorithm
    Ghaffari, Sina
    Soleimani, Parastoo
    Li, Kin Fun
    Capson, David W.
    SENSORS, 2020, 20 (19) : 1 - 21
  • [38] Hardware-software co-design of a fingerprint image enhancement algorithm
    Lopez, Mariano
    Canto, Enrique
    Fons, Mariano
    IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 3655 - +
  • [39] Hardware-Software Co-design to Accelerate Neural Network Applications
    Imani, Mohsen
    Garcia, Ricardo
    Gupta, Saransh
    Rosing, Tajana
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2019, 15 (02)
  • [40] System level memory optimization for hardware-software co-design
    Danckaert, K
    Catthoor, F
    DeMan, H
    PROCEEDINGS OF THE FIFTH INTERNATIONAL WORKSHOP ON HARDWARE/SOFTWARE CODESIGN (CODES/CASHE '97), 1997, : 55 - 59