A Heterogeneous PIM Hardware-Software Co-Design for Energy-Efficient Graph Processing

被引:34
|
作者
Huang, Yu [1 ]
Zheng, Long [1 ]
Yao, Pengcheng [1 ]
Zhao, Jieshan [1 ]
Liao, Xiaofei [1 ]
Jin, Hai [1 ]
Xue, Jingling [2 ]
机构
[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan, Peoples R China
[2] UNSW Sydney, Sydney, NSW, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
accelerator; graph processing; heterogeneous architecture; processing-in-memory; PERFORMANCE;
D O I
10.1109/IPDPS47924.2020.00076
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-In-Memory (PIM) is an emerging technology that addresses the memory bottleneck of graph processing. In general, analog memristor-based PIM promises high parallelism provided that the underlying matrix-structured crossbar can be fully utilized while digital CMOS-based PIM has a faster single-edge execution but its parallelism can be low. In this paper, we observe that there is no absolute winner between these two representative PIM technologies for graph applications, which often exhibit irregular workloads. To reap the best of both worlds, we introduce a new heterogeneous PIM hardware, called Hetraph, to facilitate energy-efficient graph processing. Hetraph incorporates memristor-based analog computation units (for high-parallelism computing) and CMOS-based digital computation cores (for efficient computing) on the same logic layer of a 3D die-stacked memory device. To maximize the hardware utilization, our software design offers a hardware heterogeneity-aware execution model and a workload offloading mechanism. For performance speedups, such a hardware-software co-design outperforms the state-of-the-art by 7.54x (CPU), 1.56x (GPU), 4.13x (memristor-based PIM) and 3.05x (CMOS-based PIM), on average. For energy savings, Hetraph reduces the energy consumption by 57.58x (CPU), 19.93x (GPU), 14.02x (memristor-based PIM) and 10.48x (CMOS-based PIM), on average.
引用
收藏
页码:684 / 695
页数:12
相关论文
共 50 条
  • [21] OpenCL-based Hardware-Software Co-design Methodology for Image Processing Implementation on Heterogeneous FPGA Platform
    Ayat, Sayed Omid
    Khalil-Hani, Mohamed
    Bakhteri, Rabia
    PROCEEDINGS 5TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2015), 2015, : 36 - 41
  • [22] Hardware-Software Co-Design for Brain-Computer Interfaces
    Karageorgos, Ioannis
    Sriram, Karthik
    Vesely, Jan
    Wu, Michael
    Powell, Marc
    Borton, David
    Manohar, Rajit
    Bhattacharjee, Abhishek
    2020 ACM/IEEE 47TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2020), 2020, : 391 - 404
  • [23] Hardware-Software Co-design Approach for Deep Learning Inference
    Paul, Debdeep
    Singh, Jawar
    Mathew, Jimson
    2019 7TH INTERNATIONAL CONFERENCE ON SMART COMPUTING & COMMUNICATIONS (ICSCC), 2019, : 118 - 122
  • [24] Hardware-software Co-design of Slimmed Optical Neural Networks
    Zhao, Zheng
    Liu, Derong
    Li, Meng
    Ying, Zhoufeng
    Zhang, Lu
    Xu, Biying
    Yu, Bei
    Chen, Ray T.
    Pan, David Z.
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 705 - 710
  • [25] Hardware-Software Co-Design for Face Recognition on FPGA SoCs
    Wang, Hao
    Cao, Shan
    Xu, Shugong
    Zhang, Shunqing
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [26] Energy-Efficient Inference With Software-Hardware Co-Design for Sustainable Artificial Intelligence of Things
    Dai, Shengxin
    Luo, Zheng
    Luo, Wendian
    Wang, Siyi
    Dai, Cheng
    Guo, Bing
    Zhou, Xiaokang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 39170 - 39182
  • [27] Efficient Implementation of QRD-RLS Algorithm using Hardware-Software Co-design
    Lodha, Nupur
    Rai, Nivesh
    Krishnamurthy, Aarthy
    Venkataraman, Hrishikesh
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2973 - +
  • [28] A novel hardware-software co-design for automatic white balance
    Chen, Chin-Hsing
    Tan, Sun-Yen
    Huang, Wen-Tzeng
    LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 203 - +
  • [29] Enclavisor: A Hardware-Software Co-Design for Enclaves on Untrusted Cloud
    Gu, Jinyu
    Wu, Xinyue
    Zhu, Bojun
    Xia, Yubin
    Zang, Binyu
    Guan, Haibing
    Chen, Haibo
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (10) : 1598 - 1611
  • [30] Hardware-software co-design of resource constrained systems on a chip
    Thepayasuwan, N
    Doboli, A
    24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS, PROCEEDINGS, 2004, : 818 - 823