High-throughput and Energy-efficient Graph Processing on FPGA

被引:60
|
作者
Zhou, Shijie [1 ]
Chelmis, Charalampos [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/FCCM.2016.35
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel design for large-scale graph processing on FPGA. Our design uses large external memory for storing massive graph data and FPGA for acceleration, and leverages edge-centric computing principles. We propose a data layout which optimizes the external memory performance and leads to an efficient memory activation schedule to reduce on-chip memory power consumption. Further, we develop a parallel architecture on FPGA which can saturate the external memory bandwidth and concurrently process multiple input data to increase throughput. We use our design to accelerate several classic graph algorithms, including single-source shortest path, weakly connected component, and minimum spanning tree. Experimental results show that for all the considered graph algorithms, our design achieves high throughput of over 600 million traversed edges per second (MTEPS) and high energy-efficiency of over 30 MTEPS/W. Compared with a baseline design, our optimizations result in over 3.6x throughput and 5.8x energy-efficiency improvements, respectively. Our design achieves 32% throughput improvement when compared with state-of-the-art FPGA designs, and up to 7.8x speedup when compared with state-of-the-art multi-core implementation.
引用
收藏
页码:103 / 110
页数:8
相关论文
共 50 条
  • [31] Energy-Efficient and High-Throughput CNN Inference on Embedded CPUs-GPUs MPSoCs
    Tang, Erqian
    Minakova, Svetlana
    Stefanov, Todor
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2021, 2022, 13227 : 127 - 143
  • [32] An Energy-Efficient FPGA-Based Packet Processing Framework
    Daniel Horvath
    Imre Bertalan
    Istvan Moldovan
    Tuan Anh Trinh
    NETWORKED SERVICES AND APPLICATIONS - ENGINEERING, CONTROL AND MANAGEMENT, 2010, 6164 : 31 - +
  • [33] Energy-Efficient Stochastic Matrix Function Estimator for Graph Analytics on FPGA
    Giefers, Heiner
    Staar, Peter
    Polig, Raphael
    2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,
  • [34] Area- and energy-efficient high-throughput QC-LDPC encoder for space applications
    Li, Lintao
    Lv, Jiayi
    Li, Yimin
    ELECTRONICS LETTERS, 2023, 59 (23)
  • [35] Energy-Efficient Histogram on FPGA
    Sanny, Andrea
    Yang, Yi-Hua E.
    Prasanna, Viktor K.
    2014 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2014,
  • [36] Domino: Graph Processing Services on Energy-efficient Hardware Accelerator
    Xu, Chongchong
    Wang, Chao
    Gong, Lei
    Jin, Lihui
    Li, Xi
    Zhou, Xuehai
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 274 - 281
  • [37] HLS-based High-throughput andWork-efficient Synthesizable Graph Processing Template Pipeline
    Ahangari, Hamzeh
    Ozdal, Muhammet Mustafa
    Ozturk, Ozcan
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (02)
  • [38] FPGA implementation of a high-throughput memory-efficient LDPC decoder
    School of Electronic Engineering, Xidian Univ., Xi'an 710071, China
    不详
    Xi'an Dianzi Keji Daxue Xuebao, 2008, 3 (427-432):
  • [39] LRADNN: High-Throughput and Energy-Efficient Deep Neural Network Accelerator using Low Rank Approximation
    Zhu, Jingyang
    Qian, Zhiliang
    Tsui, Chi-Ying
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 581 - 586
  • [40] A High-Throughput Energy-Efficient Implementation of Successive Cancellation Decoder for Polar Codes Using Combinational Logic
    Dizdar, Onur
    Arikan, Erdal
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2016, 63 (03) : 436 - 447