A Reconfigurable Computing Approach for Efficient and Scalable Parallel Graph Exploration

被引:0
|
作者
Betkaoui, Brahim [1 ]
Wang, Yu [2 ]
Thomas, David B. [3 ]
Luk, Wayne [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London, England
[2] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing, Peoples R China
[3] Univ London Imperial Coll Sci Technol & Med, Dept Elect & Elect Engn, London, England
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
BREADTH-1ST SEARCH; ALGORITHMS; NETWORK;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In many application domains, data are represented using large graphs involving millions of vertices and billions of edges. Graph exploration algorithms, such as breadth-first search (BFS), are largely dominated by memory latency and are challenging to process efficiently. In this paper, we present a reconfigurable hardware methodology for efficient parallel processing of large-scale graph exploration problems. Our methodology is based on a reconfigurable hardware architecture which decouples computation and communication while keeping multiple memory requests in flight at any given time, taking advantage of the hardware capabilities of both FPGAs and the parallel memory subsystem. To validate our methodology, we provide a detailed design description of the Breadth-First Search algorithm on an FPGA-based high performance computing system. Using graph data based on the power-law graphs found in real-word problems, we are able to achieve performance results that are superior to those of high performance multi-core systems in the recent literature for large graph instances, and a throughput in excess of 2.5 billion traversed edges per second on RMAT graphs with 16 million vertices and over a billion edges. Using four Virtex-5 LX330 FPGAs based on 65nm technology and running at 75MHz, our BFS design achieves more than twice the speed of a 32-core Xeon X7560 based on 45nm technology and running at 2.26GHz.
引用
收藏
页码:8 / 15
页数:8
相关论文
共 50 条
  • [1] Efficient Parallel Computing of Graph Edit Distance
    Wang, Ran
    Fang, Yixiang
    Feng, Xing
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 233 - 240
  • [2] Efficient and Scalable Graph Parallel Processing With Symbolic Execution
    Zheng, Long
    Liao, Xiaofei
    Jin, Hai
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2018, 15 (01)
  • [3] Work Efficient Parallel Algorithms for Large Graph Exploration
    Banerjee, Dip Sankar
    Sharma, Shashank
    Kothapalli, Kishore
    [J]. 2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 433 - 442
  • [4] Highly Efficient Reconfigurable Parallel Graph Cuts for Embedded Vision
    Nikitakis, Antonis
    Papaefstathiou, Ioannis
    [J]. PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1405 - 1410
  • [5] Theoretically Efficient Parallel Graph Algorithms Can Be Fast and Scalable
    Dhulipala, Laxman
    Blelloch, Guy E.
    Shun, Julian
    [J]. ACM TRANSACTIONS ON PARALLEL COMPUTING, 2021, 8 (01)
  • [6] Theoretically Efficient Parallel Graph Algorithms Can Be Fast and Scalable
    Dhulipala, Laxman
    Blelloch, Guy E.
    Shun, Julian
    [J]. SPAA'18: PROCEEDINGS OF THE 30TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2018, : 393 - 404
  • [7] A Graph Database Approach for Efficient and Scalable Management of Simulations
    Hwang, Jeong-Hyon
    Birnbaum, Jeremy
    Vabbalareddy, Rohini
    Ravi, S. S.
    Park, Chanyeol
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1310 - +
  • [8] Kernel scheduling techniques for efficient solution space exploration in reconfigurable computing
    Maestre, R
    Kurdahi, F
    Fernandez, M
    Hermida, R
    Bagherzadeh, N
    Singh, H
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2001, 47 (3-4) : 277 - 292
  • [9] A scalable and efficient algorithm for computing the city block distance transform on reconfigurable meshes
    Pan, Y
    Trahan, JL
    Vaidyanathan, R
    [J]. COMPUTER JOURNAL, 1997, 40 (07): : 435 - 440
  • [10] Designing scalable and efficient parallel clustering algorithms on arrays with reconfigurable optical buses
    Wu, CH
    Horng, SJ
    Chen, YW
    Lee, WY
    [J]. IMAGE AND VISION COMPUTING, 2000, 18 (13) : 1033 - 1043