MultiLyra: Scalable Distributed Evaluation of Batches of Iterative Graph Queries

被引:0
|
作者
Mazloumi, Abbas [1 ]
Jiang, Xiaolin [1 ]
Gupta, Rajiv [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
关键词
Distributed Graph Processing; Query Batching; Reuse; Redundancy Elimination; FRAMEWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph analytics is being increasingly used for analyzing large scale networks representing entities and relationships in many domains. Various distributed graph processing frameworks have been developed to deliver scalable performance for evaluation of individual iterative graph queries. In practice though, we may need to evaluate many queries. In this paper we develop MultiLyra, a distributed framework that efficiently evaluates a batch of graph queries. To deliver high performance, this system is designed to amortize the communication and synchronization costs of distributed query evaluation across multiple queries. Our experiments with MultiLyra for four iterative algorithms on a cluster of four 32-core machines show the following. Basic hatching technique for amortizing communication and synchronization costs yield maximum speedups ranging from 3.08x to 5.55x across different batch sizes, algorithms and input graphs. After employing optimizations that improve scalability of expensive phases and perform reuse across the distributed computation, the improved maximum speedups range from 7.35x to 11.86x. MultiLyra also delivers superior scalahilty than the Quegel batch processing system.
引用
收藏
页码:349 / 358
页数:10
相关论文
共 50 条
  • [1] BEAD: Batched Evaluation of Iterative Graph Queries with Evolving Analytics Demands
    Mazloumi, Abbas
    Xu, Chengshuo
    Zhao, Zhijia
    Gupta, Rajiv
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 461 - 468
  • [2] SimGQ: Simultaneously Evaluating Iterative Graph Queries
    Xu, Chengshuo
    Mazloumi, Abbas
    Jiang, Xiaolin
    Gupta, Rajiv
    [J]. 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 1 - 10
  • [3] Scalable Iterative Graph Duplicate Detection
    Herschel, Melanie
    Naumann, Felix
    Szott, Sascha
    Taubert, Maik
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (11) : 2094 - 2108
  • [4] A Scalable Distributed Graph Partitioner
    Margo, Daniel
    Seltzer, Margo
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 1478 - 1489
  • [5] Graph partitioning for scalable distributed graph computations
    Buluc, Aydin
    Madduri, Kamesh
    [J]. GRAPH PARTITIONING AND GRAPH CLUSTERING, 2013, 588 : 83 - +
  • [6] Scalable Processing of Flexible Graph Pattern Queries on the Cloud
    Ravindra, Padmashree
    Anyanwu, Kemafor
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 169 - 170
  • [7] Distributed Graph Path Queries using Spark
    Balaji, Janani
    Sunderraman, Rajshekhar
    [J]. PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC), VOL 2, 2016, : 326 - 331
  • [8] Graph Topology Abstraction for Distributed Path Queries
    Balaji, Janani
    Sunderraman, Rajshekhar
    [J]. PROCEEDINGS OF THE ACM WORKSHOP ON HIGH PERFORMANCE GRAPH PROCESSING (HPGP'16), 2016, : 27 - 34
  • [9] Critical Parameters for Scalable Distributed Learning with Large Batches and Asynchronous Updates
    Stich, Sebastian U.
    Mohtashami, Amirkeivan
    Jaggi, Martin
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [10] Sparklify: A Scalable Software Component for Efficient Evaluation of SPARQL Queries over Distributed RDF Datasets
    Stadler, Claus
    Sejdiu, Gezim
    Graux, Damien
    Lehmann, Jens
    [J]. SEMANTIC WEB - ISWC 2019, PT II, 2019, 11779 : 293 - 308