Bipartite-Oriented Distributed Graph Partitioning for Big Learning

被引:8
|
作者
Chen, Rong [1 ]
Shi, Jia-Xin [1 ]
Chen, Hai-Bo [1 ]
Zang, Bin-Yu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Scalable Comp & Syst, Inst Parallel & Distributed Syst, Shanghai 200240, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金; 上海市科技启明星计划;
关键词
bipartite graph; graph partitioning; graph-parallel system;
D O I
10.1007/s11390-015-1501-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many machine learning and data mining (MLDM) problems like recommendation, topic modeling, and medical diagnosis can be modeled as computing on bipartite graphs. However, most distributed graph-parallel systems are oblivious to the unique characteristics in such graphs and existing online graph partitioning algorithms usually cause excessive replication of vertices as well as significant pressure on network communication. This article identifies the challenges and opportunities of partitioning bipartite graphs for distributed MLDM processing and proposes BiGraph, a set of bipartite-oriented graph partitioning algorithms. BiGraph leverages observations such as the skewed distribution of vertices, discriminated computation load and imbalanced data sizes between the two subsets of vertices to derive a set of optimal graph partitioning algorithms that result in minimal vertex replication and network communication. BiGraph has been implemented on Power Graph and is shown to have a performance boost up to 17.75X (from 1.16X) for four typical MLDM algorithms, due to reducing up to 80% vertex replication, and up to 96% network traffic.
引用
收藏
页码:20 / 29
页数:10
相关论文
共 50 条
  • [1] Bipartite-Oriented Distributed Graph Partitioning for Big Learning
    Rong Chen
    Jia-Xin Shi
    Hai-Bo Chen
    Bin-Yu Zang
    [J]. Journal of Computer Science and Technology, 2015, 30 : 20 - 29
  • [2] BiFennel: Fast Bipartite Graph Partitioning Algorithm for Big Data
    Wang, Lyu-Wei
    Chen, Shih-Chang
    Chen, Wenguang
    Hsiao, Hung-Chang
    Chung, Yeh-Ching
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 715 - 720
  • [3] Tradeoff Options for Bipartite Graph Partitioning
    Mackenzie, Joel
    Petri, Matthias
    Moffat, Alistair
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8644 - 8657
  • [4] Preserving patterns in bipartite graph partitioning
    Hu, Tianming
    Qu, Chao
    Tan, Chew Lim
    Sung, Sam Yuan
    Zhou, Wenjun
    [J]. ICTAI-2006: EIGHTEENTH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 489 - 496
  • [5] Partitioning the vertex set of a bipartite graph into complete bipartite subgraphs
    Duginov, Oleg
    [J]. DISCRETE MATHEMATICS AND THEORETICAL COMPUTER SCIENCE, 2014, 16 (03): : 203 - 214
  • [6] Oriented bipartite graphs and the Goldbach graph
    Das, Sandip
    Ghosh, Prantar
    Ghosh, Shamik
    Sen, Sagnik
    [J]. DISCRETE MATHEMATICS, 2021, 344 (09)
  • [7] Graph Partitioning for Distributed Graph Processing
    Onizuka M.
    Fujimori T.
    Shiokawa H.
    [J]. Data Science and Engineering, 2017, 2 (1) : 94 - 105
  • [8] Faster Index Reordering with Bipartite Graph Partitioning
    Mackenzie, Joel
    Petri, Matthias
    Moffat, Alistair
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1910 - 1914
  • [9] Bipartite Grid Partitioning of a Random Geometric Graph
    Chen, Zizhen
    Matula, David W.
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS), 2017, : 163 - 169
  • [10] LOCAL BIPARTITE TURAN GRAPHS AND GRAPH PARTITIONING
    LEE, J
    RYAN, J
    [J]. NETWORKS, 1994, 24 (03) : 131 - 145