PPR-partitioning: a distributed graph partitioning algorithm based on the personalized PageRank vectors in vertex-centric systems

Cited by: 0
Authors
Nasrin Mazaheri Soudani
Afsaneh Fatemi
Mohammadali Nematbakhsh
Affiliation
[1] University of Isfahan, Department of Computer Engineering, Faculty of Software Engineering
Source
Knowledge and Information Systems, 2019, 61 (02)
Keywords
Graph partitioning; Big graphs; Personalized PageRank; Vertex-centric systems
DOI
Not available
Abstract
Relations among data items in most big data sets, such as social network data, can be modeled as graphs. This modeling produces big graphs with many vertices and edges. Balanced k-way graph partitioning is a common problem for big graphs and has applications in several fields. Many approximate solutions exist for this problem; however, most of them do not scale to big graphs and cannot be executed in a distributed manner. The vertex-centric model was recently introduced as a scalable distributed processing method for big graphs, and a few graph partitioning methods are based on it. Existing approaches consider only the one-step neighbors of vertices and ignore neighbors at greater distances. This paper introduces a distributed method, based on the vertex-centric model, for balanced k-way graph partitioning. The method uses the personalized PageRank vectors of vertices and partitions to decide which partition each vertex joins. It has been implemented in the Giraph system and evaluated on several synthetic and real graphs. Experimental results show that the method scales to big graphs. It also produces higher-quality partitions than state-of-the-art stream-based methods and distributed methods based on the vertex-centric programming model, and its results are close to those of the Metis method.
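To make the idea of looking beyond one-step neighbors concrete, the following is a minimal single-machine sketch, not the paper's distributed Giraph algorithm. It approximates a personalized PageRank (PPR) vector by power iteration and sums its mass per partition. The function names, the teleport probability `alpha`, the toy graph, and the affinity rule are all illustrative assumptions; the abstract does not specify the method's actual scoring rule.

```python
from collections import defaultdict


def personalized_pagerank(adj, seed, alpha=0.15, iters=50):
    """Approximate the PPR vector of `seed` by power iteration.

    adj   : dict mapping each vertex to a list of its neighbors
            (undirected graph, every edge listed in both directions).
    alpha : teleport (restart) probability; 0.15 is an assumed default.
    """
    rank = {v: 0.0 for v in adj}
    rank[seed] = 1.0
    for _ in range(iters):
        nxt = {v: 0.0 for v in adj}
        for v, nbrs in adj.items():
            if not nbrs:
                # Dangling vertex: send its walking mass back to the seed.
                nxt[seed] += (1 - alpha) * rank[v]
                continue
            share = (1 - alpha) * rank[v] / len(nbrs)
            for u in nbrs:
                nxt[u] += share
        # Total mass is 1, so the restart step adds exactly alpha to the seed.
        nxt[seed] += alpha
        rank = nxt
    return rank


def partition_affinity(ppr_vec, part):
    """Sum PPR mass per partition (hypothetical multi-hop affinity score)."""
    scores = defaultdict(float)
    for u, p in part.items():
        scores[p] += ppr_vec[u]
    return dict(scores)


def edge_cut(adj, part):
    """Number of undirected edges whose endpoints lie in different partitions."""
    return sum(1 for v in adj for u in adj[v] if part[v] != part[u]) // 2


# Toy graph: two triangles (0-1-2 and 3-4-5) joined by the edge 2-3.
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
       3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
part = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}  # a balanced 2-way partition

ppr0 = personalized_pagerank(adj, seed=0)
affinity0 = partition_affinity(ppr0, part)
cut = edge_cut(adj, part)
```

In this sketch the PPR vector of vertex 0 concentrates on its own triangle, so its affinity toward partition 0 exceeds its affinity toward partition 1 even though the score reflects vertices several hops away. Edge cut is the usual quality measure for such a partition.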
Pages: 847 - 871
Page count: 24
Related papers
18 in total
  • [1] PPR-partitioning: a distributed graph partitioning algorithm based on the personalized PageRank vectors in vertex-centric systems
    Soudani, Nasrin Mazaheri
    Fatemi, Afsaneh
    Nematbakhsh, Mohammadali
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (02) : 847 - 871
  • [2] An investigation of big graph partitioning methods for distribution of graphs in vertex-centric systems
    Mazaheri Soudani, Nasrin
    Fatemi, Afsaneh
    Nematbakhsh, Mohammadali
    DISTRIBUTED AND PARALLEL DATABASES, 2020, 38 (01) : 1 - 29
  • [3] An investigation of big graph partitioning methods for distribution of graphs in vertex-centric systems
    Nasrin Mazaheri Soudani
    Afsaneh Fatemi
    Mohammadali Nematbakhsh
    Distributed and Parallel Databases, 2020, 38 : 1 - 29
  • [4] Revolver: Vertex-centric Graph Partitioning Using Reinforcement Learning
    Mofrad, Mohammad Hasanzadeh
    Melhem, Rami
    Hammoud, Mohammad
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018 : 818 - 821
  • [5] Fast Failure Recovery in Vertex-Centric Distributed Graph Processing Systems
    Lu, Wei
    Shen, Yanyan
    Wang, Tongtong
    Zhang, Meihui
    Jagadish, H. V.
    Du, Xiaoyong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 733 - 746
  • [6] iPartition: a distributed partitioning algorithm for block-centric graph processing systems
    Masoud Sagharichian
    Morteza Alipour Langouri
    The Journal of Supercomputing, 2023, 79 : 21116 - 21143
  • [7] iPartition: a distributed partitioning algorithm for block-centric graph processing systems
    Sagharichian, Masoud
    Langouri, Morteza Alipour
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (18) : 21116 - 21143
  • [8] VC-Tune: Tuning and Exploring Distributed Vertex-Centric Graph Systems
    Zhu, Zichen
    Luo, Siqiang
    Xiao, Xiaokui
    Yang, Yin
    Mo, Dingheng
    Han, Yufei
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022 : 3142 - 3145
  • [9] ScaleG: A Distributed Disk-Based System for Vertex-Centric Graph Processing
    Wang, Xubo
    Wen, Dong
    Qin, Lu
    Chang, Lijun
    Zhang, Ying
    Zhang, Wenjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 2019 - 2033
  • [10] AKIN : A Streaming Graph Partitioning Algorithm for Distributed Graph Storage Systems
    Zhang, Wei
    Chen, Yong
    Dai, Dong
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018 : 183 - 192