DPM: A novel distributed large-scale social graph processing framework for link prediction algorithms

被引:13
|
作者
Corbellini, Alejandro [1 ]
Godoy, Daniela [1 ]
Mateos, Cristian [1 ]
Schiaffino, Silvia [1 ]
Zunino, Alejandro [1 ]
机构
[1] UNICEN, ISISTAN CONICET, Paraje Arroyo Seco Campus Univ, RA-7000 Buenos Aires, DF, Argentina
关键词
Distributed graph processing; Recommendation algorithms; Online Social Networks;
D O I
10.1016/j.future.2017.02.025
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale graphs have become ubiquitous in social media. Computer-based recommendations in these huge graphs pose challenges in terms of algorithm design and resource usage efficiency when processing recommendations in distributed computing environments. Moreover, recommendation algorithms for graphs, particularly link prediction algorithms, have different requirements depending of the way the underlying graph is traversed. Path-based algorithms usually perform traversals in different directions to build a large ranking of vertices to recommend, whereas random walk-based algorithms build an initial subgraph and perform several iterations on those vertices to compute the final ranking. In this work, we propose a distributed graph processing framework called Distributed Partitioned Merge (DPM), which supports both types of algorithms and we compare its performance and resource usage w.r.t. two relevant frameworks, namely Fork-Join and Pregel. In our experiments, we show that in most tests DPM outperforms both Pregel and Fork-Join in terms of recommendation time, with a minor penalization in network usage in some scenarios. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:474 / 480
页数:7
相关论文
共 50 条
  • [1] Distributed large-scale graph processing on FPGAs
    Sahebi, Amin
    Barbone, Marco
    Procaccini, Marco
    Luk, Wayne
    Gaydadjiev, Georgi
    Giorgi, Roberto
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [2] Distributed large-scale graph processing on FPGAs
    Amin Sahebi
    Marco Barbone
    Marco Procaccini
    Wayne Luk
    Georgi Gaydadjiev
    Roberto Giorgi
    [J]. Journal of Big Data, 10
  • [3] Marbor: A Novel Large-Scale Graph Data Storage and Processing Framework
    Zhou, Wei
    Gao, Yun
    Han, Jizhong
    Xu, Zhiyong
    [J]. 2014 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2014,
  • [4] A Novel Clustering Algorithm for Large-Scale Graph Processing
    Qu, Zhaoyang
    Ding, Wei
    Qu, Nan
    Yan, Jia
    Wang, Ling
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 349 - 358
  • [5] OpenBioLink: a benchmarking framework for large-scale biomedical link prediction
    Breit, Anna
    Ott, Simon
    Agibetov, Asan
    Samwald, Matthias
    [J]. BIOINFORMATICS, 2020, 36 (13) : 4097 - 4098
  • [6] Novel link prediction for large-scale miRNA-lncRNA interaction network in a bipartite graph
    Zhi-An Huang
    Yu-An Huang
    Zhu-Hong You
    Zexuan Zhu
    Yiwen Sun
    [J]. BMC Medical Genomics, 11
  • [7] Novel link prediction for large-scale miRNA-lncRNA interaction network in a bipartite graph
    Huang, Zhi-An
    Huang, Yu-An
    You, Zhu-Hong
    Zhu, Zexuan
    Sun, Yiwen
    [J]. BMC MEDICAL GENOMICS, 2018, 11
  • [8] An Analysis of Distributed Programming Models and Frameworks for Large-scale Graph Processing
    Corbellini, Alejandro
    Godoy, Daniela
    Mateos, Cristian
    Schiaffino, Silvia
    Zunino, Alejandro
    [J]. IETE JOURNAL OF RESEARCH, 2022, 68 (04) : 3065 - 3073
  • [9] Distributed frameworks and parallel algorithms for processing large-scale geographic data
    Hawick, KA
    Coddington, PD
    James, HA
    [J]. PARALLEL COMPUTING, 2003, 29 (10) : 1297 - 1333
  • [10] Importance of Runtime Considerations in Performance Engineering of Large-Scale Distributed Graph Algorithms
    Firoz, Jesun Sahariar
    Kanewala, Thejaka Amila
    Zalewski, Marcin
    Barnas, Martina
    Lumsdaine, Andrew
    [J]. EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS, 2015, 9523 : 553 - 564