Towards Large-Scale Graph Stream Processing Platform

被引:19
|
作者
Suzumura, Toyotaro [1 ]
Nishii, Shunsuke [2 ]
Ganse, Masaru [2 ]
机构
[1] JST CREST, IBM Res, Tokyo, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
关键词
DSMS; Data Stream Management System; Page Rank; Random Walk with Restart; Distributed computing; Graph algorithms;
D O I
10.1145/2567948.2580051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, real-time data mining for large-scale time evolving graphs is becoming a hot research topic. Most of the prior arts target relatively static graphs and also process them in store-and-process batch processing model. In this paper we propose a method of applying on-the-fly and incremental graph stream computing model to such dynamic graph analysis. To process large-scale graph streams on a cluster of nodes dynamically in a scalable fashion, we propose an incremental large-scale graph processing model called "Incremental GIM-V (Generalized Iterative Matrix-Vector Multiplication)". We also design and implement UNICORN, a system that adopts the proposed incremental processing model on top of IBM InfoSphere Streams. Our performance evaluation demonstrates that our method achieves up to 48% speedup on PageRank with Scale 16 Log-normal Graph (vertexes=65,536, edges=8,364,525) with 4 nodes, 3023% speedup on Random walk with Restart with Kronecker Graph with Scale 18 (vertexes=262,144, edges=8,388,608) with 4 nodes against original GIM-V.
引用
收藏
页码:1321 / 1326
页数:6
相关论文
共 50 条
  • [1] Towards GPU-Accelerated Large-Scale Graph Processing in the Cloud
    Zhong, Jianlong
    He, Bingsheng
    [J]. 2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, : 9 - 16
  • [2] Towards a framework for large-scale multimedia data storage and processing on Hadoop platform
    Wei Kuang Lai
    Yi-Uan Chen
    Tin-Yu Wu
    Mohammad S. Obaidat
    [J]. The Journal of Supercomputing, 2014, 68 : 488 - 507
  • [3] Towards a framework for large-scale multimedia data storage and processing on Hadoop platform
    Lai, Wei Kuang
    Chen, Yi-Uan
    Wu, Tin-Yu
    Obaidat, Mohammad S.
    [J]. JOURNAL OF SUPERCOMPUTING, 2014, 68 (01): : 488 - 507
  • [4] Large-scale graph processing systems: a survey
    Liu, Ning
    Li, Dong-sheng
    Zhang, Yi-ming
    Li, Xiong-lve
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (03) : 384 - 404
  • [5] Distributed large-scale graph processing on FPGAs
    Sahebi, Amin
    Barbone, Marco
    Procaccini, Marco
    Luk, Wayne
    Gaydadjiev, Georgi
    Giorgi, Roberto
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [6] Large-scale graph processing systems: a survey
    Ning Liu
    Dong-sheng Li
    Yi-ming Zhang
    Xiong-lve Li
    [J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 384 - 404
  • [7] Distributed large-scale graph processing on FPGAs
    Amin Sahebi
    Marco Barbone
    Marco Procaccini
    Wayne Luk
    Georgi Gaydadjiev
    Roberto Giorgi
    [J]. Journal of Big Data, 10
  • [8] Optimizing data stream processing for large-scale applications
    Cappellari, Paolo
    Roantree, Mark
    Chun, Soon Ae
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09): : 1607 - 1641
  • [9] Large-Scale Graph Processing on Emerging Storage Devices
    Elyasi, Nima
    Choi, Changho
    Sivasubramaniam, Anand
    [J]. PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 309 - 316
  • [10] A Novel Clustering Algorithm for Large-Scale Graph Processing
    Qu, Zhaoyang
    Ding, Wei
    Qu, Nan
    Yan, Jia
    Wang, Ling
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 349 - 358