An Efficient Data Distribution Strategy for Distributed Graph Processing System

被引:0
|
作者
Mukherjee, Aradhita [1 ]
Chaki, Rituparna [2 ]
Chaki, Nabendu [1 ]
机构
[1] Univ Calcutta, Dept Comp Sci & Engn, JD II,Sect III, Kolkata 700106, India
[2] Univ Calcutta, AK Choudhury Sch Informat Technol, JD II,Sect III, Kolkata 700106, India
关键词
Distributed graph processing; Data placement; Genetic algorithm; GENETIC ALGORITHM;
D O I
10.1007/978-3-031-10539-5_26
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data applications like social networks, biological networks, etc. are often realized on graphs. Graph processing, if done on a single node, increases time complexity. Partitioning of graphs has been proved to be useful towards handle this well-known issue. There are several partitioning algorithms that are used to partition a graph. Each partition is assigned to a node within a cluster. However, the storage capacity of a node is limited. Therefore, an effective data distribution mechanism is required. This work aims to propose a novel strategy that would define an efficient distribution of graphs into nodes using genetic algorithms. The proposed data distribution strategy, when applied on two benchmark data set, shows improved data availability without increasing the number of replicas. It has also observed that the execution time will almost became half after applying the proposed method.
引用
收藏
页码:360 / 373
页数:14
相关论文
共 50 条
  • [1] Kylin: An Efficient and Scalable Graph Data Processing System
    Ho, Li-Yung
    Li, Tsung-Han
    Wu, Jan-Jan
    Liu, Pangfeng
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [2] Towards Efficient Graph Processing in Geo-Distributed Data Centers
    Yao, Feng
    Tao, Qian
    Lin, Shengyuan
    Zhang, Yanfeng
    Yu, Wenyuan
    Gong, Shufeng
    Wang, Qiange
    Yu, Ge
    Zhou, Jingren
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (11) : 2147 - 2160
  • [3] Efficient Distributed Query Processing on Large Scale RDF Graph Data
    Wang X.
    Xu Q.
    Chai L.-L.
    Yang Y.-J.
    Chai Y.-P.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 498 - 514
  • [4] Data Replication for Distributed Graph Processing
    Ho, Li-Yung
    Wu, Jan-Jan
    Liu, Pangfeng
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 319 - 326
  • [5] On Achieving Efficient Data Transfer for Graph Processing in Geo-Distributed Datacenters
    Zhou, Amelie Chi
    Ibrahim, Shadi
    He, Bingsheng
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 1397 - 1407
  • [6] An Efficient Graph Processing System
    Zhou, Xianke
    Chang, Pengfei
    Chen, Gang
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 401 - 412
  • [7] RGraph: Effective Distributed Graph Data Processing System Based on RDMA
    Cui P.-J.
    Yuan Y.
    Li C.-H.
    Zhang C.
    Wang G.-R.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 1018 - 1042
  • [8] An efficient memory data organization strategy for application-characteristic graph processing
    Peng FANG
    Fang WANG
    Zhan SHI
    Dan FENG
    Qianxu YI
    Xianghao XU
    Yongxuan ZHANG
    [J]. Frontiers of Computer Science, 2022, 16 (01) : 182 - 184
  • [9] An efficient memory data organization strategy for application-characteristic graph processing
    Fang, Peng
    Wang, Fang
    Shi, Zhan
    Feng, Dan
    Yi, Qianxu
    Xu, Xianghao
    Zhang, Yongxuan
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (01)
  • [10] An efficient memory data organization strategy for application-characteristic graph processing
    Peng Fang
    Fang Wang
    Zhan Shi
    Dan Feng
    Qianxu Yi
    Xianghao Xu
    Yongxuan Zhang
    [J]. Frontiers of Computer Science, 2022, 16