BRGP: a balanced RDF graph partitioning algorithm for cloud storage

被引:11
|
作者
Leng, Yonglin [1 ]
Chen Zhikui [1 ]
Zhong, Fangming [1 ]
Li, Xiongjiu [1 ]
Hu, Yueming [2 ]
Yang, Chao [3 ]
机构
[1] Dalian Univ Technol, Sch Software Technol, Dalian 116620, Peoples R China
[2] South China Agr Univ, Coll Nat Resources & Environm, Guangzhou 510642, Guangdong, Peoples R China
[3] Huabei Oilfield Co, Petr Prod Engn Res Inst, Renqiu 062550, Peoples R China
来源
关键词
cloud storage; RDF; balanced clustering partition; multi-level label propagation; SPARQL;
D O I
10.1002/cpe.3896
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The continuous growth of resource description framework (RDF) data poses an important challenge on RDF data partitioning that is a vital technique for effective cloud storage. Recently, many partitioning algorithms for large RDF data have been developed, and most of them are based on graph partitioning. However, existing graph partitioning methods could not partition asymmetric RDF data effectively, resulting in a lower performance for cloud storage. This paper proposes a balanced RDF graph partitioning algorithm for storing massive RDF data on cloud. We first devise a modularity-based multi-level label propagation algorithm (MMLP) to partition RDF graph roughly and then use a balanced K-mediods clustering algorithm for final k-way partitioning. Balanced RDF graph partitioning algorithm designs an effective label update rule and a balanced modification strategy to achieve a high quality coarsening result and make the partition as equilibrium as possible. Experiments are carried on two representative RDF benchmarks and one real RDF dataset by comparison with two representative graph partitioning methods, that is, METIS and MLP+METIS. Results demonstrate that our proposed scheme can produce a high-quality partition for massive RDF data storage on cloud. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] An exact algorithm for graph partitioning
    William W. Hager
    Dzung T. Phan
    Hongchao Zhang
    Mathematical Programming, 2013, 137 : 531 - 556
  • [42] Balanced Graph Partitioning: Optimizing Graph Cut Based on Label Swapping
    Zhang, Huajian
    PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, SOCIO-CULTURAL COMPUTING (BESC), 2015, : 184 - 187
  • [43] Storage, partitioning, indexing and retrieval in Big RDF frameworks: A survey
    Chawla, Tanvi
    Singh, Girdhari
    Pilli, Emmanuel S.
    Govil, M. C.
    COMPUTER SCIENCE REVIEW, 2020, 38
  • [44] HyPSo: Hybrid Partitioning for Big RDF Storage and Query Processing
    Chawla, Tanvi
    Singh, Girdhari
    Pilli, Emmanuel S.
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 188 - 194
  • [45] Dynamic Partitioning Supporting Load Balancing for Distributed RDF Graph Stores
    Bok, Kyoungsoo
    Kim, Junwon
    Yoo, Jaesoo
    SYMMETRY-BASEL, 2019, 11 (07):
  • [46] Optimizing RDF storage removing redundancies: An algorithm
    Iannone, L
    Palmisano, I
    Redavid, D
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 732 - 742
  • [47] α-Partitioning algorithm:: Vertical partitioning based on the fuzzy graph
    Son, JH
    Kim, MH
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 2001, 2113 : 537 - 546
  • [48] A fast algorithm for balanced graph clustering
    Huang, Mao Lin
    Nguyen, Quang Vinh
    11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 46 - +
  • [49] GSBRL : Efficient RDF graph storage based on reinforcement learning
    Zheng, Lei
    Shen, Ziming
    Wang, Hongzhi
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (02): : 763 - 784
  • [50] GSBRL : Efficient RDF graph storage based on reinforcement learning
    Lei Zheng
    Ziming Shen
    Hongzhi Wang
    World Wide Web, 2022, 25 : 763 - 784