StarMR: An Efficient Star-Decomposition Based Query Processor for SPARQL Basic Graph Patterns Using MapReduce

Cited by: 3
Authors
Xu, Qiang [1]
Wang, Xin [1, 2]
Li, Jianxin [2, 3]
Gan, Ying [1]
Chai, Lele [1]
Wang, Junhu [4]
Affiliations
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
[3] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA, Australia
[4] Griffith Univ, Sch Informat & Commun Technol, Brisbane, Qld, Australia
Source
Funding
National Natural Science Foundation of China;
Keywords
Star decomposition; SPARQL; BGP; MapReduce; RDF graphs; RDF; COMPLEXITY; ENGINE;
DOI
10.1007/978-3-319-96890-2_34
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the proliferation of knowledge graphs, a large number of RDF graphs have been released, which raises the challenge of answering SPARQL queries in a distributed setting. In this paper, we propose an efficient distributed method, called StarMR, to answer SPARQL basic graph pattern (BGP) queries on big RDF graphs using MapReduce. In our method, query graphs are decomposed into a set of stars, using the semantic and structural information embedded in RDF graphs as heuristics. Two optimization techniques are proposed to further improve the efficiency of our algorithms: one filters out invalid input data, and the other postpones the Cartesian product operations. Extensive experiments on both synthetic and real-world datasets show that StarMR outperforms the state-of-the-art method S2X by an order of magnitude.
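To make the idea of decomposing a query graph into stars concrete, the following is a minimal sketch. It assumes a star is simply the set of triple patterns that share one subject; this is an illustrative simplification, not the heuristic-driven decomposition used by StarMR. The function name `decompose_into_stars` and the `ub:` predicates in the sample query are hypothetical.

```python
from collections import defaultdict


def decompose_into_stars(triple_patterns):
    """Group BGP triple patterns by subject.

    Assumption for illustration only: each star is rooted at a subject
    and holds that subject's outgoing (predicate, object) edges. The
    heuristics mentioned in the abstract are not reproduced here.
    """
    stars = defaultdict(list)
    for subject, predicate, obj in triple_patterns:
        stars[subject].append((predicate, obj))
    return dict(stars)


# Hypothetical BGP with three triple patterns over variables ?x, ?y, ?z.
bgp = [
    ("?x", "rdf:type", "ub:Student"),
    ("?x", "ub:advisor", "?y"),
    ("?y", "ub:worksFor", "?z"),
]

stars = decompose_into_stars(bgp)
for center, edges in stars.items():
    print(center, edges)
```

In this sketch the sample query splits into two stars, one rooted at `?x` and one at `?y`; matches for the two stars would then be joined on the shared variable `?y`.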
Pages: 415-430
Page count: 16
Related papers
5 in total
  • [1] GQARDF: A Graph-Based Approach Towards Efficient SPARQL Query Answering
    Wang, Xi
    Zhang, Qianzhen
    Guo, Deke
    Zhao, Xiang
    Yang, Jianye
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 551 - 568
  • [2] Efficient Graph Reachability Query Answering Using Tree Decomposition
    Wei, Fang
    [J]. REACHABILITY PROBLEMS, 2010, 6227 : 183 - 197
  • [3] An efficient and scalable SPARQL query processing framework for big data using MapReduce and hybrid optimum load balancing
    Kumar, V. Naveen
    Kumar, P. S. Ashok
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 148
  • [4] Efficient subspace skyline query based on user preference using MapReduce
    Li, Yuanyuan
    Li, Zhiyang
    Dong, Mianxiong
    Qu, Wenyu
    Ji, Changqing
    Wu, Junfeng
    [J]. AD HOC NETWORKS, 2015, 35 : 105 - 115
  • [5] Efficient Graph-Based Resource Allocation Scheme Using Maximal Independent Set for Randomly-Deployed Small Star Networks
    Zhou, Jian
    Wang, Lusheng
    Wang, Weidong
    Zhou, Qingfeng
    [J]. SENSORS, 2017, 17 (11):