Processing SPARQL queries over distributed RDF graphs

Cited by: 59
Authors
Peng, Peng [1]
Zou, Lei [1]
Ozsu, M. Tamer [2]
Chen, Lei [3]
Zhao, Dongyan [1]
Affiliations
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
[2] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Source
VLDB JOURNAL | 2016 / Vol. 25 / Issue 02
Keywords
RDF; SPARQL; RDF graph; Distributed queries; EFFICIENT; COMPLEXITY; ENGINE;
DOI
10.1007/s00778-015-0415-0
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
We propose techniques for processing SPARQL queries over a large RDF graph in a distributed environment. We adopt a "partial evaluation and assembly" framework. Answering a SPARQL query Q is equivalent to finding subgraph matches of the query graph Q over the RDF graph G. Based on properties of subgraph matching over a distributed graph, we introduce local partial matches as partial answers in each fragment of the RDF graph G. For assembly, we propose two methods: centralized and distributed assembly. We analyze our algorithms both theoretically and experimentally. Extensive experiments over both real and benchmark RDF repositories of billions of triples confirm that our method is superior to the state-of-the-art methods in both performance and scalability.
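The "partial evaluation and assembly" idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes triples are plain Python tuples, that each fragment is a list of triples, and it matches each triple pattern independently inside a fragment, which is a much weaker notion than the local partial matches the abstract refers to. The names `match_pattern`, `partial_evaluate`, and `assemble` are hypothetical, and only centralized assembly is shown.

```python
# Toy illustration only; all function names and data are hypothetical.

def is_var(term):
    """SPARQL-style variables start with '?'."""
    return term.startswith("?")

def match_pattern(pattern, triple):
    """Return the variable bindings if `triple` matches `pattern`, else None."""
    binding = {}
    for p, t in zip(pattern, triple):
        if is_var(p):
            if binding.get(p, t) != t:  # same variable bound to two values
                return None
            binding[p] = t
        elif p != t:                    # constant term mismatch
            return None
    return binding

def partial_evaluate(fragment, query):
    """Per-fragment step: for each triple pattern, collect local bindings."""
    return [
        [b for t in fragment if (b := match_pattern(pat, t)) is not None]
        for pat in query
    ]

def assemble(partials, n_patterns):
    """Centralized step: union bindings per pattern, then join on shared variables."""
    per_pattern = [
        [b for part in partials for b in part[i]] for i in range(n_patterns)
    ]
    results = [{}]
    for bindings in per_pattern:
        results = [
            {**r, **b}
            for r in results
            for b in bindings
            if all(r.get(k, v) == v for k, v in b.items())
        ]
    return results

# Two fragments of a small RDF graph (assumed example data).
fragments = [
    [("alice", "knows", "bob"), ("alice", "livesIn", "paris")],
    [("bob", "worksAt", "acme"), ("carol", "knows", "bob")],
]
# Query graph: ?x knows ?y . ?y worksAt ?z
query = [("?x", "knows", "?y"), ("?y", "worksAt", "?z")]

partials = [partial_evaluate(f, query) for f in fragments]
answers = assemble(partials, len(query))
print(answers)
```

In this example the match binding `?x` to `alice` spans both fragments, so neither fragment can produce it alone; it appears only after the per-fragment results are joined at the coordinator.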
Pages: 243-268
Number of pages: 26
Related papers
50 in total
  • [31] Clustering Remote RDF Data Using SPARQL Update Queries
    Qi, Letao
    Lin, Harris T.
    Honavar, Vasant
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 236 - 242
  • [32] Optimizing Aggregate SPARQL Queries Using Materialized RDF Views
    Ibragimov, Dilshod
    Hose, Katja
    Pedersen, Torben Bach
    Zimanyi, Esteban
    [J]. SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 341 - 359
  • [33] Querying distributed RDF data sources with SPARQL
    Quilitz, Bastian
    Leser, Ulf
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2008, 5021 : 524 - 538
  • [34] Scaling Queries over Big RDF Graphs with Semantic Hash Partitioning
    Lee, Kisung
    Liu, Ling
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1894 - 1905
  • [35] Holistic and Compact Selectivity Estimation for Hybrid Queries over RDF Graphs
    Wagner, Andreas
    Bicer, Veli
    Tran, Thanh
    Studer, Rudi
    [J]. SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 : 97 - 113
  • [36] Distributed Query Evaluation over Large RDF Graphs
    Peng, Peng
    [J]. WEB AND BIG DATA, APWEB-WAIM 2019, 2019, 11809 : 3 - 7
  • [37] Efficient Distributed Regular Path Queries on RDF Graphs Using Partial Evaluation
    Wang, Xin
    Wang, Junhu
    Zhang, Xiaowang
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1933 - 1936
  • [38] RDF partitioning for scalable SPARQL query processing
    Xiaoyan WANG
    Tao YANG
    Jinchuan CHEN
    Long HE
    Xiaoyong DU
    [J]. Frontiers of Computer Science, 2015, 9 (06) : 919 - 933
  • [39] RDF partitioning for scalable SPARQL query processing
    Wang, Xiaoyan
    Yang, Tao
    Chen, Jinchuan
    He, Long
    Du, Xiaoyong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2015, 9 (06) : 919 - 933
  • [40] RDF partitioning for scalable SPARQL query processing
    Xiaoyan Wang
    Tao Yang
    Jinchuan Chen
    Long He
    Xiaoyong Du
    [J]. Frontiers of Computer Science, 2015, 9 : 919 - 933