Distributed processing of regular path queries in RDF graphs

被引:0
|
作者
Xintong Guo
Hong Gao
Zhaonian Zou
机构
[1] Harbin Institute of Technology,
来源
关键词
Knowledge graph; RDF/SPARQL; Regular path queries; Graph summarization; Graph partitioning;
D O I
暂无
中图分类号
学科分类号
摘要
SPARQL 1.1 offers a type of navigational query for RDF systems, called regular path query (RPQ). A regular path query allows for retrieving node pairs with the paths between them satisfying regular expressions. Regular path queries are always difficult to be evaluated efficiently because of the possible large search space. Thus there has been no scalable and practical solution so far. In this paper, we present Leon+, an in-memory distributed framework, to address the RPQ problem in the context of the knowledge graph. To reduce search space and mitigate mounting communication costs, Leon+ takes advantage of join-ahead pruning via a novel RDF summarization technique together with a path partitioning strategy. We also develop a subtle cost model to devise query plans to achieve high efficiency for complex RPQs. As there has been no available RPQ benchmark, we create micro-benchmarks on both synthetic and real-world datasets. A thorough experimental evaluation is presented between our approach and the state-of-the-art RDF stores. The results show that our approach outperforms 5x faster than the competitors on single RPQ. For query workload, it saves up to 1/2 time and 2/3 communication overheads over the baseline method.
引用
收藏
页码:993 / 1027
页数:34
相关论文
共 50 条
  • [1] Distributed processing of regular path queries in RDF graphs
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 993 - 1027
  • [2] Efficient Distributed Regular Path Queries on RDF Graphs Using Partial Evaluation
    Wang, Xin
    Wang, Junhu
    Zhang, Xiaowang
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1933 - 1936
  • [3] Distributed Efficient Provenance-Aware Regular Path Queries on Large RDF Graphs
    Xin, Yueqi
    Wang, Xin
    Jin, Di
    Wang, Simiao
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 766 - 782
  • [4] Processing SPARQL queries over distributed RDF graphs
    Peng Peng
    Lei Zou
    M. Tamer Özsu
    Lei Chen
    Dongyan Zhao
    [J]. The VLDB Journal, 2016, 25 : 243 - 268
  • [5] Processing SPARQL queries over distributed RDF graphs
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Chen, Lei
    Zhao, Dongyan
    [J]. VLDB JOURNAL, 2016, 25 (02): : 243 - 268
  • [6] Processing Regular Path Queries on Arbitrarily Distributed Data
    Davoust, Alan
    Esfandiari, Babak
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2016 CONFERENCES, 2016, 10033 : 844 - 861
  • [7] ProvRPQ: An Interactive Tool for Provenance-Aware Regular Path Queries on RDF Graphs
    Wang, Xin
    Wang, Junhu
    [J]. DATABASES THEORY AND APPLICATIONS, (ADC 2016), 2016, 9877 : 480 - 484
  • [8] Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs
    Wang, Xin
    Wang, Simiao
    Xin, Yueqi
    Yang, Yajun
    Li, Jianxin
    Wang, Xiaofei
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (03): : 1465 - 1496
  • [9] Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs
    Xin Wang
    Simiao Wang
    Yueqi Xin
    Yajun Yang
    Jianxin Li
    Xiaofei Wang
    [J]. World Wide Web, 2020, 23 : 1465 - 1496
  • [10] Regular Path Queries on Massive Graphs
    Koschmieder, Andre
    Leser, Ulf
    [J]. 28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,