Distributed subgraph query for RDF graph data based on MapReduce

被引:1
|
作者
Su, Qianxiang [1 ]
Huang, Qingrong [1 ]
Wu, Nan [1 ]
Pan, Ying [1 ]
机构
[1] Nanning Normal Univ, Guangxi Key Lab Human Machine Interact & Intellige, Nanning 530001, Peoples R China
基金
中国国家自然科学基金;
关键词
RDF query; RDF subgraph; Distributed environment; MapReduce; Star matching;
D O I
10.1016/j.compeleceng.2022.108221
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, Resource Description Framework (RDF) query has been widely used in social net-works, biomedicine and other fields. With the explosion of RDF data due to the Internet of Things and Semantic Web, people's demand for intelligent computing and intelligent search is increasing, effectively querying RDF has become a major challenge. The current query methods often introduce a large number of join operations, and repeatedly traverse in some subgraphs during the query process, which makes the query efficiency and query performance poor. To address the above problems, this paper proposes a subgraph query algorithm for RDF graph data in a distributed environment. The graph structure is used to decompose the stars of the RDF graph, and the optimal query sequence of the stars is calculated. Fewer intermediate results can be produced based on the query sequence to reduce repeated calculations. Besides, adjacency lists are used to store RDF graphs, which are distributed across multiple tables. Multiple table oper-ations can reduce the scope of subject node traversal, and further improve the query efficiency of RDF subgraph by matching one star per iteration. Experimental results show that our work can improve the query efficiency of RDF subgraphs.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] KREAG: Keyword query approach over RDF data based on entity-triple association graph
    Li H.-Y.
    Qu Y.-Z.
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2011, 34 (05): : 825 - 835
  • [32] Graph-Based RDF Data Management
    Zou L.
    Özsu M.T.
    [J]. Data Science and Engineering, 2017, 2 (1) : 56 - 70
  • [33] Graph Pattern Based RDF Data Compression
    Pan, Jeff Z.
    Gomez Perez, Jose Manuel
    Ren, Yuan
    Wu, Honghan
    Wang, Haofen
    Zhu, Man
    [J]. SEMANTIC TECHNOLOGY (JIST 2014), 2015, 8943 : 239 - 256
  • [34] Dominance-Partitioned Subgraph Matching on Large RDF Graph
    Ning, Bo
    Sun, Yunhao
    Zhao, Deji
    Xing, Weikang
    Li, Guanyu
    [J]. COMPLEXITY, 2020, 2020
  • [35] Distributed Frequent Subgraph Mining Using Gaston and MapReduce
    Rao, Jagannadha D. B.
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2021, 17 (02) : 41 - 58
  • [36] Distributed SPARQL over Big RDF Data A Comparative Analysis using Presto and MapReduce
    Mammo, Mulugeta
    Bansal, Srividya K.
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 33 - 40
  • [37] RDF Chain Query Optimization in a Distributed Environment
    Hogenboom, Alexander
    Niewenhuijse, Ewout
    Jansen, Milan
    Frasincar, Flavius
    Vandic, Damir
    [J]. 30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 353 - 359
  • [38] Impact analysis of data placement strategies on query efforts in distributed RDF stores
    Janke, Daniel
    Staab, Steffen
    Thimm, Matthias
    [J]. JOURNAL OF WEB SEMANTICS, 2018, 50 : 21 - 48
  • [39] An Intermediate Algebra for Optimizing RDF Graph Pattern Matching on MapReduce
    Ravindra, Padmashree
    Kim, HyeongSik
    Anyanwu, Kemafor
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT II, 2011, 6644 : 46 - 61
  • [40] PDSM: Pregel-Based Distributed Subgraph Matching on Large Scale RDF Graphs
    Xu, Qiang
    Wang, Xin
    Xin, Yueqi
    Feng, Zhiyong
    Chen, Renhai
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 17 - 18