A Distributed Query Method for RDF Data on Spark

被引:0
|
作者
Guo, Minru [1 ]
Wang, Jingbin [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
来源
关键词
Distributed; Spark; RDF; Index; Query;
D O I
10.1007/978-981-10-0457-5_11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the upcoming data deluge of semantic data, the fast growth of RDF data has brought significant challenges in query. A new distributed RDF query algorithm RQCCP (RDF data Query combined with Classes Correlations with Property) on Spark platform is proposed to solve the problem of low efficiency for RDF data query. It splits and stores RDF data by the class of Subject, Predicate and the class of Object, simultaneously building index file of classes correlations with property; the index is applied to narrow the scope of input for query, filtering out irrelevant triples in advance and intermediate results of query cached in memory as resilient distributed dataset to reduce disk and network I/O. The results of experiments conducted on large-scale RDF datasets show that RQCCP has high query performance.
引用
收藏
页码:102 / 115
页数:14
相关论文
共 50 条
  • [1] Query Optimization for massive RDF data based on Spark
    Li, Shaohui
    Shen, Derong
    Kou, Yue
    Yang, Dan
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2018), 2018, : 219 - 224
  • [2] Distributed Join Query Processing for Big RDF Data
    Elzein, Nahla Mohammed
    Majid, Mazlina Abdul
    Fakherldin, Mohammed
    Hashem, Ibrahim Abaker Targio
    [J]. ADVANCED SCIENCE LETTERS, 2018, 24 (10) : 7758 - 7761
  • [3] Distributed RDF Query Answering with Dynamic Data Exchange
    Potter, Anthony
    Motik, Boris
    Nenov, Yavor
    Horrocks, Ian
    [J]. SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 480 - 497
  • [4] Query Optimization of Distributed RDF Data Based on MapReduce
    Zhang, Yanqin
    Wang, Jingbin
    [J]. MACHINERY ELECTRONICS AND CONTROL ENGINEERING III, 2014, 441 : 970 - 973
  • [5] Adaptive and Optimized RDF Query Interface for Distributed WFS Data
    Zhao, Tian
    Zhang, Chuanrong
    Li, Weidong
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (04)
  • [6] Distributed subgraph query for RDF graph data based on MapReduce
    Su, Qianxiang
    Huang, Qingrong
    Wu, Nan
    Pan, Ying
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [7] Distributed SPARQL query answering over RDF data streams
    Leida, Marcello
    Chu, Andrej
    [J]. 2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 369 - 378
  • [8] Query Answering On Uncertain Big RDF Data Using Apache Spark Framework
    Benbernou, Salima
    Ouziri, Mourad
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4854 - 4860
  • [9] Architecture for distributed query processing using the RDF data in cloud environment
    C. Ranichandra
    B. K. Tripathy
    [J]. Evolutionary Intelligence, 2021, 14 : 567 - 575
  • [10] Efficient Distributed Query Processing on Large Scale RDF Graph Data
    Wang, Xin
    Xu, Qiang
    Chai, Le-Le
    Yang, Ya-Jun
    Chai, Yun-Peng
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 498 - 514