Distributed RDF Archives Querying with Spark

被引:0
|
作者
Bahri, Afef [1 ]
Laajimi, Meriem [2 ]
Ayadi, Nadia Yacoubi [3 ]
机构
[1] Univ Sfax, MIRACL Lab, Sfax, Tunisia
[2] High Inst Management Tunis, Tunis, Tunisia
[3] Univ Manouba, ENSI, RIADI Res Lab, Manouba 2010, Tunisia
来源
关键词
RDF archives; Distributed systems; Versioning queries; SPARQL; SPARK; SPARK SQL;
D O I
10.1007/978-3-319-98192-5_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prevalence of open data and the expansion of published information on the web have engendered a large scale of available RDF data. When dealing with the evolution of the published datasets, users may need to access to not only the actual version of a dataset but equally the previous ones and would like to track the evolution of data over time. To this direction, single-machine RDF archiving systems and Benchmarks have been proposed but do not scale well to query large RDF archives. Distributed data management systems present a promising direction for providing scalability and parallel processing of large volume of RDF data. In this paper, we study and compare commonly used RDF archiving techniques and querying strategies with the distributed computing platform Spark. We propose a formal mapping of versioning queries defined with SPARQL into SQL SPARK. We make a series of experimentation of these queries to study the effects of RDF archives partitioning and distribution.
引用
收藏
页码:451 / 465
页数:15
相关论文
共 50 条
  • [11] RAL: An algebra for querying RDF
    Frasincar, F
    Houben, GJ
    Vdovjak, R
    Barna, P
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2004, 7 (01): : 83 - 109
  • [12] A relaxed approach to RDF querying
    Hurtado, Carlos A.
    Poulovassilis, Alexandra
    Wood, Peter T.
    Semantic Web - ISEC 2006, Proceedings, 2006, 4273 : 314 - 328
  • [13] Taming Existence in RDF Querying
    Bry, Francois
    Furche, Tim
    Ley, Clemens
    Linse, Benedikt
    Marnette, Bruno
    WEB REASONING AND RULE SYSTEMS, PROCEEDINGS, 2008, 5341 : 236 - +
  • [14] Algebra of RDF Graphs for Querying Large-Scale Distributed Triple-Store
    Savnik, Iztok
    Nitta, Kiyoshi
    AVAILABILITY, RELIABILITY, AND SECURITY IN INFORMATION SYSTEMS, CD-ARES 2016, PAML 2016, 2016, 9817 : 3 - 18
  • [15] Sesame: A generic architecture for storing and querying RDF and RDF schema
    Broekstra, J
    Kampman, A
    van Harmelen, F
    SEMANTIC WEB - ISWC 2002, 2002, 2342 : 54 - 68
  • [16] Querying RDF Dictionaries in Compressed Space
    Martinez-Prieto, Miguel A.
    Fernandez, Javier D.
    Canovas, Rodrigo
    APPLIED COMPUTING REVIEW, 2012, 12 (02): : 64 - 77
  • [17] SPARQLByE: Querying RDF data by example
    Diaz, Gonzalo
    Arenas, Marcelo
    Benedikt, Michael
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1533 - 1536
  • [18] Representing and Querying Negative Knowledge in RDF
    Darari, Fariz
    SEMANTIC WEB: ESWC 2013 SATELLITE EVENTS, 2013, 7955 : 275 - 276
  • [19] Querying incomplete information in RDF with SPARQL
    Nikolaou, Charalampos
    Koubarakis, Manolis
    ARTIFICIAL INTELLIGENCE, 2016, 237 : 138 - 171
  • [20] Querying Trust in RDF Data with tSPARQL
    Hartig, Olaf
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 5 - 20