Distributed RDF Archives Querying with Spark

Cited by: 0
Authors
Bahri, Afef [1]
Laajimi, Meriem [2]
Ayadi, Nadia Yacoubi [3]
Affiliations
[1] Univ Sfax, MIRACL Lab, Sfax, Tunisia
[2] High Inst Management Tunis, Tunis, Tunisia
[3] Univ Manouba, ENSI, RIADI Res Lab, Manouba 2010, Tunisia
Source
Keywords
RDF archives; Distributed systems; Versioning queries; SPARQL; Spark; Spark SQL
DOI
10.1007/978-3-319-98192-5_59
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The prevalence of open data and the growth of information published on the web have produced a large volume of available RDF data. As published datasets evolve, users may need to access not only the current version of a dataset but also previous ones, and may want to track how the data change over time. To this end, single-machine RDF archiving systems and benchmarks have been proposed, but they do not scale well for querying large RDF archives. Distributed data management systems offer a promising direction for scalable, parallel processing of large volumes of RDF data. In this paper, we study and compare commonly used RDF archiving techniques and querying strategies on the distributed computing platform Spark. We propose a formal mapping of versioning queries defined in SPARQL into Spark SQL. We run a series of experiments on these queries to study the effects of RDF archive partitioning and distribution.
Pages: 451-465
Number of pages: 15
Related papers
50 in total
  • [41] Querying RDF Graphs Over Partitioned Indexes
    Gai, Lei
    Liu, Junmin
    Wang, Xiaoming
    Li, Jian
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 2262 - 2267
  • [42] Querying and Reasoning with RDF(S)/OWL in XQuery
    Almendros-Jimenez, Jesus M.
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 450 - 459
  • [43] An Experimental Evaluation of Relational RDF Storage and Querying Techniques
    MahmoudiNasab, Hooran
    Sakr, Sherif
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2010, 6193 : 215 - +
  • [44] Querying RDF and OWL Data Source using SPARQL
    Kumar, Naveen
    Kumar, Suresh
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [45] Approximate querying of RDF graphs via path alignment
    De Virgilio, Roberto
    Maccioni, Antonio
    Torlone, Riccardo
    DISTRIBUTED AND PARALLEL DATABASES, 2015, 33 (04) : 555 - 581
  • [46] Self-Indexing RDF Archives
    Cerdeira-Pena, Ana
    Farina, Antonio
    Fernandez, Javier D.
    Martinez-Prieto, Miguel A.
    2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 526 - 535
  • [47] An RDF Design Pattern for the Structural Representation and Querying of Expressions
    Ferre, Sebastien
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2016, 2016, 10024 : 196 - 211
  • [48] A Fuzzy Extension of SPARQL for Querying Gradual RDF Data
    Pivert, Olivier
    Slama, Olfa
    Smits, Gregory
    Thion, Virginie
    2016 IEEE TENTH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2016, : 707 - 708
  • [49] Optimized index structures for querying RDF from the Web
    Harth, A
    Decker, S
    Third Latin American Web Congress, Proceedings, 2005, : 71 - 80
  • [50] Approximate querying of RDF graphs via path alignment
    Roberto De Virgilio
    Antonio Maccioni
    Riccardo Torlone
    Distributed and Parallel Databases, 2015, 33 : 555 - 581