Distributed RDF Archives Querying with Spark

Authors
Bahri, Afef [1]
Laajimi, Meriem [2]
Ayadi, Nadia Yacoubi [3]
Affiliations
[1] Univ Sfax, MIRACL Lab, Sfax, Tunisia
[2] High Inst Management Tunis, Tunis, Tunisia
[3] Univ Manouba, ENSI, RIADI Res Lab, Manouba 2010, Tunisia
Keywords
RDF archives; Distributed systems; Versioning queries; SPARQL; SPARK; SPARK SQL;
DOI
10.1007/978-3-319-98192-5_59
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The prevalence of open data and the growth of information published on the web have produced a large volume of available RDF data. As published datasets evolve, users may need to access not only the current version of a dataset but also its previous versions, and may want to track how the data changes over time. To this end, single-machine RDF archiving systems and benchmarks have been proposed, but they do not scale well when querying large RDF archives. Distributed data management systems offer a promising direction for providing scalability and parallel processing of large volumes of RDF data. In this paper, we study and compare commonly used RDF archiving techniques and querying strategies on the distributed computing platform Spark. We propose a formal mapping of versioning queries defined in SPARQL into Spark SQL. We conduct a series of experiments on these queries to study the effects of partitioning and distributing RDF archives.
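To make the idea of mapping versioning queries to SQL concrete, here is a minimal sketch; it is not the mapping defined in the paper. It assumes a hypothetical archive table `archive(s, p, o, version)` that stores one row per triple per version, and it uses Python's built-in `sqlite3` as a stand-in engine so the example runs anywhere. The table name, the sample triples, and the helper functions `load_archive`, `version_materialization`, and `delta_added` are all illustrative assumptions.

```python
import sqlite3

# Hypothetical archive layout (an assumption, not the paper's schema):
# one row per triple per version, so every version is stored in full.
ROWS = [
    ("ex:alice", "ex:worksAt", "ex:UnivSfax", 1),
    ("ex:alice", "ex:worksAt", "ex:UnivSfax", 2),
    ("ex:bob", "ex:worksAt", "ex:ENSI", 2),
]


def load_archive(rows):
    """Create an in-memory table holding the versioned triples."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE archive (s TEXT, p TEXT, o TEXT, version INTEGER)"
    )
    conn.executemany("INSERT INTO archive VALUES (?, ?, ?, ?)", rows)
    return conn


def version_materialization(conn, predicate, version):
    """Evaluate the triple pattern (?s predicate ?o) on a single version."""
    sql = (
        "SELECT s, o FROM archive "
        "WHERE p = ? AND version = ? ORDER BY s, o"
    )
    return conn.execute(sql, (predicate, version)).fetchall()


def delta_added(conn, predicate, v_old, v_new):
    """Return matches present in v_new but absent from v_old."""
    sql = """
        SELECT s, o FROM archive WHERE p = ? AND version = ?
        EXCEPT
        SELECT s, o FROM archive WHERE p = ? AND version = ?
        ORDER BY s, o
    """
    params = (predicate, v_new, predicate, v_old)
    return conn.execute(sql, params).fetchall()


if __name__ == "__main__":
    conn = load_archive(ROWS)
    print(version_materialization(conn, "ex:worksAt", 1))
    print(delta_added(conn, "ex:worksAt", 1, 2))
```

The two statements are plain SQL (a filtered `SELECT` and an `EXCEPT`), so a similar formulation could presumably be run through Spark SQL once the archive is registered as a table or view. How the archive is partitioned across the cluster, which is the focus of the paper's experiments, is outside the scope of this sketch.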
Pages: 451-465
Number of pages: 15