Optimizing storage of RDF archives using bidirectional delta chains

被引:3
|
作者
Taelman, Ruben [1 ]
Mahieu, Thibault [1 ]
Vanbrabant, Martin [1 ]
Verborgh, Ruben [1 ]
机构
[1] Univ Ghent, IMEC, Dept Elect & Informat Syst, IDLab, Ghent, Belgium
关键词
Linked Data; RDF archiving; Semantic Data Versioning; storage; indexing; WEB;
D O I
10.3233/SW-210449
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linked Open Datasets on the Web that are published as RDF can evolve over time. There is a need to be able to store such evolving RDF datasets, and query across their versions. Different storage strategies are available for managing such versioned datasets, each being efficient for specific types of versioned queries. In recent work, a hybrid storage strategy has been introduced that combines these different strategies to lead to more efficient query execution for all versioned query types at the cost of increased ingestion time. While this trade-off is beneficial in the context of Web querying, it suffers from exponential ingestion times in terms of the number of versions, which becomes problematic for RDF datasets with many versions. As such, there is a need for an improved storage strategy that scales better in terms of ingestion time for many versions. We have designed, implemented, and evaluated a change to the hybrid storage strategy where we make use of a bidirectional delta chain instead of the default unidirectional delta chain. In this article, we introduce a concrete architecture for this change, together with accompanying ingestion and querying algorithms. Experimental results from our implementation show that the ingestion time is significantly reduced. As an additional benefit, this change also leads to lower total storage size and even improved query execution performance in some cases. This work shows that modifying the structure of delta chains within the hybrid storage strategy can be highly beneficial for RDF archives. In future work, other modifications to this delta chain structure deserve to be investigated, to further improve the scalability of ingestion and querying of datasets with many versions.
引用
收藏
页码:705 / 734
页数:30
相关论文
共 50 条
  • [1] Evaluating query and storage strategies for RDF archives
    Fernandez, Javier D.
    Umbrich, Juergen
    Polleres, Axel
    Knuth, Magnus
    [J]. SEMANTIC WEB, 2019, 10 (02) : 247 - 291
  • [2] Optimizing RDF storage removing redundancies: An algorithm
    Iannone, L
    Palmisano, I
    Redavid, D
    [J]. INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 732 - 742
  • [3] Triple storage for random-access versioned querying of RDF archives
    Taelman, Ruben
    Sande, Vander
    Van Herwegen, Joachim
    Mannens, Erik
    Verborgh, Ruben
    [J]. JOURNAL OF WEB SEMANTICS, 2019, 54 : 4 - 28
  • [4] Exposing RDF Archives Using Triple Pattern Fragments
    Taelman, Ruben
    Verborgh, Ruben
    Mannens, Erik
    [J]. KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, 2017, 10180 : 188 - 192
  • [5] Optimizing Aggregate SPARQL Queries Using Materialized RDF Views
    Ibragimov, Dilshod
    Hose, Katja
    Pedersen, Torben Bach
    Zimanyi, Esteban
    [J]. SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 341 - 359
  • [6] Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce
    Husain, Mohammad Farhan
    Doshi, Pankil
    Khan, Latifur
    Thuraisingham, Bhavani
    [J]. CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 : 680 - 686
  • [7] Enhancing Resilience of KPS Using Bidirectional Hash Chains and Application on Sensornet
    Dalai, Deepak Kumar
    Sarkar, Pinaki
    [J]. NETWORK AND SYSTEM SECURITY, 2017, 10394 : 683 - 693
  • [8] Optimizing energy storage devices using Ragone plots
    Christen, T
    Ohler, C
    [J]. JOURNAL OF POWER SOURCES, 2002, 110 (01) : 107 - 116
  • [9] MuSe: a multi-level storage scheme for big RDF data using MapReduce
    Chawla, Tanvi
    Singh, Girdhari
    Pilli, Emmanuel S.
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [10] MuSe: a multi-level storage scheme for big RDF data using MapReduce
    Tanvi Chawla
    Girdhari Singh
    Emmanuel S. Pilli
    [J]. Journal of Big Data, 8