Optimizing storage of RDF archives using bidirectional delta chains

被引:3
|
作者
Taelman, Ruben [1 ]
Mahieu, Thibault [1 ]
Vanbrabant, Martin [1 ]
Verborgh, Ruben [1 ]
机构
[1] Univ Ghent, IMEC, Dept Elect & Informat Syst, IDLab, Ghent, Belgium
关键词
Linked Data; RDF archiving; Semantic Data Versioning; storage; indexing; WEB;
D O I
10.3233/SW-210449
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linked Open Datasets on the Web that are published as RDF can evolve over time. There is a need to be able to store such evolving RDF datasets, and query across their versions. Different storage strategies are available for managing such versioned datasets, each being efficient for specific types of versioned queries. In recent work, a hybrid storage strategy has been introduced that combines these different strategies to lead to more efficient query execution for all versioned query types at the cost of increased ingestion time. While this trade-off is beneficial in the context of Web querying, it suffers from exponential ingestion times in terms of the number of versions, which becomes problematic for RDF datasets with many versions. As such, there is a need for an improved storage strategy that scales better in terms of ingestion time for many versions. We have designed, implemented, and evaluated a change to the hybrid storage strategy where we make use of a bidirectional delta chain instead of the default unidirectional delta chain. In this article, we introduce a concrete architecture for this change, together with accompanying ingestion and querying algorithms. Experimental results from our implementation show that the ingestion time is significantly reduced. As an additional benefit, this change also leads to lower total storage size and even improved query execution performance in some cases. This work shows that modifying the structure of delta chains within the hybrid storage strategy can be highly beneficial for RDF archives. In future work, other modifications to this delta chain structure deserve to be investigated, to further improve the scalability of ingestion and querying of datasets with many versions.
引用
下载
收藏
页码:705 / 734
页数:30
相关论文
共 50 条
  • [31] Optimizing bandwidth and storage requirements for mobile images using perceptual-based JPEG recompression
    Shoham, Tamar
    Gill, Dror
    Carmel, Sharon
    MULTIMEDIA ON MOBILE DEVICES 2011 AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS V, 2011, 7881
  • [32] Software rejuvenation and resource reservation policies for optimizing server resource availability using cyclic nonhomogeneous Markov chains
    Koutras, V. P.
    Platis, A. N.
    Gravvanis, G. A.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2013, 29 (01) : 61 - 78
  • [33] A Methodology for Optimizing the Management of Spent Fuel of Nuclear Power Plants Using Dry Storage Casks
    Gomes, Ian B.
    Cruz Saldanha, Pedro L.
    Alvim, Antonio Carlos M.
    SCIENCE AND TECHNOLOGY OF NUCLEAR INSTALLATIONS, 2019, 2019
  • [34] Optimizing cold storage for uniform airflow and temperature distribution in apple preservation using CFD simulation
    Leo Daniel Alexander
    Sanjeev Jakhar
    Mani Sankar Dasgupta
    Scientific Reports, 14 (1)
  • [35] Optimizing a Distributed Wind-Storage System Under Critical Uncertainties Using Benders Decomposition
    Abdulgalil, Mohammed A.
    Khalid, Muhammad
    Alismail, Fahad
    IEEE ACCESS, 2019, 7 : 77951 - 77963
  • [36] Optimizing fin design for a PCM-based thermal storage device using dynamic Kriging
    Augspurger, Mike
    Choi, K. K.
    Udaykumar, H. S.
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2018, 121 : 290 - 308
  • [37] Optimizing Energy Storage Capacity in Islanded Microgrids Using Immunity-Based Multiobjective Planning
    Hong, Ying-Yi
    Lai, Yong-Zhen
    Chang, Yung-Ruei
    Lee, Yih-Der
    Lin, Chia-Hui
    ENERGIES, 2018, 11 (03):
  • [38] Increasing the Durability of Diesel Generator Engines by Using Energy Storage Systems and Optimizing Operating Modes
    Alekov, S.F.
    Pegachkov, A.A.
    Steel in Translation, 2024, 54 (03) : 220 - 225
  • [39] Optimizing the HW/SW Boundary of an ECC SoC Design Using Control Hierarchy and Distributed Storage
    Guo, Xu
    Schaumont, Patrick
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 454 - 459
  • [40] Delta Encoding Overhead Analysis of Cloud Storage Systems using Client-side Encryption
    Henziger, Eric
    Carlsson, Niklas
    11TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2019), 2019, : 183 - 190