NoSQL Schema Evolution and Big Data Migration at Scale

被引:0
|
作者
Klettke, Meike [1 ]
Stoerl, Uta [2 ]
Shenavai, Manuel [2 ]
Scherzinger, Stefanie [3 ]
机构
[1] Univ Rostock, Rostock, Germany
[2] Univ Appl Sci, Darmstadt, Germany
[3] OTH Regensburg, Regensburg, Germany
关键词
NoSQL Databases; Schema Evolution; Data Migration Strategies; Lazy Migration; Lazy Composite Migration; Incremental Migration; Predictive Migration;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores scalable implementation strategies for carrying out lazy schema evolution in NoSQL data stores. For decades, schema evolution has been an evergreen in database research. Yet new challenges arise in the context of cloud-hosted data backends: With all database reads and writes charged by the provider, migrating the entire data instance eagerly into a new schema can be prohibitively expensive. Thus, lazy migration may be more cost-efficient, as legacy entities are only migrated in case they are actually accessed by the application. Related work has shown that the overhead of migrating data lazily is affordable when a single evolutionary change is carried out, such as adding a new property. In this paper, we focus on long-term schema evolution, where chains of pending schema evolution operations may have to be applied. Chains occur when legacy entities written several application releases back are finally accessed by the application. We discuss strategies for dealing with chains of evolution operations, in particular, the composition into a single, equivalent composite migration that performs the required version jump. Our experiments with MongoDB focus on scalable implementation strategies. Our lineup further compares the number of write operations, and thus, the operational costs of different data migration strategies.
引用
收藏
页码:2764 / 2774
页数:11
相关论文
共 50 条
  • [1] NoSQL document data migration strategy in the context of schema evolution
    Fedushko, Solomiia
    Malyi, Roman
    Syerov, Yuriy
    Serdyuk, Pavlo
    DATA & KNOWLEDGE ENGINEERING, 2024, 154
  • [2] Self-adapting data migration in the context of schema evolution in NoSQL databases
    Hillenbrand, Andrea
    Storl, Uta
    Nabiyev, Shamil
    Klettke, Meike
    DISTRIBUTED AND PARALLEL DATABASES, 2022, 40 (01) : 5 - 25
  • [3] Self-adapting data migration in the context of schema evolution in NoSQL databases
    Andrea Hillenbrand
    Uta Störl
    Shamil Nabiyev
    Meike Klettke
    Distributed and Parallel Databases, 2022, 40 : 5 - 25
  • [4] Towards Self-Adapting Data Migration in the Context of Schema Evolution in NoSQL Databases
    Hillenbrand, Andrea
    Stoerl, Uta
    Levchenko, Maksym
    Nabiyev, Shamil
    Klettke, Meike
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2020), 2020, : 133 - 138
  • [5] Supporting Schema Evolution in Schema-Less NoSQL Data Stores
    Meurice, Loup
    Cleve, Anthony
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2017, : 457 - 461
  • [6] Document-Oriented Data Schema for Relational Database Migration to NoSQL
    Hamouda, Shady
    Zainol, Zurinahni
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA INNOVATIONS AND APPLICATIONS (INNOVATE-DATA), 2017, : 43 - 50
  • [7] EvoBench: Benchmarking Schema Evolution in NoSQL
    Conrad, Andre
    Moeller, Mark Lukas
    Kreiter, Tobias
    Mair, Jan-Christopher
    Klettke, Meike
    Stoerl, Uta
    PERFORMANCE EVALUATION AND BENCHMARKING, TPCTC 2021, 2022, 13169 : 33 - 49
  • [8] EvoBench - A Framework for Benchmarking Schema Evolution in NoSQL
    Moeller, Mark Lukas
    Klettke, Meike
    Stoerl, Uta
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1974 - 1984
  • [9] A Generic Schema Evolution Approach for NoSQL and Relational Databases
    Chillon, Alberto Hernandez
    Klettke, Meike
    Ruiz, Diego Sevilla
    Molina, Jesus Garcia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2774 - 2789
  • [10] Remaining in Control of the Impact of Schema Evolution in NoSQL Databases
    Hillenbrand, Andrea
    Scherzinger, Stefanie
    Storl, Uta
    CONCEPTUAL MODELING, ER 2021, 2021, 13011 : 149 - 159