NoSQL Schema Evolution and Big Data Migration at Scale

被引:0
|
作者
Klettke, Meike [1 ]
Stoerl, Uta [2 ]
Shenavai, Manuel [2 ]
Scherzinger, Stefanie [3 ]
机构
[1] Univ Rostock, Rostock, Germany
[2] Univ Appl Sci, Darmstadt, Germany
[3] OTH Regensburg, Regensburg, Germany
关键词
NoSQL Databases; Schema Evolution; Data Migration Strategies; Lazy Migration; Lazy Composite Migration; Incremental Migration; Predictive Migration;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores scalable implementation strategies for carrying out lazy schema evolution in NoSQL data stores. For decades, schema evolution has been an evergreen in database research. Yet new challenges arise in the context of cloud-hosted data backends: With all database reads and writes charged by the provider, migrating the entire data instance eagerly into a new schema can be prohibitively expensive. Thus, lazy migration may be more cost-efficient, as legacy entities are only migrated in case they are actually accessed by the application. Related work has shown that the overhead of migrating data lazily is affordable when a single evolutionary change is carried out, such as adding a new property. In this paper, we focus on long-term schema evolution, where chains of pending schema evolution operations may have to be applied. Chains occur when legacy entities written several application releases back are finally accessed by the application. We discuss strategies for dealing with chains of evolution operations, in particular, the composition into a single, equivalent composite migration that performs the required version jump. Our experiments with MongoDB focus on scalable implementation strategies. Our lineup further compares the number of write operations, and thus, the operational costs of different data migration strategies.
引用
收藏
页码:2764 / 2774
页数:11
相关论文
共 50 条
  • [21] Providing Big Data Applications with Fault-Tolerant Data Migration Across Heterogeneous NoSQL Databases
    Scavuzzo, Marco
    Tamburri, Damian A.
    Di Nitto, Elisabetta
    2016 IEEE/ACM 2ND INTERNATIONAL WORKSHOP ON BIG DATA SOFTWARE ENGINEERING (BIGDSE 2016), 2016, : 26 - 32
  • [22] Data Models in NoSQL Databases for Big Data Contexts
    Santos, Maribel Yasmina
    Costa, Carlos
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 475 - 485
  • [23] USING NoSQL FOR PROCESSING UNSTRUCTURED BIG DATA
    Balakayeva, G. T.
    Phillips, C.
    Darkenbayev, D. K.
    Turdaliyev, M.
    NEWS OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN-SERIES OF GEOLOGY AND TECHNICAL SCIENCES, 2019, (06): : 12 - 21
  • [24] Persisting big-data: The NoSQL landscape
    Corbellini, Alejandro
    Mateos, Cristian
    Zunino, Alejandro
    Godoy, Daniela
    Schiaffino, Silvia
    INFORMATION SYSTEMS, 2017, 63 : 1 - 23
  • [25] Benchmarking Big Data OLAP NoSQL Databases
    El Malki, Mohammed
    Kopliku, Arlind
    Sabir, Essaid
    Teste, Olivier
    UBIQUITOUS NETWORKING, UNET 2018, 2018, 11277 : 82 - 94
  • [26] Query Performance Analysis of NoSQL and Big Data
    Samanta, Ashis Kumar
    Sarkar, Bidut Biman
    Chaki, Nabendu
    2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2018, : 237 - 241
  • [27] Automatic Transformation of Data Warehouse Schema To NoSQL Data Base: Comparative Study
    Yangui, Rania
    Nabli, Ahlem
    Gargouri, Faiez
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 264 - 273
  • [28] The schema evolution and data migration framework of the environmental mass database IMIS
    Draheim, D
    Horn, M
    Schulz, I
    16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 341 - 344
  • [29] Are NoSQL Databases Affected by Schema?
    Bansal, Neha
    Sachdeva, Shelly
    Awasthi, Lalit K. K.
    IETE JOURNAL OF RESEARCH, 2024, 70 (05) : 4770 - 4791
  • [30] SQL-to-NoSQL Schema Denormalization and Migration: A Study on Content Management Systems
    Lee, Chao-Hsien
    Zheng, Yu-Lin
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2022 - 2026