NoSQL Schema Evolution and Big Data Migration at Scale

被引:0
|
作者
Klettke, Meike [1 ]
Stoerl, Uta [2 ]
Shenavai, Manuel [2 ]
Scherzinger, Stefanie [3 ]
机构
[1] Univ Rostock, Rostock, Germany
[2] Univ Appl Sci, Darmstadt, Germany
[3] OTH Regensburg, Regensburg, Germany
关键词
NoSQL Databases; Schema Evolution; Data Migration Strategies; Lazy Migration; Lazy Composite Migration; Incremental Migration; Predictive Migration;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores scalable implementation strategies for carrying out lazy schema evolution in NoSQL data stores. For decades, schema evolution has been an evergreen in database research. Yet new challenges arise in the context of cloud-hosted data backends: With all database reads and writes charged by the provider, migrating the entire data instance eagerly into a new schema can be prohibitively expensive. Thus, lazy migration may be more cost-efficient, as legacy entities are only migrated in case they are actually accessed by the application. Related work has shown that the overhead of migrating data lazily is affordable when a single evolutionary change is carried out, such as adding a new property. In this paper, we focus on long-term schema evolution, where chains of pending schema evolution operations may have to be applied. Chains occur when legacy entities written several application releases back are finally accessed by the application. We discuss strategies for dealing with chains of evolution operations, in particular, the composition into a single, equivalent composite migration that performs the required version jump. Our experiments with MongoDB focus on scalable implementation strategies. Our lineup further compares the number of write operations, and thus, the operational costs of different data migration strategies.
引用
收藏
页码:2764 / 2774
页数:11
相关论文
共 50 条
  • [31] A NoSQL Data Model For Scalable Big Data Workflow Execution
    Mohan, Aravind
    Ebrahimi, Mahdi
    Lu, Shiyong
    Kotov, Alexander
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 52 - 59
  • [32] Data schema design as a schema evolution process
    Proper, HA
    DATA & KNOWLEDGE ENGINEERING, 1997, 22 (02) : 159 - 189
  • [33] Cloud-based NoSQL Data Migration
    Bansel, Aryan
    Gonzalez-Velez, Horacio
    Chis, Adriana E.
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 224 - 231
  • [34] Logical Schema for Data Warehouse on Column-Oriented NoSQL Databases
    Boussahoua, Mohamed
    Boussaid, Omar
    Bentayeb, Fadila
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II, 2017, 10439 : 247 - 256
  • [35] Scalability and Realtime on Big Data, MapReduce, NoSQL and Spark
    Furtado, Pedro
    BUSINESS INTELLIGENCE (EBISS 2016), 2017, 280 : 79 - 104
  • [36] Assessing NoSql Approaches for Spatial Big Data Management
    Nassif, El Hassane
    Hicham, Hajji
    Yaagoubi, Reda
    Badir, Hassan
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT, AI2SD'2019, VOL 6: ADVANCED INTELLIGENT SYSTEMS FOR NETWORKS AND SYSTEMS, 2020, 92 : 49 - 58
  • [37] Access Control Management as a Service for NoSQL Big Data
    Habiba, Mansura
    Islam, Md Rafiqul
    Ali, A. B. M. Shawkat
    2015 2ND ASIA-PACIFIC WORLD CONGRESS ON COMPUTER SCIENCE AND ENGINEERING (APWC ON CSE 2015), 2015,
  • [38] A Semantic NoSQL Application Program Interface for Big Data
    ElDahshan, K.
    Elsayed, E. K.
    Mancy, H.
    AbuBakr, A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 715 - 726
  • [39] Schema Proposition Model for NoSQL Applications
    Imam, Abdullahi Abubakar
    Basri, Shuib
    Ahmad, Rohiza
    Gonzalez-Aparicio, Maria T.
    RECENT TRENDS IN DATA SCIENCE AND SOFT COMPUTING, IRICT 2018, 2019, 843 : 30 - 39
  • [40] NoSE: Schema Design for NoSQL Applications
    Mior, Michael J.
    Salem, Kenneth
    Aboulnaga, Ashraf
    Liu, Rui
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 181 - 192