DLToDW: Transferring Relational and NoSQL Databases from a Data Lake

被引:0
|
作者
Jemmali R. [1 ,2 ]
Abdelhedi F. [1 ]
Zurfluh G. [2 ]
机构
[1] CBI2, Trimane, Paris
[2] IRIT CNRS (UMR 5505), Toulouse University, Toulouse
关键词
Big Data; Data Lake; Data Warehouse; MDA; NoSQL; QVT; Relational databases;
D O I
10.1007/s42979-022-01287-7
中图分类号
学科分类号
摘要
Over the past decade, digital transformation has led to the evolution of databases towards Big Data. A need to collect and analyze data from different sources has emerged. At the same time, traditional decision support systems are unable to meet the growing needs of modern businesses to integrate and analyze a wide variety of generated data. As a result, most organizations need to convert their data stored in relational systems to NoSQL or "Not only SQL" systems that are based on flexible models and schemas. Our work is part of a medical application that must allow health professionals to analyze complex data for decision making. We propose mechanisms to extract data from a Data Lake and store them in a NoSQL Data Warehouse. This will allow to perform, in a second time, decisional analysis facilitated by the features offered by NoSQL systems (richness of data structures, query language, access performances). In this article, we present a process for ingesting data from a Data Lake into a Data Warehouse. The ingestion consists, first, in transferring relational and NoSQL DBs extracted from the Data Lake into a single NoSQL DB (the Data Warehouse), second, in merging so-called "similar" classes and third, in converting the links into references between objects. To automate this process, we used the Model Driven Architecture (MDA) which provides a schema transformation environment. From the physical schemas describing a Data Lake, we propose transformation rules that allow to create a Data Warehouse stored under a document-oriented NoSQL system. An experimentation has been performed for a medical application. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
下载
收藏
相关论文
共 50 条
  • [1] Ingestion of a Data Lake into a NoSQL Data Warehouse: The Case of Relational Databases
    Abdelhedi, Fatma
    Jemmali, Rym
    Zurfluh, Gilles
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2021, : 64 - 72
  • [2] Performance Analysis in NoSQL Databases, Relational Databases and NoSQL Databases as a Service in the Cloud
    Marrero, Luciano
    Olsowy, Verena
    Tesone, Fernando
    Thomas, Pablo
    Delia, Lisandro
    Pesado, Patricia
    COMPUTER SCIENCE - CACIC 2020, 2021, 1409 : 157 - 170
  • [3] Integration of Relational and NoSQL Databases
    Pokorny, Jaroslav
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2018, PT II, 2018, 10752 : 35 - 45
  • [4] Integration of Relational and NoSQL Databases
    Pokorny, Jaroslav
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2019, 6 (04) : 389 - 405
  • [5] Data Ingestion from a Data Lake: The Case of Document-oriented NoSQL Databases
    Abdelhedi, Fatma
    Jemmali, Rym
    Zurfluh, Gilles
    ICEIS: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2022, : 226 - 233
  • [6] A unified metamodel for NoSQL and relational databases
    Fernandez Candel, Carlos J.
    Sevilla Ruiz, Diego
    Garcia-Molina, Jesus J.
    INFORMATION SYSTEMS, 2022, 104
  • [7] Comparison between relational and NOSQL databases
    Sahatqija, Kosovare
    Ajdari, Jaumin
    Zenuni, Xhemal
    Raufi, Bujar
    Ismaili, Florije
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 216 - 221
  • [8] Query Processing over Data Warehouse using Relational Databases and NoSQL
    Carniel, Anderson Chaves
    Sa, Aried de Aguiar
    Porto Brisighello, Vinicius Henrique
    Ribeiro, Marcela Xavier
    Bueno, Renato
    Ciferri, Ricardo Rodrigues
    de Aguiar Ciferri, Cristina Dutra
    2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,
  • [9] Transformation of Schema from Relational Database (RDB) to NoSQL Databases
    Alotaibi, Obaid
    Pardede, Eric
    DATA, 2019, 4 (04)
  • [10] Uniform query framework for relational and NoSQL databases
    Karanjekar, J.B.
    Chandak, M.B.
    CMES - Computer Modeling in Engineering and Sciences, 2017, 113 (02): : 177 - 187