DLToDW: Transferring Relational and NoSQL Databases from a Data Lake

被引:0
|
作者
Jemmali R. [1 ,2 ]
Abdelhedi F. [1 ]
Zurfluh G. [2 ]
机构
[1] CBI2, Trimane, Paris
[2] IRIT CNRS (UMR 5505), Toulouse University, Toulouse
关键词
Big Data; Data Lake; Data Warehouse; MDA; NoSQL; QVT; Relational databases;
D O I
10.1007/s42979-022-01287-7
中图分类号
学科分类号
摘要
Over the past decade, digital transformation has led to the evolution of databases towards Big Data. A need to collect and analyze data from different sources has emerged. At the same time, traditional decision support systems are unable to meet the growing needs of modern businesses to integrate and analyze a wide variety of generated data. As a result, most organizations need to convert their data stored in relational systems to NoSQL or "Not only SQL" systems that are based on flexible models and schemas. Our work is part of a medical application that must allow health professionals to analyze complex data for decision making. We propose mechanisms to extract data from a Data Lake and store them in a NoSQL Data Warehouse. This will allow to perform, in a second time, decisional analysis facilitated by the features offered by NoSQL systems (richness of data structures, query language, access performances). In this article, we present a process for ingesting data from a Data Lake into a Data Warehouse. The ingestion consists, first, in transferring relational and NoSQL DBs extracted from the Data Lake into a single NoSQL DB (the Data Warehouse), second, in merging so-called "similar" classes and third, in converting the links into references between objects. To automate this process, we used the Model Driven Architecture (MDA) which provides a schema transformation environment. From the physical schemas describing a Data Lake, we propose transformation rules that allow to create a Data Warehouse stored under a document-oriented NoSQL system. An experimentation has been performed for a medical application. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [21] Performance Analysis of NoSQL and Relational Databases with CouchDB and MySQL for Application's Data Storage
    Gyorodi, Cornelia A.
    Dumse-Burescu, Diana, V
    Zmaranda, Doina R.
    Gyorodi, Robert S.
    Gabor, Gianina A.
    Pecherle, George D.
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 21
  • [22] NoSQL Databases for Large Volumes of Data
    Telnarova, Zdenka
    Zacek, Martin
    Smolka, Pavel
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019), 2019, 2186
  • [23] NoSQL Databases for Big Data Management
    Gaspar, Drazena
    Mabic, Mirela
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS (CECIIS 2016), 2016, : 3 - 10
  • [24] Modeling and Querying Data in NoSQL Databases
    Kaur, Karamjit
    Rani, Rinkle
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [25] ZQL: A Unified Middleware Bridging Both Relational and NoSQL Databases
    Xu, Jie
    Shi, Mengjie
    Chen, Chaoyuan
    Zhang, Zhen
    Fu, Jigao
    Lin, Chi Harold
    2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 730 - 737
  • [26] NOSOLAP: Moving from Data Warehouse Requirements to NoSQL Databases
    Prakash, Deepika
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 452 - 458
  • [27] Data Models in NoSQL Databases for Big Data Contexts
    Santos, Maribel Yasmina
    Costa, Carlos
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 475 - 485
  • [28] An evaluation of relational and NoSQL distributed databases on a low-power cluster
    da Silva, Lucas Ferreira
    Lima, Joao V. F.
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (12): : 13402 - 13420
  • [29] Research and Development of the Method of Investigating the Possibility of Transformation Relational Databases to NoSQL
    Ivanova, Tatyana S.
    Ivanov, Evgeniy A.
    PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 2090 - 2093
  • [30] When Relational-Based Applications Go to NoSQL Databases: A Survey
    Schreiner, Geomar A.
    Duarte, Denio
    Mello, Ronaldo dos Santos
    INFORMATION, 2019, 10 (07)