A non-intrusive and reactive architecture to support real-time ETL processes in data warehousing environments

被引:3
|
作者
de Assis Vilela, Flavio [1 ]
Times, Valeria Cesario [2 ]
de Campos Bernardi, Alberto Carlos [3 ]
de Paula Freitas, Augusto [4 ]
Ciferri, Ricardo Rodrigues [4 ]
机构
[1] Fed Inst Goias, Goias, GO, Brazil
[2] Univ Fed Pernambuco, Recife, Brazil
[3] Embrapa Southeast Livestock, Brasilia, DF, Brazil
[4] Univ Fed Sao Carlos, Sao Carlos, Brazil
关键词
Data warehouse; Real-time; ETL; Data extraction; Data loading;
D O I
10.1016/j.heliyon.2023.e15728
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nowadays, organizations are very interested to gather data for strategic decision-making. Data are disposable in operational sources, which are distributed, heterogeneous, and autonomous. These data are gathered through ETL processes, which occur traditionally in a pre-defined time, that is, once a day, once a week, once a month or in a specific period of time. On the other hand, there are special applications for which data needs to be obtained in a faster way and sometimes even immediately after the data are generated in the operation data sources, such as health systems and digital agriculture. Thus, the conventional ETL process and the disposable techniques are incapable of making the operational data delivered in real-time, providing low latency, high availability, and scalability. As our proposal, we present an innovative architecture, named Data Magnet, to cope with real-time ETL processes. The experimental tests performed in the digital agriculture domain using real and synthetic data showed that our proposal was able to deal in real-time with the ETL process. The Data Magnet provided great performance, showing an almost constant elapsed time for growing data volumes. Besides, Data Magnet provided significant performance gains over the traditional trigger technique.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] A novel solution to perform real-time ETL process based on non-intrusive and reactive concepts
    Vilela, Flavio de Assis
    Ciferri, Ricardo Rodrigues
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 556 - 561
  • [2] Real-time, non-intrusive evaluation of VoIP
    Raja, Adil
    Azad, R. Muhammad Atif
    Flanagan, Colin
    Ryan, Conor
    [J]. GENETIC PROGRAMMING, PROCEEDINGS, 2007, 4445 : 217 - +
  • [3] A Non-Intrusive and Real-Time Data Provenance Method for DDS Systems
    Wei, Siyi
    Tu, Jinbin
    Wang, Yun
    [J]. 2023 19TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN 2023, 2023, : 439 - 446
  • [4] An architecture for real-time warehousing of scientific data
    Lawrence, R
    Kruger, A
    [J]. CSC '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON SCIENTIFIC COMPUTING, 2005, : 151 - 156
  • [5] Real-time non-intrusive RF biochemical sensor
    Pandit, Nidhi
    Jaiswal, Rahul K.
    Pathak, Nagendra P.
    [J]. ELECTRONICS LETTERS, 2020, 56 (19) : 185 - 187
  • [6] A Real-time Non-Intrusive Load Monitoring System
    Welikala, Shirantha
    Dinesh, Chinthaka
    Ekanayake, Mervyn Parakrama B.
    Godaliyadda, Roshan Indika
    Ekanayake, Janaka
    [J]. 2016 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2016, : 850 - 855
  • [7] A Real-Time Non-Intrusive Tool for Network Traffic Analysis
    Giorgi, G.
    Dindo, S.
    Vantini, M.
    Narduzzi, C.
    [J]. I2MTC: 2009 IEEE INSTRUMENTATION & MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3, 2009, : 53 - 57
  • [8] Distributed real-time ETL architecture for unstructured big data
    Erum Mehmood
    Tayyaba Anees
    [J]. Knowledge and Information Systems, 2022, 64 : 3419 - 3445
  • [9] Distributed real-time ETL architecture for unstructured big data
    Mehmood, Erum
    Anees, Tayyaba
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (12) : 3419 - 3445
  • [10] Near Real-Time Data Warehousing Using State-of-the-Art ETL Tools
    Joerg, Thomas
    Dessloch, Stefan
    [J]. ENABLING REAL-TIME BUSINESS INTELLIGENCE, 2010, 41 : 100 - 117