Lake Data Warehouse Architecture for Big Data Solutions

被引:0
|
作者
Saddad, Emad [1 ,2 ]
El-Bastawissy, Ali [3 ]
Mokhtar, Hoda M. O. [4 ]
Hazman, Maryam [1 ,2 ]
机构
[1] Agr Res Ctr ARC, Climate Change Informat Ctr, Giza, Egypt
[2] Agr Res Ctr ARC, Renewable Energy & Expert Syst, Giza, Egypt
[3] MSA Univ, Fac Comp Sci, Giza, Egypt
[4] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
关键词
Traditional data warehouse; big data; semi-structured data; unstructured data; novel data warehouses architecture; Hadoop; spark;
D O I
10.14569/IJACSA.2020.0110854
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Traditional Data Warehouse is a multidimensional repository. It is nonvolatile, subject-oriented, integrated, time-variant, and non-operational data. It is gathered from multiple heterogeneous data sources. We need to adapt traditional Data Warehouse architecture to deal with the new challenges imposed by the abundance of data and the current big data characteristics, containing volume, value, variety, validity, volatility, visualization, variability, and venue. The new architecture also needs to handle existing drawbacks, including availability, scalability, and consequently query performance. This paper introduces a novel Data Warehouse architecture, named Lake Data Warehouse Architecture, to provide the traditional Data Warehouse with the capabilities to overcome the challenges. Lake Data Warehouse Architecture depends on merging the traditional Data Warehouse architecture with big data technologies, like the Hadoop framework and Apache Spark. It provides a hybrid solution in a complementary way. The main advantage of the proposed architecture is that it integrates the current features in traditional Data Warehouses and big data features acquired through integrating the traditional Data Warehouse with Hadoop and Spark ecosystems. Furthermore, it is tailored to handle a tremendous volume of data while maintaining availability, reliability, and scalability.
引用
收藏
页码:417 / 424
页数:8
相关论文
共 50 条
  • [1] Privacy-aware Big Data Warehouse Architecture
    Navuluri, Karthik
    Mukkamala, Ravi
    Ahmad, Aftab
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 341 - 344
  • [2] Present Situation and Prospect of Data Warehouse Architecture under the Background of Big Data
    Sun, Lihua
    Hu, Mu
    Ren, Kaiyin
    Ren, Mingming
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CLOUD COMPUTING COMPANION (ISCC-C), 2014, : 529 - 535
  • [3] On the Research of Data Warehouse in Big Data
    Qin, Hai-fei
    Qian, Zhi-ming
    Zhao, Yong-chao
    [J]. 2015 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2015, : 354 - 357
  • [4] Data Lake Lambda Architecture for Smart Grids Big Data Analytics
    Munshi, Amr A.
    Mohamed, Yasser Abdel-Rady I.
    [J]. IEEE ACCESS, 2018, 6 : 40463 - 40471
  • [5] Big data analytics of the technological equipment based on Data Lake architecture
    Kovalev, Ilya
    Nezhmetdinov, Ramil
    Kvashnin, Denis
    [J]. INTERNATIONAL CONFERENCE ON MODERN TRENDS IN MANUFACTURING TECHNOLOGIES AND EQUIPMENT: MECHANICAL ENGINEERING AND MATERIALS SCIENCE (ICMTMTE 2019), 2019, 298
  • [6] Big Data Augmentation with Data Warehouse: A Survey
    Aftab, Umar
    Siddiqui, Ghazanfar Farooq
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2785 - 2794
  • [7] Data Warehouse Design for Big Data in Academia
    Rudniy, Alex
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 979 - 992
  • [8] Big Data Augmentation with Data Warehouse: A Survey
    Aftab, Umar
    Siddiqui, Ghazanfar Farooq
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2775 - 2784
  • [9] Data Warehouse Architecture Classification
    Blazic, G.
    Poscic, P.
    Jaksic, D.
    [J]. 2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1491 - 1495
  • [10] Data warehouse architecture and design
    Rifaie, Mohammad
    Kianmehr, Keivan
    Alhajj, Reda
    Ridleya, Mick J.
    [J]. PROCEEDINGS OF THE 2008 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 58 - +