BIG DATA ARCHITECTURES FOR DATA LAKES: A SYSTEMATIC LITERATURE REVIEW

被引:1
|
作者
Ramchand, Sonam [1 ]
Mahmood, Tariq [1 ]
机构
[1] Inst Business Adm IBA, Sch Math & Comp Sci SMCS, Karachi, Pakistan
关键词
Data Lakes; Data Lakes Big Data; Data Lake Management; Data Lake Storage; Data Lake Architecture;
D O I
10.1109/COMPSAC54236.2022.00179
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rise in big technologies has been demanding different concepts and practices for data exploitation; among them data lake is a recently emerged concept that is meant to deal with the heterogeneous data. Data lakes have been residing in the big data era since 2010, but there has not been any systematic review yet over data lake implementation. In this research survey, we conduct a review and provide a road map to researcher that elaborates what has happened to data lakes till now. We aim to give understanding for basic concept of data lakes and propose a novel data lake definition that could best describe the concept based on the literature review. One of the main problem while implementing data lake is deciding the technologies to use, this study covers technologies that can potentially be used for data lake implementation. Furthermore, data lake architectures and their variants are discussed in detail. Moreover, we analyze current state, challenges, pros and cons of the data lake. This study is all in one place for researchers who try to understand data lake concept, architectures, technologies, approaches, current state and challenges.
引用
收藏
页码:1141 / 1146
页数:6
相关论文
共 50 条
  • [1] The State of Big Data Reference Architectures: A Systematic Literature Review
    Ataei, Pouya
    Litchfield, Alan
    [J]. IEEE ACCESS, 2022, 10 : 113789 - 113807
  • [2] Agricultural Big Data Architectures in the Context of Climate Change: A Systematic Literature Review
    Cravero, Ania
    Bustamante, Ana
    Negrier, Marlene
    Galeas, Patricio
    [J]. SUSTAINABILITY, 2022, 14 (13)
  • [3] Big Data, European Data Strategy, And Innovation: A Systematic Review of The Literature
    Walter, Cicero Eduardo
    Valente, Tiago
    Polonia, Daniel Ferreira
    Au-Yong-Olivera, Manuel
    Veloso, Claudia Miranda
    [J]. QUALITY-ACCESS TO SUCCESS, 2021, 22 (184): : 16 - 20
  • [4] Big data analytics in healthcare: a systematic literature review
    Khanra, Sayantan
    Dhir, Amandeep
    Islam, Najmul
    Mantymaki, Matti
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2020, 14 (07) : 878 - 912
  • [5] Cleaning Big Data Streams: A Systematic Literature Review
    Alotaibi, Obaid
    Pardede, Eric
    Tomy, Sarath
    Bagui, Sikha
    Iacono, Mauro
    [J]. TECHNOLOGIES, 2023, 11 (04)
  • [6] 15 years of Big Data: a systematic literature review
    Tosi, Davide
    Kokaj, Redon
    Roccetti, Marco
    [J]. JOURNAL OF BIG DATA, 2024, 11 (01)
  • [7] Security and Privacy for Big Data: A Systematic Literature Review
    Nelson, Boel
    Olovsson, Tomas
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3693 - 3702
  • [8] A Systematic Literature Review of Big Data and the Hadoop frameworks
    Naidu, Devishree
    Thakur, Adi
    [J]. INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 2969 - 2973
  • [9] A systematic literature review of big data adoption in internationalization
    Dam, Nguyen Anh Khoa
    Le Dinh, Thang
    Menvielle, William
    [J]. JOURNAL OF MARKETING ANALYTICS, 2019, 7 (03) : 182 - 195
  • [10] Classifying Big Data Taxonomies: A Systematic Literature Review
    Staegemann, Daniel
    Volk, Matthias
    Grube, Alexandra
    Hintsch, Johannes
    Bosse, Sascha
    Haeusler, Robert
    Nahhas, Abdulrahman
    Pohl, Matthias
    Turowski, Klaus
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 267 - 278