Metadata Systems for Data Lakes: Models and Features

被引:18
|
作者
Sawadogo, Pegdwende N. [1 ]
Scholly, Etienne [1 ,2 ]
Favre, Cecile [1 ]
Ferey, Eric [2 ]
Loudcher, Sabine [1 ]
Darmont, Jerome [1 ]
机构
[1] Univ Lyon, Lyon 2, ERIC EA 3083, Lyon, France
[2] BIAL X, Limonest, France
关键词
Data lakes; Metadata modeling; Metadata management; BIG DATA;
D O I
10.1007/978-3-030-30278-8_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past decade, the data lake concept has emerged as an alternative to data warehouses for storing and analyzing big data. A data lake allows storing data without any predefined schema. Therefore, data querying and analysis depend on a metadata system that must be efficient and comprehensive. However, metadata management in data lakes remains a current issue and the criteria for evaluating its effectiveness are more or less nonexistent. In this paper, we introduce MEDAL, a generic, graph-based model for metadata management in data lakes. We also propose evaluation criteria for data lake metadata systems through a list of expected features. Eventually, we show that our approach is more comprehensive than existing metadata systems.
引用
收藏
页码:440 / 451
页数:12
相关论文
共 50 条
  • [31] Metadata as Data Intelligence
    Greenberg, Jane
    Wu, Mingfang
    Liu, Wei
    Liu, Fenghong
    [J]. DATA INTELLIGENCE, 2023, 5 (01) : 1 - 5
  • [32] A Proposed Big Data Architecture Using Data Lakes for Education Systems
    Oukhouya, Lamya
    El Haddadi, Anass
    Er-Raha, Brahim
    Asri, Hiba
    Laaz, Naziha
    [J]. EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY, 2023, 147 : 53 - 62
  • [33] Performative Metadata: Reliability Frameworks and Accounting Frameworks in Content Aggregation Data Models
    Bettivia, Rhiannon
    Stainforth, Elizabeth
    [J]. TRANSFORMING DIGITAL WORLDS, ICONFERENCE 2018, 2018, 10766 : 592 - 597
  • [34] Comparative Analysis of Metadata Models on e-Government Open Data Platforms
    Milic, Petar
    Veljkovic, Natasa
    Stoimenov, Leonid
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (01) : 119 - 130
  • [35] MetaStore: an adaptive metadata management framework for heterogeneous metadata models
    Prabhune, Ajinkya
    Stotzka, Rainer
    Sakharkar, Vaibhav
    Hesser, Jurgen
    Gertz, Michael
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2018, 36 (01) : 153 - 194
  • [36] Systematizing the record of earth's shapes and colors: A framework for data and metadata models
    Goldberg, AM
    [J]. IGARSS 2003: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS I - VII, PROCEEDINGS: LEARNING FROM EARTH'S SHAPES AND SIZES, 2003, : 1395 - 1395
  • [37] Enriching and enhancing moving images with Linked Data An exploration in the alignment of metadata models
    Gracy, Karen F.
    [J]. JOURNAL OF DOCUMENTATION, 2018, 74 (02) : 354 - 371
  • [38] Data, metadata and information
    [J]. 1600, (10):
  • [39] Data, Metadata, and Ted
    Borgman, Christine L.
    [J]. INTERTWINGLED: THE WORK AND INFLUENCE OF TED NELSON, 2015, : 67 - 74
  • [40] Metadata systems architecture
    Morgan, O
    [J]. SMPTE MOTION IMAGING JOURNAL, 2003, 112 (04): : 129 - 135