Automated curation of spatial metadata in environmental monitoring data

被引:0
|
作者
Mutlu, Ilhan [1 ]
Hackermueller, Joerg [1 ,2 ]
Schor, Jana [1 ,2 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Computat Biol & Chem, D-04318 Leipzig, Germany
[2] Univ Leipzig, Fac Math & Comp Sci, Dept Comp Sci, D-04109 Leipzig, Germany
关键词
Environmental monitoring; Spatial data accuracy; Automated data curation; Big data analytics; AI applications in hydrology;
D O I
10.1016/j.ecoinf.2025.103038
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Spatial data accuracy in environmental monitoring is crucial for practical large-scale data analytics and the development of AI models. In this context, spatial data is metadata and faces the same challenges as any other metadata, like missing values, false or contradicting information, formatting problems of textual data and numbers, the usage of different languages, and more. These issues severely limit the usability of the data. With this study, we provide an automatic approach, CleanGeoStreamR, to resolve as many of these issues as possible for the spatially annotated environmental monitoring database. We substantially increased the quality of the spatial metadata and, therefore, the quantity of data points that can be used in large-scale data analytics and AI applications. Further, our goal is to raise awareness about the issues related to spatial metadata and promote the implementation of our concepts in other environmental monitoring data sources. Advanced understanding and the availability of automatic approaches like the presented method will substantially contribute to making environmental monitoring data FAIR and enhance its usability in the era of Big Data and AI.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Anatomy of Metadata for Data Curation
    Visengeriyeva, Larysa
    Abedjan, Ziawasch
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2020, 12 (03):
  • [2] Research data and metadata curation as institutional issues
    Mayernik, Matthew S.
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (04) : 973 - 993
  • [3] Louvre: A Framework for Metadata Curation in Data Ecosystem
    Oliveira, Marcelo Iury S.
    Loscio, Bernadette Farias
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [4] Institutional Structures for Research Data and Metadata Curation
    Mayernik, Matthew S.
    JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 401 - 402
  • [5] Metadata functional requirements for genomic data practice and curation
    Huang, Hong
    Qin, Jian
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2024, 29 (02): : 3 - 29
  • [6] The role of data and metadata archives in environmental monitoring and research programs
    Michener, WK
    NORTH AMERICAN SCIENCE SYMPOSIUM: TOWARD A UNIFIED FRAMEWORK FOR INVENTORYING AND MONITORING FOREST ECOSYSTEM RESOURCES, 1999, (12): : 441 - 444
  • [7] Automated data curation and data governance automation
    Talburt, John R.
    Ehrlinger, Lisa
    Magruder, Justin
    FRONTIERS IN BIG DATA, 2023, 6
  • [8] The Curation of Environmental Health Data
    Knight, Mel
    JOURNAL OF ENVIRONMENTAL HEALTH, 2012, 74 (10) : 4 - 5
  • [9] The importance of metrological metadata in the environmental monitoring
    Santana, Marcio A. A.
    Guimaraes, Patricia L. O.
    Almeida, Eugenio S.
    Eklin, Tero
    8TH BRAZILIAN CONGRESS ON METROLOGY (METROLOGIA 2015), 2016, 733
  • [10] Guest editorial: large-scale data curation and metadata management
    Eltabakh, Mohamed
    Glavic, Boris
    DISTRIBUTED AND PARALLEL DATABASES, 2018, 36 (01) : 5 - 8