Automated curation of spatial metadata in environmental monitoring data

被引:0
|
作者
Mutlu, Ilhan [1 ]
Hackermueller, Joerg [1 ,2 ]
Schor, Jana [1 ,2 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Computat Biol & Chem, D-04318 Leipzig, Germany
[2] Univ Leipzig, Fac Math & Comp Sci, Dept Comp Sci, D-04109 Leipzig, Germany
关键词
Environmental monitoring; Spatial data accuracy; Automated data curation; Big data analytics; AI applications in hydrology;
D O I
10.1016/j.ecoinf.2025.103038
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Spatial data accuracy in environmental monitoring is crucial for practical large-scale data analytics and the development of AI models. In this context, spatial data is metadata and faces the same challenges as any other metadata, like missing values, false or contradicting information, formatting problems of textual data and numbers, the usage of different languages, and more. These issues severely limit the usability of the data. With this study, we provide an automatic approach, CleanGeoStreamR, to resolve as many of these issues as possible for the spatially annotated environmental monitoring database. We substantially increased the quality of the spatial metadata and, therefore, the quantity of data points that can be used in large-scale data analytics and AI applications. Further, our goal is to raise awareness about the issues related to spatial metadata and promote the implementation of our concepts in other environmental monitoring data sources. Advanced understanding and the availability of automatic approaches like the presented method will substantially contribute to making environmental monitoring data FAIR and enhance its usability in the era of Big Data and AI.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Visual analysis of geographic metadata in a spatial data infrastructure
    Albertoni, R
    Bertone, A
    De Martino, M
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 861 - 865
  • [32] MedShift: Automated Identification of Shift Data for Medical Image Dataset Curation
    Guo, Xiaoyuan
    Gichoya, Judy Wawira
    Trivedi, Hari
    Purkayastha, Saptarshi
    Banerjee, Imon
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (08) : 3936 - 3947
  • [33] Automated Collection and Processing Data in the System of Monitoring Environmental Pollution by Industrial Enterprises
    Belov, A. A.
    Kropotov, Y. A.
    2018 INTERNATIONAL SCIENTIFIC MULTI-CONFERENCE ON INDUSTRIAL ENGINEERING AND MODERN TECHNOLOGIES (FAREASTCON), 2018,
  • [34] Data and metadata management in distributed environmental information systems
    Koschel, A
    Kramer, R
    Nikolai, R
    Lukacs, G
    Heinemeier, T
    ENVIRONMENTAL SOFTWARE SYSTEMS, VOL 2, 1997, : 144 - 151
  • [35] Automated Quality Assessment of Metadata across Open Data Portals
    Neumaier, Sebastian
    Umbrich, Jurgen
    Polleres, Axel
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 8 (01):
  • [36] Automated diagnosis of data-model conflicts using metadata
    Chen, RO
    Altman, RB
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (05) : 374 - 392
  • [37] Metadata as tools for integration of environmental data and information production
    Vyazilov, E
    Mikhailov, N
    Ibragimova, V
    Puzova, N
    INTEGRATED TECHNOLOGIES FOR ENVIRONMENTAL MONITORING AND INFORMATION PRODUCTION, 2003, 23 : 425 - 434
  • [38] Data and metadata management automation for an effective approach to sharing environmental data
    D'Amore, F.
    Cinnirella, S.
    Pirrone, N.
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HEAVY METALS IN THE ENVIRONMENT, 2013, 1
  • [39] AUTOMATED DATA INTERPRETATION IN AN AUTOMATED ENVIRONMENTAL LABORATORY
    ELLING, JW
    KLATT, LN
    UNRUH, WP
    LABORATORY ROBOTICS AND AUTOMATION, 1994, 6 (02) : 73 - 78
  • [40] The Georgia automated environmental monitoring network
    Hoogenboom, G
    22ND CONFERENCE ON AGRICULTURAL & FOREST METEOROLOGY WITH SYMPOSIUM ON FIRE & FOREST METEOROLOGY/12TH CONFERENCE ON BIOMETEOROLOGY & AEROBIOLOGY, 1996, : 343 - 346