Automated curation of spatial metadata in environmental monitoring data

被引:0
|
作者
Mutlu, Ilhan [1 ]
Hackermueller, Joerg [1 ,2 ]
Schor, Jana [1 ,2 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Computat Biol & Chem, D-04318 Leipzig, Germany
[2] Univ Leipzig, Fac Math & Comp Sci, Dept Comp Sci, D-04109 Leipzig, Germany
关键词
Environmental monitoring; Spatial data accuracy; Automated data curation; Big data analytics; AI applications in hydrology;
D O I
10.1016/j.ecoinf.2025.103038
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Spatial data accuracy in environmental monitoring is crucial for practical large-scale data analytics and the development of AI models. In this context, spatial data is metadata and faces the same challenges as any other metadata, like missing values, false or contradicting information, formatting problems of textual data and numbers, the usage of different languages, and more. These issues severely limit the usability of the data. With this study, we provide an automatic approach, CleanGeoStreamR, to resolve as many of these issues as possible for the spatially annotated environmental monitoring database. We substantially increased the quality of the spatial metadata and, therefore, the quantity of data points that can be used in large-scale data analytics and AI applications. Further, our goal is to raise awareness about the issues related to spatial metadata and promote the implementation of our concepts in other environmental monitoring data sources. Advanced understanding and the availability of automatic approaches like the presented method will substantially contribute to making environmental monitoring data FAIR and enhance its usability in the era of Big Data and AI.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Spatial Data Infrastructures Its metadata and analysis
    Roy, Sayon
    Das, Subhashis
    2015 4TH INTERNATIONAL SYMPOSIUM ON EMERGING TRENDS AND TECHNOLOGIES IN LIBRARIES AND INFORMATION SERVICES (ETTLIS), 2015, : 43 - 51
  • [22] The role of metadata and strategies to detect and control temporal data bias in environmental monitoring of soil contamination
    Desaules, Andre
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2012, 184 (11) : 7023 - 7039
  • [23] The role of metadata and strategies to detect and control temporal data bias in environmental monitoring of soil contamination
    André Desaules
    Environmental Monitoring and Assessment, 2012, 184 : 7023 - 7039
  • [24] Long-term digital metadata curation
    Shaon, Arif
    Proceedings of the UK e-Science All Hands Meeting 2006, 2006, : 193 - 200
  • [25] Automated environmental compliance monitoring of rivers with IoT and open government data
    Miasayedava, Lizaveta
    McBride, Keegan
    Tuhtan, Jeffrey Andrew
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 303
  • [26] Medical data quality assessment: On the development of an automated framework for medical data curation
    Pezoulas, Vasileios C.
    Kourou, Konstantina D.
    Kalatzis, Fanis
    Exarchos, Themis P.
    Venetsanopoulou, Aliki
    Zampeli, Evi
    Gandolfo, Saviana
    Skopouli, Fotini
    De Vita, Salvatore
    Tzioufas, Athanasios G.
    Fotiadis, Dimitrios I.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 107 : 270 - 283
  • [27] Automated workflows for data curation and standardization of chemical structures for QSAR modeling
    Mansouri, Kamel
    McEachran, Andrew
    Grulke, Christopher
    Richard, Ann
    Judson, Richard
    Williams, Antony
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [28] Automated Electronic Health Record Data Extraction and Curation Using ExtractEHR
    Miller, Tamara P.
    Getz, Kelly D.
    Krause, Edward
    Jo, Yun Gun
    Charapala, Sandhya
    Gramatages, M. Monica
    Rabin, Karen
    Scheurer, Michael E.
    Wilkes, Jennifer J.
    Fisher, Brian T.
    Aplenc, Richard
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [29] Avant-garde: an automated data-driven DIA data curation tool
    Vaca Jacome, Alvaro Sebastian
    Peckner, Ryan
    Shulman, Nicholas
    Krug, Karsten
    DeRuff, Katherine C.
    Officer, Adam
    Christianson, Karen E.
    MacLean, Brendan
    MacCoss, Michael J.
    Carr, Steven A.
    Jaffe, Jacob D.
    NATURE METHODS, 2020, 17 (12) : 1237 - +
  • [30] Avant-garde: an automated data-driven DIA data curation tool
    Alvaro Sebastian Vaca Jacome
    Ryan Peckner
    Nicholas Shulman
    Karsten Krug
    Katherine C. DeRuff
    Adam Officer
    Karen E. Christianson
    Brendan MacLean
    Michael J. MacCoss
    Steven A. Carr
    Jacob D. Jaffe
    Nature Methods, 2020, 17 : 1237 - 1244