Automated curation of spatial metadata in environmental monitoring data

被引:0
|
作者
Mutlu, Ilhan [1 ]
Hackermueller, Joerg [1 ,2 ]
Schor, Jana [1 ,2 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Computat Biol & Chem, D-04318 Leipzig, Germany
[2] Univ Leipzig, Fac Math & Comp Sci, Dept Comp Sci, D-04109 Leipzig, Germany
关键词
Environmental monitoring; Spatial data accuracy; Automated data curation; Big data analytics; AI applications in hydrology;
D O I
10.1016/j.ecoinf.2025.103038
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Spatial data accuracy in environmental monitoring is crucial for practical large-scale data analytics and the development of AI models. In this context, spatial data is metadata and faces the same challenges as any other metadata, like missing values, false or contradicting information, formatting problems of textual data and numbers, the usage of different languages, and more. These issues severely limit the usability of the data. With this study, we provide an automatic approach, CleanGeoStreamR, to resolve as many of these issues as possible for the spatially annotated environmental monitoring database. We substantially increased the quality of the spatial metadata and, therefore, the quantity of data points that can be used in large-scale data analytics and AI applications. Further, our goal is to raise awareness about the issues related to spatial metadata and promote the implementation of our concepts in other environmental monitoring data sources. Advanced understanding and the availability of automatic approaches like the presented method will substantially contribute to making environmental monitoring data FAIR and enhance its usability in the era of Big Data and AI.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Automated preprocessing of environmental data
    Ronkko, Mauno
    Heikkinen, Jani
    Kotovirta, Ville
    Chandrasekar, Venkatachalam
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 45 : 13 - 24
  • [42] Data Curation
    Woo, Jung-Ah
    ARTFORUM INTERNATIONAL, 2013, 52 (02): : 316 - 317
  • [43] GIS, geostatistics, metadata banking, and tree-based models for data analysis and mapping in environmental monitoring and epidemiology
    Schroeder, Winfried
    INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2006, 296 : 23 - 36
  • [44] Automated data monitoring/collecting
    不详
    PLASTICS ENGINEERING, 2002, 58 (12) : 40 - 40
  • [45] Automated Test Data Monitoring
    Bosas, Joseph
    2017 IEEE AUTOTESTCON, 2017, : 116 - 121
  • [46] Method for monitoring environmental flows with high spatial and temporal resolution satellite data
    Yuming Lu
    Bingfang Wu
    Nana Yan
    Hongwei Zeng
    Yong Guo
    Weiwei Zhu
    Hao Zhang
    Environmental Monitoring and Assessment, 2022, 194
  • [47] Importance of timely metadata curation to the global surveillance of genetic diversity
    Crandall, Eric D.
    Toczydlowski, Rachel H.
    Liggins, Libby
    Holmes, Ann E.
    Ghoojaei, Maryam
    Gaither, Michelle R.
    Wham, Briana E.
    Pritt, Andrea L.
    Noble, Cory
    Anderson, Tanner J.
    Barton, Randi L.
    Berg, Justin T.
    Beskid, Sofia G.
    Delgado, Alonso
    Farrell, Emily
    Himmelsbach, Nan
    Queeno, Samantha R.
    Trinh, Thienthanh
    Weyand, Courtney
    Bentley, Andrew
    Deck, John
    Riginos, Cynthia
    Bradburd, Gideon S.
    Toonen, Robert J.
    CONSERVATION BIOLOGY, 2023, 37 (04)
  • [48] Method for monitoring environmental flows with high spatial and temporal resolution satellite data
    Lu, Yuming
    Wu, Bingfang
    Yan, Nana
    Zeng, Hongwei
    Guo, Yong
    Zhu, Weiwei
    Zhang, Hao
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2022, 194 (01)
  • [49] Metadata for geo-spatial data sharing: A comparative analysis
    Tschangho John Kim
    The Annals of Regional Science, 1999, 33 : 171 - 181
  • [50] Metadata for geo-spatial data sharing: A comparative analysis
    Kim, TJ
    ANNALS OF REGIONAL SCIENCE, 1999, 33 (02): : 171 - 181