Importance of Spatial Autocorrelation in Machine Learning Modeling of Polymetallic Nodules, Model Uncertainty and Transferability at Local Scale

被引:9
|
作者
Gazis, Iason-Zois [1 ]
Greinert, Jens [1 ,2 ]
机构
[1] GEOMAR Helmholtz Ctr Ocean Res Kiel, Wischhofstr 1-3, D-24148 Kiel, Germany
[2] Christian Albrechts Univ Kiel, Inst Geosci, Ludewig Meyn Str 10-12, D-24098 Kiel, Germany
关键词
polymetallic nodules; spatial autocorrelation; cross-validation; model transferability; DEEP-SEA SEDIMENTS; SPECIES DISTRIBUTION MODELS; PERU BASIN; CROSS-VALIDATION; STRATEGIES; MANGANESE; IMPACT; ASSOCIATION; VARIABILITY; PERFORMANCE;
D O I
10.3390/min11111172
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Machine learning spatial modeling is used for mapping the distribution of deep-sea polymetallic nodules (PMN). However, the presence and influence of spatial autocorrelation (SAC) have not been extensively studied. SAC can provide information regarding the variable selection before modeling, and it results in erroneous validation performance when ignored. ML models are also problematic when applied in areas far away from the initial training locations, especially if the (new) area to be predicted covers another feature space. Here, we study the spatial distribution of PMN in a geomorphologically heterogeneous area of the Peru Basin, where SAC of PMN exists. The local Moran's I analysis showed that there are areas with a significantly higher or lower number of PMN, associated with different backscatter values, aspect orientation, and seafloor geomorphological characteristics. A quantile regression forests (QRF) model is used using three cross-validation (CV) techniques (random-, spatial-, and cluster-blocking). We used the recently proposed "Area of Applicability" method to quantify the geographical areas where feature space extrapolation occurs. The results show that QRF predicts well in morphologically similar areas, with spatial block cross-validation being the least unbiased method. Conversely, random-CV overestimates the prediction performance. Under new conditions, the model transferability is reduced even on local scales, highlighting the need for spatial model-based dissimilarity analysis and transferability assessment in new areas.
引用
收藏
页数:33
相关论文
共 24 条
  • [1] Spatial Autocorrelation Incorporated Machine Learning Model for Geotechnical Subsurface Modeling
    Kim, Hyeong-Joo
    Mawuntu, Kevin Bagas Arifki
    Park, Tae-Woong
    Kim, Hyeong-Soo
    Park, Jun-Young
    Jeong, Yeong-Seong
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [2] Spatial modeling and uncertainty characterization of polymetallic nodules in the Clarion-Clipperton zone
    Vishnu, Hari
    Kalyan, Bharath
    Chitre, Mandar
    [J]. OCEANS 2017 - ANCHORAGE, 2017,
  • [3] Importance of spatial autocorrelation in modeling bird distributions at a continental scale
    Bahn, Volker
    O'Connor, Raymond J.
    Krohn, William B.
    [J]. ECOGRAPHY, 2006, 29 (06) : 835 - 844
  • [4] On the transferability of local model-agnostic explanations of machine learning models to unseen data
    Lopez Gonzalez, Alba Maria
    Garcia-Cuesta, Esteban
    [J]. IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS 2024, IEEE EAIS 2024, 2024, : 243 - 252
  • [5] FEATURE IMPORTANCE AND UNCERTAINTY QUANTIFICATION OF MACHINE LEARNING MODEL IN MATERIALS SCIENCE
    Liu, Zhichen
    Singh, Akash
    Li, Yumeng
    [J]. PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 11, 2023,
  • [6] Machine-learning-based modeling of coarse-scale error, with application to uncertainty quantification
    Trehan, Sumeet
    Durlofsky, Louis J.
    [J]. COMPUTATIONAL GEOSCIENCES, 2018, 22 (04) : 1093 - 1113
  • [7] Machine-learning-based modeling of coarse-scale error, with application to uncertainty quantification
    Sumeet Trehan
    Louis J. Durlofsky
    [J]. Computational Geosciences, 2018, 22 : 1093 - 1113
  • [8] Testing for the transferability of a water quality model to areas of similar spatial and temporal scale based on an uncertainty vs. complexity hypothesis
    Lindenschmidt, Karl-Erich
    [J]. ECOLOGICAL COMPLEXITY, 2006, 3 (03) : 241 - 252
  • [9] An Interpretable Multi-Model Machine Learning Approach for Spatial Mapping of Deep-Sea Polymetallic Nodule Occurrences
    Gazis, Iason-Zois
    Charlet, Francois
    Greinert, Jens
    [J]. NATURAL RESOURCES RESEARCH, 2024,
  • [10] Enhancing turbulence modeling: Machine learning for pressure-strain correlation and uncertainty quantification in the Reynolds stress model
    Sudharsun, G.
    Warrior, Hari V.
    [J]. PHYSICS OF FLUIDS, 2023, 35 (12)