A Comparative Analysis of Machine Learning Techniques for National Glacier Mapping: Evaluating Performance through Spatial Cross-Validation in Perú

被引:1
|
作者
Bueno, Marcelo [1 ]
Macera, Briggitte [1 ]
Montoya, Nilton [1 ]
机构
[1] Univ Nacl San Antonio Abad del Cusco UNSAAC, Dept Acad Agr, Cuzco 08000, Peru
关键词
spatial modeling; machine learning; glacier mapping; glacier retreat; climate change; spatial autocorrelation; spatial cross-validation; CORDILLERA BLANCA; TROPICAL ANDES; VILCANOTA; CLASSIFICATION; ALGORITHMS; FRAMEWORK; ACCURACY; RETREAT; MODELS; TRENDS;
D O I
10.3390/w15244214
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate glacier mapping is crucial for assessing future water security in Andean ecosystems. Traditional accuracy assessment may be biased due to overlooking spatial autocorrelation during map validation. In recent years, spatial cross-validation (CV) strategies have been proposed in environmental and ecological modeling to reduce bias in predictive accuracy. In this study, we demonstrate the influence of spatial autocorrelation on the accuracy assessment of glacier surface predictive models. This is achieved by comparing the performance of several widely used machine learning algorithms including the gradient-boosting machines (GBM), k-nearest neighbors (KNN), random forest (RF), and logistic regression (LR) for mapping nine main Peruvian glacier regions. Spatial and non-spatial cross-validation methods were used to evaluate the model's classification errors in terms of the Matthews correlation coefficient. Performance differences of up to 18% were found between bias-reduced (spatial) and overoptimistic (non-spatial) cross-validation results. Regarding only spatial CV, the k-nearest neighbors were the overall best model across Huallanca (0.90), Huayhuasha (0.78), Huaytapallana (0.96), Raura (0.93), Urubamba (0.96), Vilcabamba (0.93), and Vilcanota (0.92) regions, consistently demonstrating the highest performance followed by logistic regression at Blanca (0.95) and Central (0.97) regions. Our validation approach, accounting for spatial characteristics, provides valuable insights for glacier mapping studies and future efforts on glacier retreat monitoring. Incorporating this approach improves the reliability of glacier mapping, guiding future national-level initiatives.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Performance evaluation and comparative analysis of various machine learning techniques for diagnosis of breast cancer
    Kanchanamani, M.
    Perumal, Varalakshmi
    BIOMEDICAL RESEARCH-INDIA, 2016, 27 (03): : 623 - 631
  • [32] Performance Evaluation and Comparative Analysis of Machine Learning Techniques to Predict the Chronic Kidney Disease
    Malik, Majid Bashir
    Ali, Mohd
    Bashir, Sadiya
    Ganie, Shahid Mohammad
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 473 - 486
  • [33] Novel machine learning model to improve performance of an early warning system in hospitalized patients: a retrospective multisite cross-validation study
    Salehinejad, Hojjat
    Meehan, Anne M.
    Rahman, Parvez A.
    Core, Marcia A.
    Borah, Bijan J.
    Caraballo, Pedro J.
    ECLINICALMEDICINE, 2023, 66
  • [34] Methodological Issues in Evaluating Machine Learning Models for EEG Seizure Prediction: Good Cross-Validation Accuracy Does Not Guarantee Generalization to New Patients
    Shafiezadeh, Sina
    Duma, Gian Marco
    Mento, Giovanni
    Danieli, Alberto
    Antoniazzi, Lisa
    Cristaldi, Fiorella Del Popolo
    Bonanni, Paolo
    Testolin, Alberto
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [35] Comparative Analysis of Supervised Machine Learning Algorithms to Build a Predictive Model for Evaluating Students' Performance
    El Guabassi, Inssaf
    Bousalem, Zakaria
    Marah, Rim
    Qazdar, Aimad
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2021, 17 (02) : 90 - 105
  • [36] A simulation study to compare cross-validation versus holdout or external testing to assess the performance of machine learning based clinical prediction rules
    Boellaard, R.
    Eertink, J. J.
    Lugtenburg, P. J.
    Zwezerijnen, G. J.
    Wiegers, S. E.
    de Vet, H. C.
    Zijlstra, J. M.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2021, 48 (SUPPL 1) : S285 - S285
  • [37] High-Resolution Vegetation Mapping in Japan by Combining Sentinel-2 and Landsat 8 Based Multi-Temporal Datasets through Machine Learning and Cross-Validation Approach
    Sharma, Ram C.
    Hara, Keitarou
    Tateishi, Ryutaro
    LAND, 2017, 6 (03)
  • [38] Prediction quality of cattle behavior traits evaluated through different cross-validation strategies using wearable sensor data and machine learning algorithms
    Ribeiro, Leonardo Augusto Coelho
    Bresolin, Tiago
    Rosa, Guilherme J. M.
    Casagrande, Daniel Rume
    Camargo Danes, Marina De Arruda
    Dorea, Joao R.
    JOURNAL OF ANIMAL SCIENCE, 2020, 98 : 383 - 383
  • [39] Quantum data encoding: a comparative analysis of classical-to-quantum mapping techniques and their impact on machine learning accuracy
    Rath, Minati
    Date, Hema
    EPJ Quantum Technology, 2024, 11 (01)
  • [40] A comparative spatial analysis of flood susceptibility mapping using boosting machine learning algorithms in Rathnapura, Sri Lanka
    Kurugama, Kumudu Madhawa
    Kazama, So
    Hiraga, Yusuke
    Samarasuriya, Chaminda
    JOURNAL OF FLOOD RISK MANAGEMENT, 2024, 17 (02):