A Comparative Analysis of Machine Learning Techniques for National Glacier Mapping: Evaluating Performance through Spatial Cross-Validation in Perú

被引：1

作者：

Bueno, Marcelo ^{[1
]}

Macera, Briggitte ^{[1
]}

Montoya, Nilton ^{[1
]}

机构：

[1] Univ Nacl San Antonio Abad del Cusco UNSAAC, Dept Acad Agr, Cuzco 08000, Peru

来源：

WATER | 2023年 / 15卷 / 24期

关键词：

spatial modeling; machine learning; glacier mapping; glacier retreat; climate change; spatial autocorrelation; spatial cross-validation; CORDILLERA BLANCA; TROPICAL ANDES; VILCANOTA; CLASSIFICATION; ALGORITHMS; FRAMEWORK; ACCURACY; RETREAT; MODELS; TRENDS;

D O I：

10.3390/w15244214

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Accurate glacier mapping is crucial for assessing future water security in Andean ecosystems. Traditional accuracy assessment may be biased due to overlooking spatial autocorrelation during map validation. In recent years, spatial cross-validation (CV) strategies have been proposed in environmental and ecological modeling to reduce bias in predictive accuracy. In this study, we demonstrate the influence of spatial autocorrelation on the accuracy assessment of glacier surface predictive models. This is achieved by comparing the performance of several widely used machine learning algorithms including the gradient-boosting machines (GBM), k-nearest neighbors (KNN), random forest (RF), and logistic regression (LR) for mapping nine main Peruvian glacier regions. Spatial and non-spatial cross-validation methods were used to evaluate the model's classification errors in terms of the Matthews correlation coefficient. Performance differences of up to 18% were found between bias-reduced (spatial) and overoptimistic (non-spatial) cross-validation results. Regarding only spatial CV, the k-nearest neighbors were the overall best model across Huallanca (0.90), Huayhuasha (0.78), Huaytapallana (0.96), Raura (0.93), Urubamba (0.96), Vilcabamba (0.93), and Vilcanota (0.92) regions, consistently demonstrating the highest performance followed by logistic regression at Blanca (0.95) and Central (0.97) regions. Our validation approach, accounting for spatial characteristics, provides valuable insights for glacier mapping studies and future efforts on glacier retreat monitoring. Incorporating this approach improves the reliability of glacier mapping, guiding future national-level initiatives.

引用

页数：21

共 50 条

[31] Performance evaluation and comparative analysis of various machine learning techniques for diagnosis of breast cancer
Kanchanamani, M.
Perumal, Varalakshmi
BIOMEDICAL RESEARCH-INDIA, 2016, 27 (03): : 623 - 631
[32] Performance Evaluation and Comparative Analysis of Machine Learning Techniques to Predict the Chronic Kidney Disease
Malik, Majid Bashir
Ali, Mohd
Bashir, Sadiya
Ganie, Shahid Mohammad
ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 473 - 486
[33] Novel machine learning model to improve performance of an early warning system in hospitalized patients: a retrospective multisite cross-validation study
Salehinejad, Hojjat
Meehan, Anne M.
Rahman, Parvez A.
Core, Marcia A.
Borah, Bijan J.
Caraballo, Pedro J.
ECLINICALMEDICINE, 2023, 66
[34] Methodological Issues in Evaluating Machine Learning Models for EEG Seizure Prediction: Good Cross-Validation Accuracy Does Not Guarantee Generalization to New Patients
Shafiezadeh, Sina
Duma, Gian Marco
Mento, Giovanni
Danieli, Alberto
Antoniazzi, Lisa
Cristaldi, Fiorella Del Popolo
Bonanni, Paolo
Testolin, Alberto
APPLIED SCIENCES-BASEL, 2023, 13 (07):
[35] Comparative Analysis of Supervised Machine Learning Algorithms to Build a Predictive Model for Evaluating Students' Performance
El Guabassi, Inssaf
Bousalem, Zakaria
Marah, Rim
Qazdar, Aimad
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2021, 17 (02) : 90 - 105
[36] A simulation study to compare cross-validation versus holdout or external testing to assess the performance of machine learning based clinical prediction rules
Boellaard, R.
Eertink, J. J.
Lugtenburg, P. J.
Zwezerijnen, G. J.
Wiegers, S. E.
de Vet, H. C.
Zijlstra, J. M.
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2021, 48 (SUPPL 1) : S285 - S285
[37] High-Resolution Vegetation Mapping in Japan by Combining Sentinel-2 and Landsat 8 Based Multi-Temporal Datasets through Machine Learning and Cross-Validation Approach
Sharma, Ram C.
Hara, Keitarou
Tateishi, Ryutaro
LAND, 2017, 6 (03)
[38] Prediction quality of cattle behavior traits evaluated through different cross-validation strategies using wearable sensor data and machine learning algorithms
Ribeiro, Leonardo Augusto Coelho
Bresolin, Tiago
Rosa, Guilherme J. M.
Casagrande, Daniel Rume
Camargo Danes, Marina De Arruda
Dorea, Joao R.
JOURNAL OF ANIMAL SCIENCE, 2020, 98 : 383 - 383
[39] Quantum data encoding: a comparative analysis of classical-to-quantum mapping techniques and their impact on machine learning accuracy
Rath, Minati
Date, Hema
EPJ Quantum Technology, 2024, 11 (01)
[40] A comparative spatial analysis of flood susceptibility mapping using boosting machine learning algorithms in Rathnapura, Sri Lanka
Kurugama, Kumudu Madhawa
Kazama, So
Hiraga, Yusuke
Samarasuriya, Chaminda
JOURNAL OF FLOOD RISK MANAGEMENT, 2024, 17 (02):

← 1 2 3 4 5 →