Predictive Performances of Ensemble Machine Learning Algorithms in Landslide Susceptibility Mapping Using Random Forest, Extreme Gradient Boosting (XGBoost) and Natural Gradient Boosting (NGBoost)

被引:109
|
作者
Kavzoglu, Taskin [1 ]
Teke, Alihan [1 ]
机构
[1] Gebze Tech Univ, Dept Geomat Engn, TR-41400 Gebze, Kocaeli, Turkey
关键词
Ensemble learning; Random forest; XGBoost; NGBoost; SHAP; SUPPORT VECTOR MACHINE; BLACK-SEA REGION; ARTIFICIAL NEURAL-NETWORKS; LOGISTIC-REGRESSION; FREQUENCY RATIO; DECISION TREE; MODELS; AREA; GIS; TURKEY;
D O I
10.1007/s13369-022-06560-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Across the globe, landslides have been recognized as one of the most detrimental geological calamities, especially in hilly terrains. However, the correct determination of landslide-prone fields remained a challenging task due to the nonlinear, complex, and inconsistent nature of landslides. Therefore, it is essential to apply methods with superior capabilities in dealing with real-world problems for properly delineating potential landslide zones. Recently, ensemble learning techniques have been drawn intense interest in landslide susceptibility mapping studies due to their distinct advantages. This present work intended to propose natural gradient boosting (NGBoost), a novel member of the ensemble learning family, for modeling landslide susceptibility for Macka County of Trabzon province, Turkey. The predictive performance of NGBoost was compared to ensemble-based machine learning methods, namely random forest (RF) and XGBoost using five accuracy metrics including overall accuracy (OA), F1 score, Kappa coefficient, area under curve (AUC) value, and root-mean-square error to evaluate its competence and robustness. Besides, SHAP based on the game theory approach was implemented to interpret the influences of the predisposing factors on the produced model. Results indicated that the NGBoost method utilized for landslide susceptibility mapping problem for the first time had the greatest predictive ability (AUC = 0.898), followed by XGBoost (AUC = 0.871) and RF (AUC = 0.863), and outperformed the XGBoost and RF by approximately 6% in terms of OA. McNemar's statistical significance test results also confirmed the superiority of the proposed NGBoost method over the RF and XGBoost algorithms.
引用
收藏
页码:7367 / 7385
页数:19
相关论文
共 50 条