Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引:1
|
作者
Yang, Shilong [1 ]
Tan, Jiayao [1 ]
Luo, Danyuan [1 ]
Wang, Yuzhou [2 ,3 ]
Guo, Xu [1 ]
Zhu, Qiuyu [1 ,4 ]
Ma, Chuanming [1 ]
Xiong, Hanxiang [1 ]
机构
[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China
[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China
[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China
关键词
Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;
D O I
10.1016/j.cageo.2024.105723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Comparative study of different machine learning models in landslide susceptibility assessment: A case study of Conghua District, Guangzhou, China
    Zhang, Ao
    Zhao, Xin-wen
    Zhao, Xing-yuezi
    Zheng, Xiao-zhan
    Zeng, Min
    Huang, Xuan
    Wu, Pan
    Jiang, Tuo
    Wang, Shi-chang
    He, Jun
    Li, Yi-yong
    CHINA GEOLOGY, 2024, 7 (01) : 104 - 115
  • [42] A comparative study of machine learning models for construction costs prediction with natural gradient boosting algorithm and SHAP analysis
    Das P.
    Kashem A.
    Hasan I.
    Islam M.
    Asian Journal of Civil Engineering, 2024, 25 (4) : 3301 - 3316
  • [43] Comparative study of different machine learning models in landslide susceptibility assessment:A case study of Conghua District,Guangzhou,China
    Ao Zhang
    Xin-wen Zhao
    Xing-yuezi Zhao
    Xiao-zhan Zheng
    Min Zeng
    Xuan Huang
    Pan Wu
    Tuo Jiang
    Shi-chang Wang
    Jun He
    Yi-yong Li
    China Geology, 2024, 7 (01) : 104 - 115
  • [44] Comparative Study of Deep Learning Models Versus Machine Learning Models for Wind Turbine Intelligent Health Diagnosis Systems
    Rababaah, Aaron Rasheed
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10875 - 10899
  • [45] Comparative Study of Deep Learning Models Versus Machine Learning Models for Wind Turbine Intelligent Health Diagnosis Systems
    Aaron Rasheed Rababaah
    Arabian Journal for Science and Engineering, 2023, 48 : 10875 - 10899
  • [46] Enhancing landslide management with hyper-tuned machine learning and deep learning models: Predicting susceptibility and analyzing sensitivity and uncertainty
    Dahim, Mohammed
    Alqadhi, Saeed
    Mallick, Javed
    FRONTIERS IN ECOLOGY AND EVOLUTION, 2023, 11
  • [47] Flood susceptibility modelling using advanced ensemble machine learning models
    Abu Reza Md Towfiqul Islam
    Swapan Talukdar
    Susanta Mahato
    Sonali Kundu
    Kutub Uddin Eibek
    Quoc Bao Pham
    Alban Kuriqi
    Nguyen Thi Thuy Linh
    Geoscience Frontiers, 2021, (03) : 66 - 83
  • [48] Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models
    Mosavi, Amirhosein
    Hosseini, Farzaneh Sajedi
    Choubin, Bahram
    Abdolshahnejad, Mahsa
    Gharechaee, Hamidreza
    Lahijanzadeh, Ahmadreza
    Dineva, Adrienn A.
    WATER, 2020, 12 (10)
  • [49] Flood susceptibility modelling using advanced ensemble machine learning models
    Islam, Abu Reza Md Towfiqul
    Talukdar, Swapan
    Mahato, Susanta
    Kundu, Sonali
    Eibek, Kutub Uddin
    Quoc Bao Pham
    Kuriqi, Alban
    Nguyen Thi Thuy Linh
    GEOSCIENCE FRONTIERS, 2021, 12 (03)
  • [50] Flood susceptibility modelling using advanced ensemble machine learning models
    Abu Reza Md Towfiqul Islam
    Swapan Talukdar
    Susanta Mahato
    Sonali Kundu
    Kutub Uddin Eibek
    Quoc Bao Pham
    Alban Kuriqi
    Nguyen Thi Thuy Linh
    Geoscience Frontiers, 2021, 12 (03) : 66 - 83