Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引:1
|
作者
Yang, Shilong [1 ]
Tan, Jiayao [1 ]
Luo, Danyuan [1 ]
Wang, Yuzhou [2 ,3 ]
Guo, Xu [1 ]
Zhu, Qiuyu [1 ,4 ]
Ma, Chuanming [1 ]
Xiong, Hanxiang [1 ]
机构
[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China
[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China
[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China
关键词
Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;
D O I
10.1016/j.cageo.2024.105723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment
    Dieu Tien Bui
    Tsangaratos, Paraskevas
    Viet-Tien Nguyen
    Ngo Van Liem
    Phan Trong Trinh
    CATENA, 2020, 188
  • [32] Spatial prediction and mapping of landslide susceptibility using machine learning models
    Chen, Yu
    NATURAL HAZARDS, 2025,
  • [33] Combining Evolutionary Algorithms and Machine Learning Models in Landslide Susceptibility Assessments
    Chen, Wei
    Chen, Yunzhi
    Tsangaratos, Paraskevas
    Ilia, Ioanna
    Wang, Xiaojing
    REMOTE SENSING, 2020, 12 (23) : 1 - 26
  • [34] A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan
    Shahzad, Naeem
    Ding, Xiaoli
    Abbas, Sawaid
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [35] A Machine Learning Approach to Identify Phishing Websites: A Comparative Study of Classification Models and Ensemble Learning Techniques
    Gontla, Bhogesh Karthik
    Gundu, Priyanka
    Uppalapati, Padma Jyothi
    Narasimharao, Kandula
    Hussain, S. Mahaboob
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (05) : 1 - 9
  • [36] Using the rotation and random forest models of ensemble learning to predict landslide susceptibility
    Zhao, Lingran
    Wu, Xueling
    Niu, Ruiqing
    Wang, Ying
    Zhang, Kaixiang
    GEOMATICS NATURAL HAZARDS & RISK, 2020, 11 (01) : 1542 - 1564
  • [37] Predictive analysis by ensemble classifier with machine learning models
    Chaya J.D.
    Usha R.N.
    International Journal of Computers and Applications, 2023, 45 (01) : 19 - 26
  • [38] A Comparative Study of Machine Learning and Deep Learning Models for Microplastic Classification using FTIR Spectra
    Thar, Aeint Shune
    Laitrakun, Seksan
    Deepaisarn, Somrudee
    Opaprakasit, Pakorn
    Somnuake, Pattara
    Athikulwongse, Krit
    2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP, 2023,
  • [39] An examination of daily CO2 emissions prediction through a comparative analysis of machine learning, deep learning, and statistical models
    Adewole Adetoro Ajala
    Oluwatosin Lawrence Adeoye
    Olawale Moshood Salami
    Ayoola Yusuf Jimoh
    Environmental Science and Pollution Research, 2025, 32 (5) : 2510 - 2535
  • [40] A Comparative Study on Deep Learning and Machine Learning Models for Human Action Recognition in Aerial Videos
    Kapoor, Surbhi
    Sharma, Akashdeep
    Verma, Amandeep
    Dhull, Vishal
    Goyal, Chahat
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (04) : 567 - 574