Evaluation of re-sampling methods on performance of machine learning models to predict landslide susceptibility

被引:15
|
作者
Hassangavyar, Moslem Borji [1 ]
Damaneh, Hadi Eskandari [2 ]
Pham, Quoc Bao [3 ,4 ]
Linh, Nguyen Thi Thuy [5 ]
Tiefenbacher, John [6 ]
Bach, Quang-Vu [7 ]
机构
[1] Univ Tehran, Fac Nat Resources, Dept Arid & Mountainous Reg Reclamat, Tehran, Iran
[2] Univ Hormozgan, Fac Nat Resources, Bandar Abbas, Iran
[3] Duy Tan Univ, Inst Res & Dev, Danang, Vietnam
[4] Duy Tan Univ, Fac Environm & Chem Engn, Danang, Vietnam
[5] Thuyloi Univ, Fac Water Resource Engn, Hanoi, Vietnam
[6] Texas State Univ, Dept Geog, San Marcos, TX USA
[7] Ton Thang Univ, Fac Environm & Labour Safety, Sustainable Management Nat Resources & Environm R, Ho Chi Minh City, Vietnam
关键词
Resampling approach; machine learning; landslide; prediction; REGRESSION; BIVARIATE; AREA;
D O I
10.1080/10106049.2020.1837257
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study tests the applicability of three resampling methods (i.e. bootstrapping, random-subsampling and cross-validation) for enhancing the performance of eight machine-learning models: boosted regression trees, flexible discriminant analysis, random forests, mixture discriminate analysis, multivariate adaptive regression splines, classification and regression trees, support vector machines and generalized linear models, compared to the use of the original data. The results of models were evaluated using correlation (COR), area under curve (AUC), true skill statistic (TSS), receiver-operating characteristic and the probability of detection (POD). The evaluation showed that the bootstrapping technique improved the performance of all models. The Bootstrapping-random forest (with COR = 0.75, AUC = 0.92, TSS = 0.80 and POD = 0.98) proved to be the best model for landslide prediction. Among the 18 contributing factors, distance from fault, curvature and precipitation were the most influential in all 32 models .
引用
收藏
页码:2772 / 2794
页数:23
相关论文
共 50 条
  • [1] Effects of non-landslide sampling strategies on machine learning models in landslide susceptibility mapping
    Gu, Tengfei
    Duan, Ping
    Wang, Mingguo
    Li, Jia
    Zhang, Yanke
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [2] Evaluating Landslide Susceptibility Using Sampling Methodology and Multiple Machine Learning Models
    Song, Yingze
    Yang, Degang
    Wu, Weicheng
    Zhang, Xin
    Zhou, Jie
    Tian, Zhaoxu
    Wang, Chencan
    Song, Yingxu
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (05)
  • [3] Impact of sampling for landslide susceptibility assessment using interpretable machine learning models
    Wu, Bin
    Shi, Zhenming
    Zheng, Hongchao
    Peng, Ming
    Meng, Shaoqiang
    [J]. Bulletin of Engineering Geology and the Environment, 2024, 83 (11)
  • [4] EVALUATION OF PREDICTION METHODS IN MULTIVARIATE REGRESSION - A RE-SAMPLING APPROACH
    SPARKS, SS
    [J]. SOUTH AFRICAN STATISTICAL JOURNAL, 1985, 19 (02) : 147 - 147
  • [5] Reservoir Landslide Susceptibility Prediction Considering Non-Landslide Sampling and Ensemble Machine Learning Methods
    Wang, Yue
    Cao, Ying
    Xu, Fangdang
    Zhou, Chao
    Yu, Lanbing
    Wu, Lixing
    Wang, Yang
    Yin, Kunlong
    [J]. Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2024, 49 (05): : 1619 - 1635
  • [6] ACTIVE LEARNING WITH RE-SAMPLING FOR SUPPORT VECTOR MACHINE IN PERSON RE-IDENTIFICATION
    Xiang, Jin-Peng
    Bai, Yang
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 597 - 602
  • [7] Landslide Susceptibility Mapping Methods Coupling with Statistical Methods, Machine Learning Models and Clustering Algorithms
    Wang, Qisheng
    Xiong, Junnan
    Cheng, Weiming
    Cui, Xingjie
    Pang, Quan
    Liu, Jun
    Chen, Wenjie
    Tang, Haoran
    Song, Nanxiao
    [J]. Journal of Geo-Information Science, 2024, 26 (03) : 620 - 637
  • [8] Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance
    Merghadi, Abdelaziz
    Yunus, Ali P.
    Dou, Jie
    Whiteley, Jim
    Binh ThaiPham
    Dieu Tien Bui
    Avtar, Ram
    Abderrahmane, Boumezbeur
    [J]. EARTH-SCIENCE REVIEWS, 2020, 207
  • [9] A study of re-sampling methods with regression modeling
    Hossain, MA
    Woodburn, RL
    [J]. DATA MINING III, 2002, 6 : 83 - 91
  • [10] A Novel Strategy Coupling Optimised Sampling with Heterogeneous Ensemble Machine-Learning to Predict Landslide Susceptibility
    Lu, Yongxing
    Xu, Honggen
    Wang, Can
    Yan, Guanxi
    Huo, Zhitao
    Peng, Zuwu
    Liu, Bo
    Xu, Chong
    [J]. Remote Sensing, 2024, 16 (19)