Landslide Susceptibility Prediction Considering Spatio-Temporal Division Principle of Training/Testing Datasets in Machine Learning Models

被引:0
|
作者
Huang, Faming [1 ,2 ]
Ouyang, Weiping [1 ]
Jiang, Shuihua [1 ]
Fan, Xuanmei [2 ]
Lian, Zhipeng [3 ]
Zhou, Chuangbing [1 ]
机构
[1] School of Infrastructure Engineering, Nanchang University, Nanchang,330031, China
[2] State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu,610059, China
[3] Wuhan Center, China Geological Survey, Wuhan,430205, China
关键词
Environmental factors - Landslide susceptibility - Machine learning models - Multilayers perceptrons - Prediction modelling - Spatial datasets - Support vectors machine - Times series - Training/testing dataset - Uncertainty;
D O I
10.3799/dqkx.2022.357
中图分类号
学科分类号
摘要
In most of the landslide susceptibility prediction (LSP) models, the landslide-non landslide spatial datasets are divided into training/testing datasets according to the principle of spatial random, however, this spatial randomness division inevitably introduces uncertainties into LSP modelling. Theoretically, LSP modelling is based on past landslide inventories to predict the spatial probability of future landslides, which has significant time series characteristics rather than only spatial random characteristics. Therefore, we believe that it is necessary to divide spatial datasets into the model training/testing datasets based on the time series of landslide occurrence. Taking Wencheng County in China as an example, 11 types of environmental factors and 128 time-accurate landslides are obtained; Then, the landslide and non-landslide samples connected with environmental factors are divided into two different types of training/testing datasets according to the principles of landslide time series and spatial random, respectively. The division ratios of training/testing datasets are set as 9∶1, 8∶2, 7∶3, 6∶4 and 5∶5, respectively, to avoid the influences of different ratios on the LSP results. Thus, the training/testing datasets under 10 combined working conditions are obtained. Finally, several typical machine learning models, such as Support Vector Machine (SVM), Multi-Layer Perceptron (MLP) and Random Forest (RF), are then trained and tested to perform LSP and analyze their uncertainties. Results show that: (1) The LSP uncertainties performed by the time series-based SVM, MLP and RF models are slightly lower than those by spatial random-based models, which verifies the feasibility of dividing by time series; (2) The time series division of training/testing datasets is actually adeterministiccase among the spatial random division, which is more consistent with the actual situation of landslides. Of course, it is also feasible to carry out spatial random division for training and testing datasets when lacking landslide occurrence time. © 2024 China University of Geosciences. All rights reserved.
引用
收藏
页码:1607 / 1618
相关论文
共 50 条
  • [11] Landslide susceptibility prediction using slope unit-based machine learning models considering the heterogeneity of conditioning factors
    Chang, Zhilu
    Catani, Filippo
    Huang, Faming
    Liu, Gengzhe
    Meena, Sansar Raj
    Huang, Jinsong
    Zhou, Chuangbing
    [J]. JOURNAL OF ROCK MECHANICS AND GEOTECHNICAL ENGINEERING, 2023, 15 (05) : 1127 - 1143
  • [12] Uncertainties of landslide susceptibility prediction due to different spatial resolutions and different proportions of training and testing datasets
    Huang F.
    Chen J.
    Tang Z.
    Fan X.
    Huang J.
    Zhou C.
    Chang Z.
    [J]. Yanshilixue Yu Gongcheng Xuebao/Chinese Journal of Rock Mechanics and Engineering, 2021, 40 (06): : 1155 - 1169
  • [13] Spatio-Temporal Prediction of the Epidemic Spread of Dangerous Pathogens Using Machine Learning Methods
    Hamer, Wolfgang B.
    Birr, Tim
    Verreet, Joseph-Alexander
    Duttmann, Rainer
    Klink, Holger
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (01)
  • [14] Comparative Presentation of Machine Learning Algorithms in Flood Prediction Using Spatio-Temporal Data
    Jangyodsuk, Piraporn
    Seo, Dong-Jun
    Elmasri, Ramez
    Gao, Jean
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2016, 386 : 1015 - 1023
  • [15] Effects of Different Training Datasets on Machine Learning Models for Pavement Performance Prediction
    Aranha, Ana Luisa
    Bernucci, Liedi Legi Bariani
    Vasconcelos, Kamilla L.
    [J]. TRANSPORTATION RESEARCH RECORD, 2023, 2677 (08) : 196 - 206
  • [16] A Bayesian machine learning approach for spatio-temporal prediction of COVID-19 cases
    Niraula, Poshan
    Mateu, Jorge
    Chaudhuri, Somnath
    [J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (08) : 2265 - 2283
  • [17] A Bayesian machine learning approach for spatio-temporal prediction of COVID-19 cases
    Poshan Niraula
    Jorge Mateu
    Somnath Chaudhuri
    [J]. Stochastic Environmental Research and Risk Assessment, 2022, 36 : 2265 - 2283
  • [18] Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies
    Zhang, Zhengchao
    Li, Meng
    Lin, Xi
    Wang, Yinhai
    He, Fang
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 105 : 297 - 322
  • [19] Comparisons of heuristic, general statistical and machine learning models for landslide susceptibility prediction and mapping
    Huang, Faming
    Cao, Zhongshan
    Guo, Jianfei
    Jiang, Shui-Hua
    Li, Shu
    Guo, Zizheng
    [J]. CATENA, 2020, 191
  • [20] Spatio-Temporal Abnormal Behavior Prediction in Elderly Persons Using Deep Learning Models
    Zerkouk, Meriem
    Chikhaoui, Belkacem
    [J]. SENSORS, 2020, 20 (08)