Robustness of Optimized Decision Tree-Based Machine Learning Models to Map Gully Erosion Vulnerability

被引:5
|
作者
Eloudi, Hasna [1 ]
Hssaisoune, Mohammed [1 ,2 ,3 ]
Reddad, Hanane [4 ]
Namous, Mustapha [5 ]
Ismaili, Maryem [5 ]
Krimissa, Samira [5 ]
Ouayah, Mustapha [5 ]
Bouchaou, Lhoussaine [1 ,3 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Appl Geol & Geoenvironm Lab, Agadir 80000, Morocco
[2] Ibn Zohr Univ, Fac Appl Sci, Ait Melloul 86150, Morocco
[3] Mohammed VI Polytech Univ, Int Water Res Inst, Ben Guerir 43150, Morocco
[4] Sultan Moulay Slimane Univ, Ecole Super Technol Beni Mellal, Lab Ingn & Technol Appl LITA, Beni Mellal 23000, Morocco
[5] Sultan Moulay Slimane Univ, Data Sci Sustainable Earth Lab Data4Earth, Beni Mellal 23000, Morocco
关键词
soil erosion; inventory data; performance; robustness; spatial prediction; LANDSLIDE SUSCEPTIBILITY ASSESSMENT; SOIL-EROSION; LOGISTIC-REGRESSION; SEDIMENT YIELD; CLIMATE-CHANGE; WATER EROSION; SLOPE ASPECT; HIGH-ATLAS; CLASSIFICATION; VEGETATION;
D O I
10.3390/soilsystems7020050
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Gully erosion is a worldwide threat with numerous environmental, social, and economic impacts. The purpose of this research is to evaluate the performance and robustness of six machine learning ensemble models based on the decision tree principle: Random Forest (RF), C5.0, XGBoost, treebag, Gradient Boosting Machines (GBMs) and Adaboost, in order to map and predict gully erosion-prone areas in a semi-arid mountain context. The first step was to prepare the inventory data, which consisted of 217 gully points. This database was then randomly subdivided into five percentages of Train/Test (50/50, 60/40, 70/30, 80/20, and 90/10) to assess the stability and robustness of the models. Furthermore, 17 geo-environmental variables were used as potential controlling factors, and several metrics were examined to evaluate the performance of the six models. The results revealed that all of the models used performed well in terms of predicting vulnerability to gully erosion. The C5.0 and RF models had the best prediction performance (AUC = 90.8 and AUC = 90.1, respectively). However, according to the random subdivisions of the database, these models exhibit small but noticeable instability, with high performance for the 80/20% and 70/30% subdivisions. This demonstrates the significance of database refining and the need to test various splitting data in order to ensure efficient and reliable output results.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Assessment of flood susceptibility prediction based on optimized tree-based machine learning models
    Eslaminezhad, Seyed Ahmad
    Eftekhari, Mobin
    Azma, Aliasghar
    Kiyanfar, Ramin
    Akbari, Mohammad
    [J]. JOURNAL OF WATER AND CLIMATE CHANGE, 2022, 13 (06) : 2353 - 2385
  • [2] Robustness Verification of Tree-based Models
    Chen, Hongge
    Zhang, Huan
    Si, Si
    Li, Yang
    Boning, Duane
    Hsieh, Cho-Jui
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Runtime Optimizations for Tree-based Machine Learning Models
    Asadi, Nima
    Lin, Jimmy
    de Vries, Arjen P.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2281 - 2292
  • [4] Stock Market Decision Support Modeling with Tree-Based Adaboost Ensemble Machine Learning Models
    Ampomah, Ernest Kwame
    Qin, Zhiguang
    Nyame, Gabriel
    Botchey, Francis Effirm
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2020, 44 (04): : 477 - 490
  • [5] Stock market decision support modeling with tree-based adaboost ensemble machine learning models
    Ampomah, Ernest Kwame
    Qin, Zhiguang
    Nyame, Gabriel
    Botchey, Francis Effirm
    [J]. Informatica (Slovenia), 2020, 44 (04): : 477 - 489
  • [6] Comparison of machine learning models for gully erosion susceptibility mapping
    Alireza Arabameri
    Wei Chen
    Marco Loche
    Xia Zhao
    Yang Li
    Luigi Lombardo
    Artemi Cerda
    Biswajeet Pradhan
    Dieu Tien Bui
    [J]. Geoscience Frontiers, 2020, 11 (05) : 1609 - 1620
  • [7] Comparison of machine learning models for gully erosion susceptibility mapping
    Alireza Arabameri
    Wei Chen
    Marco Loche
    Xia Zhao
    Yang Li
    Luigi Lombardo
    Artemi Cerda
    Biswajeet Pradhan
    Dieu Tien Bui
    [J]. Geoscience Frontiers . , 2020, (05) - 1620
  • [8] Comparison of machine learning models for gully erosion susceptibility mapping
    Arabameri, Alireza
    Chen, Wei
    Loche, Marco
    Zhao, Xia
    Li, Yang
    Lombardo, Luigi
    Cerda, Artemi
    Pradhan, Biswajeet
    Dieu Tien Bui
    [J]. GEOSCIENCE FRONTIERS, 2020, 11 (05) : 1609 - 1620
  • [9] Evaluating the effectiveness and robustness of machine learning models with varied geo-environmental factors for determining vulnerability to water flow-induced gully erosion
    Aboutaib, Fatima
    Krimissa, Samira
    Pradhan, Biswajeet
    Elaloui, Abdenbi
    Ismaili, Maryem
    Abdelrahman, Kamal
    Eloudi, Hasna
    Ouayah, Mustapha
    Ourribane, Malika
    Namous, Mustapha
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2023, 11
  • [10] Hybrid decision tree-based machine learning models for short-term water quality prediction
    Lu, Hongfang
    Ma, Xin
    [J]. CHEMOSPHERE, 2020, 249