Robustness of Optimized Decision Tree-Based Machine Learning Models to Map Gully Erosion Vulnerability

被引:9
|
作者
Eloudi, Hasna [1 ]
Hssaisoune, Mohammed [1 ,2 ,3 ]
Reddad, Hanane [4 ]
Namous, Mustapha [5 ]
Ismaili, Maryem [5 ]
Krimissa, Samira [5 ]
Ouayah, Mustapha [5 ]
Bouchaou, Lhoussaine [1 ,3 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Appl Geol & Geoenvironm Lab, Agadir 80000, Morocco
[2] Ibn Zohr Univ, Fac Appl Sci, Ait Melloul 86150, Morocco
[3] Mohammed VI Polytech Univ, Int Water Res Inst, Ben Guerir 43150, Morocco
[4] Sultan Moulay Slimane Univ, Ecole Super Technol Beni Mellal, Lab Ingn & Technol Appl LITA, Beni Mellal 23000, Morocco
[5] Sultan Moulay Slimane Univ, Data Sci Sustainable Earth Lab Data4Earth, Beni Mellal 23000, Morocco
关键词
soil erosion; inventory data; performance; robustness; spatial prediction; LANDSLIDE SUSCEPTIBILITY ASSESSMENT; SOIL-EROSION; LOGISTIC-REGRESSION; SEDIMENT YIELD; CLIMATE-CHANGE; WATER EROSION; SLOPE ASPECT; HIGH-ATLAS; CLASSIFICATION; VEGETATION;
D O I
10.3390/soilsystems7020050
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Gully erosion is a worldwide threat with numerous environmental, social, and economic impacts. The purpose of this research is to evaluate the performance and robustness of six machine learning ensemble models based on the decision tree principle: Random Forest (RF), C5.0, XGBoost, treebag, Gradient Boosting Machines (GBMs) and Adaboost, in order to map and predict gully erosion-prone areas in a semi-arid mountain context. The first step was to prepare the inventory data, which consisted of 217 gully points. This database was then randomly subdivided into five percentages of Train/Test (50/50, 60/40, 70/30, 80/20, and 90/10) to assess the stability and robustness of the models. Furthermore, 17 geo-environmental variables were used as potential controlling factors, and several metrics were examined to evaluate the performance of the six models. The results revealed that all of the models used performed well in terms of predicting vulnerability to gully erosion. The C5.0 and RF models had the best prediction performance (AUC = 90.8 and AUC = 90.1, respectively). However, according to the random subdivisions of the database, these models exhibit small but noticeable instability, with high performance for the 80/20% and 70/30% subdivisions. This demonstrates the significance of database refining and the need to test various splitting data in order to ensure efficient and reliable output results.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Modelling 5G Data Using Tree-Based Machine Learning Models
    Kumar, P. Mithillesh
    Supriya, M.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 81 - 90
  • [42] Spatial Mapping of Flood Susceptibility Using Decision Tree-Based Machine Learning Models for the Vembanad Lake System in Kerala, India
    Sundar, Parthasarathy Kulithalai Shiyam
    Kundapura, Subrahmanya
    JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2023, 149 (10)
  • [43] Predicting surgical decision-making in vestibular schwannoma using tree-based machine learning
    Gadot, Ron
    Anand, Adrish
    Lovin, Benjamin D.
    Sweeney, Alex D.
    Patel, Akash J.
    NEUROSURGICAL FOCUS, 2022, 52 (04)
  • [44] Optimized Intrusion Detection for IoMT Networks with Tree-Based Machine Learning and Filter-Based Feature Selection
    Balhareth, Ghaida
    Ilyas, Mohammad
    SENSORS, 2024, 24 (17)
  • [45] Salivary metabolomics with alternative decision tree-based machine learning methods for breast cancer discrimination
    Murata, Takeshi
    Yanagisawa, Takako
    Kurihara, Toshiaki
    Kaneko, Miku
    Ota, Sana
    Enomoto, Ayame
    Tomita, Masaru
    Sugimoto, Masahiro
    Sunamura, Makoto
    Hayashida, Tetsu
    Kitagawa, Yuko
    Jinno, Hiromitsu
    BREAST CANCER RESEARCH AND TREATMENT, 2019, 177 (03) : 591 - 601
  • [46] Salivary metabolomics with alternative decision tree-based machine learning methods for breast cancer discrimination
    Takeshi Murata
    Takako Yanagisawa
    Toshiaki Kurihara
    Miku Kaneko
    Sana Ota
    Ayame Enomoto
    Masaru Tomita
    Masahiro Sugimoto
    Makoto Sunamura
    Tetsu Hayashida
    Yuko Kitagawa
    Hiromitsu Jinno
    Breast Cancer Research and Treatment, 2019, 177 : 591 - 601
  • [47] Robustness analysis of machine learning classifiers in predicting spatial gully erosion susceptibility with altered training samples
    Hembram, Tusar Kanti
    Saha, Sunil
    Pradhan, Biswajeet
    Maulud, Khairul Nizam Abdul
    Alamri, Abdullah M.
    GEOMATICS NATURAL HAZARDS & RISK, 2021, 12 (01) : 794 - 828
  • [48] Compressive strength prediction of PET fiber-reinforced concrete using Dolphin echolocation optimized decision tree-based machine learning algorithms
    Parhi S.K.
    Patro S.K.
    Asian Journal of Civil Engineering, 2024, 25 (1) : 977 - 996
  • [49] Subsidence risk assessment based on a novel hybrid form of a tree-based machine learning algorithm and an index model of vulnerability
    Mohebbi Tafreshi, Ghazaleh
    Nakhaei, Mohammad
    Lak, Razyeh
    GEOCARTO INTERNATIONAL, 2022, 37 (10) : 2842 - 2870
  • [50] Gully Erosion Susceptibility Mapping in Highly Complex Terrain Using Machine Learning Models
    Yang, Annan
    Wang, Chunmei
    Pang, Guowei
    Long, Yongqing
    Wang, Lei
    Cruse, Richard M.
    Yang, Qinke
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (10)