Spatial prediction of soil properties using random forest, k-nearest neighbors and cubist approaches in the foothills of the Ural Mountains, Russia

被引:23
|
作者
Suleymanov, Azamat [1 ,2 ]
Tuktarova, Irina [1 ]
Belan, Larisa [1 ]
Suleymanov, Ruslan [2 ,3 ]
Gabbasova, Ilyusya [2 ,3 ]
Araslanova, Lyasan [1 ]
机构
[1] Ufa State Petr Technol Univ, Dept Environm Protect & Prudent Exploitat Nat Reso, Ufa 450064, Russia
[2] Russian Acad Sci, Ufa Inst Biol, Ufa Fed Res Ctr, Lab Soil Sci, Ufa 450054, Russia
[3] Ufa State Petr Technol Univ, Lab Climate Change Monitoring & Carbon Ecosyst Bal, Ufa 450064, Russia
关键词
Digital soil mapping; Soil organic matter; Soil organic carbon; pH; Machine learning; ORGANIC-CARBON STOCKS; AFFORESTATION; ATTRIBUTES; DENSITY; MATTER; REGION; MAP;
D O I
10.1007/s40808-023-01723-4
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Soil organic matter (SOM) and pH are important indicators to evaluate soil quality. This study applied several machine learning (ML) techniques, namely Random Forest (RF), k-Nearest Neighbor method (kNN) and Cubist, to digital map SOM and pH (H2O and KCl) contents in the foothills of the Ural Mountains, Russia. For this purpose, a total of 52 soil samples were collected from the topsoil depth (0-20 cm). The environmental variables were derived from a digital elevation model (DEM), satellite imagery (Sentinel-2A) and land use/land cover (LULC) map. The ML models were calibrated and validated by the leave-one-out cross-validation approach. The coefficient of determination (R-2) and root-mean-square error (RMSE) were used to determine the ML model performance. According to the R-2 and RMSE metrics, Cubist method resulted in the most accurate spatial prediction for SOM (R-2 = 0.64 and RMSE = 1.95), while RF approach showed the highest performance to predict pH H2O and KCl (R-2 = 0.49; RMSE = 0.45 and R-2 = 0.44; RMSE = 0.61, respectively). Results showed that remote sensing data were the key variables to explain the variability of all soil properties. Sentinel-2A bands B8A, B7 and B8 were the most effective covariates in predicting SOM, whereas spectral indices GNDVI and SAVI explained most of the spatial distribution of both soil pH. According to the generated maps, the highest SOM concentrations (4-8%) were found under the forest and especially at the bottom of the slopes, which is consistent with favorable conditions of organic carbon accumulation and its redeposition under the influence of water erosion. More acidic soils (pH H2O < 6) were also located under the forest, consisting of mixed coniferous broad-leaved species, which also affects the soil acidity. This study confirms the necessity to use various ML techniques to predict individual soil properties. Overall, our findings and generated maps can provide useful information for future digital soil mapping of areas in similar climatic and geographic conditions.
引用
收藏
页码:3461 / 3471
页数:11
相关论文
共 50 条
  • [1] Spatial prediction of soil properties using random forest, k-nearest neighbors and cubist approaches in the foothills of the Ural Mountains, Russia
    Azamat Suleymanov
    Irina Tuktarova
    Larisa Belan
    Ruslan Suleymanov
    Ilyusya Gabbasova
    Lyasan Araslanova
    Modeling Earth Systems and Environment, 2023, 9 : 3461 - 3471
  • [2] Prediction of Beef Production Using Linear Regression, Random Forest and k-Nearest Neighbors Algorithms
    Yildiz, Berkant Ismail
    Karabag, Kemal
    KSU TARIM VE DOGA DERGISI-KSU JOURNAL OF AGRICULTURE AND NATURE, 2025, 28 (01): : 247 - 255
  • [3] k-nearest neighbors prediction and classification for spatial data
    Mohamed-Salem Ahmed
    Mamadou N’diaye
    Mohammed Kadi Attouch
    Sophie Dabo-Niange
    Journal of Spatial Econometrics, 2023, 4 (1):
  • [4] ANALYSIS OF CUSTOMER CHURN PREDICTION USING LOGISTIC REGRESSION, k-NEAREST NEIGHBORS, DECISION TREE AND RANDOM FOREST ALGORITHMS
    Atay, Mehmet Tarik
    Turanli, Munevver
    ADVANCES AND APPLICATIONS IN STATISTICS, 2025, 92 (02) : 147 - 169
  • [5] A Placement Prediction System Using K-Nearest Neighbors Classifier
    Giri, Animesh
    Bhagavath, M. Vignesh V.
    Pruthvi, Bysani
    Dubey, Naini
    2016 SECOND INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2016,
  • [6] Weather Prediction and Classification Using Neural Networks and k-Nearest Neighbors
    Mantri, Rhea
    Raghavendra, Kulkarni Rakshit
    Puri, Harshita
    Chaudhary, Jhanavi
    Bingi, Kishore
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 263 - 268
  • [7] Low visibility event prediction using random forest and K-nearest neighbor methods
    Saleh H. Alhathloul
    Ashok K. Mishra
    Abdul A. Khan
    Theoretical and Applied Climatology, 2024, 155 : 1289 - 1300
  • [8] Low visibility event prediction using random forest and K-nearest neighbor methods
    Alhathloul, Saleh H.
    Mishra, Ashok K.
    Khan, Abdul A.
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (02) : 1289 - 1300
  • [9] Skin Disease Classification: A Comparative Analysis of K-Nearest Neighbors (KNN) and Random Forest Algorithm
    Pal, Osim Kumar
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND INFORMATION TECHNOLOGY 2021 (ICECIT 2021), 2021,
  • [10] Prediction of Heart Disease using Forest Algorithm over K-nearest neighbors using Machine Learning with Improved Accuracy
    Raj, K. N. S. Shanmukha
    Thinakaran, K.
    CARDIOMETRY, 2022, (25): : 1500 - 1506