Predicting measures of soil health using the microbiome and supervised machine learning

被引:55
|
作者
Wilhelm, Roland C. [1 ]
van Es, Harold M. [1 ]
Buckley, Daniel H. [1 ]
机构
[1] Cornell Univ, Sch Integrat Plant Sci, Bradfield Hall, Ithaca, NY 14853 USA
来源
基金
美国农业部; 美国能源部;
关键词
Soil health monitoring; Soil microbiome; Supervised machine learning; 16S rRNA gene; Agriculture; BACTERIAL COMMUNITY STRUCTURE; QUALITY ASSESSMENT; UNIQUE;
D O I
10.1016/j.soilbio.2021.108472
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Soil health encompasses a range of biological, chemical, and physical soil properties that sustain the commercial and ecological value of agroecosystems. Monitoring soil health requires a comprehensive set of diagnostics that can be cost-prohibitive for routine analyses. The soil microbiome provides a rich source of information about soil properties, which can be assayed in a high-throughput, cost-effective way. We evaluated the accuracy of random forest (RF) and support vector machine (SVM) regression and classification models in predicting 12 measures of soil health, tillage status, and soil texture from 16S rRNA gene amplicon data with an operationally relevant sample set. We validated the efficacy of the best performing models against independent datasets and also tested best practices for processing microbiome data for use in machine learning. Soil health metrics could be predicted from microbiome data with the best models achieving a Kappa value of -0.65, for categorical assessments, and a R2 value of -0.8, for numerical scores. Biological health ratings were better predicted than chemical or physical ratings. Validation with independent datasets revealed that models had general predictive value for soil properties, including yield. The ecological profiles of several taxa important for model accuracy matched the observed relationships with soil health, including Pyrinomonadaceae, Nitrososphaeraceae, and Candidatus Udeaobacter. Models trained at the highest taxonomic resolution proved most accurate, with losses in accuracy resulting from rarefying, sparsity filtering, and aggregating at higher taxonomic ranks. Our study provides the groundwork for developing scalable technology to use microbiome-based diagnostics for the assessment of soil health.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Predicting cancer using supervised machine learning: Mesothelioma
    Choudhury, Avishek
    [J]. TECHNOLOGY AND HEALTH CARE, 2021, 29 (01) : 45 - 58
  • [2] Predicting cash holdings using supervised machine learning algorithms
    Ozlem, Sirin
    Tan, Omer Faruk
    [J]. FINANCIAL INNOVATION, 2022, 8 (01)
  • [3] Predicting cash holdings using supervised machine learning algorithms
    Şirin Özlem
    Omer Faruk Tan
    [J]. Financial Innovation, 8
  • [4] Predicting the Political Polarity of Tweets Using Supervised Machine Learning
    Voong, Michelle
    Gunda, Keerthana
    Gokhale, Swapna S.
    [J]. 2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1707 - 1712
  • [5] Predicting survival of pancreatic cancer using supervised machine learning
    Osman, M. H.
    [J]. ANNALS OF ONCOLOGY, 2018, 29
  • [6] Predicting declining and growing occupations using supervised machine learning
    Khalaf, Christelle
    Michaud, Gilbert
    Jolley, G. Jason
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (02): : 757 - 780
  • [7] A framework for predicting academic orientation using supervised machine learning
    El Mrabet H.
    Ait Moussa A.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (12) : 16539 - 16549
  • [8] Predicting tax fraud using supervised machine learning approach
    Murorunkwere, Belle Fille
    Haughton, Dominique
    Nzabanita, Joseph
    Kipkogei, Francis
    Kabano, Ignace
    [J]. AFRICAN JOURNAL OF SCIENCE TECHNOLOGY INNOVATION & DEVELOPMENT, 2023, 15 (06): : 731 - 742
  • [9] Predicting declining and growing occupations using supervised machine learning
    Christelle Khalaf
    Gilbert Michaud
    G. Jason Jolley
    [J]. Journal of Computational Social Science, 2023, 6 : 757 - 780
  • [10] Predicting agricultural soil carbon using machine learning
    Nguyen, Thu Thuy
    [J]. NATURE REVIEWS EARTH & ENVIRONMENT, 2021, 2 (12) : 825 - 825