Experimental study and Random Forest prediction model of microbiome cell surface hydrophobicity

被引:24
|
作者
Liu, Yong [1 ,2 ,3 ]
Tang, Shaoxun [1 ,2 ]
Fernandez-Lozano, Carlos [3 ]
Munteanu, Cristian R. [3 ]
Pazos, Alejandro [3 ,4 ]
Yu, Yi-zun [5 ]
Tan, Zhiliang [1 ,2 ]
Gonzalez-Diaz, Humberto [6 ,7 ]
机构
[1] Chinese Acad Sci, Key Lab Agroecol Proc Subtrop Reg, Changsha 410125, Hunan, Peoples R China
[2] Chinese Acad Sci, Hunan Coinnovat Ctr Anim Prod Safety, CICAPS, Inst Subtrop Agr, Changsha 410125, Hunan, Peoples R China
[3] Univ A Coruna, Fac Comp Sci, Campus Elvina S-N, La Coruna 15071, Spain
[4] CHUAC, Inst Invest Biomed A Coruna INIBIC, La Coruna 15006, Spain
[5] Jiangxi Acad Sci, Inst Biol Resources, Nanchang 330096, Jiangxi, Peoples R China
[6] Univ Basque Country UPV EHU, Dept Organ Chem 2, Leioa 48940, Spain
[7] Basque Fdn Sci, Ikerbasque, Bilbao 48011, Spain
基金
中国国家自然科学基金;
关键词
Machine learning; Expected values; Moving averages; Cell properties; Perturbation theory; Time series analysis; STAPHYLOCOCCUS-EPIDERMIDIS; ESCHERICHIA-COLI; ADHESION; ADHERENCE; BACTERIA; SELECTION; CHARGE; BIOMATERIALS; HYDROCARBON; REGRESSION;
D O I
10.1016/j.eswa.2016.10.058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The cell surface hydrophobicity (CSH) is an assessable physicochemical property used to evaluate the microbial adhesion to the surface of biomaterials, which is an essential step in the microbial biofilm formation and pathogenesis. For the present in vitro fermentation experiment, the CSH of ruminal mixed microbes was considered, along with other data records of pH, ammonia-nitrogen concentration, and neutral detergent fibre digestibility, conditions of surface tension and specific surface area in two different time scales. A dataset of 170,707 perturbations of input variables, grouped into two blocks of data, was constructed. Next, Expected Measurement Moving Average - Machine Learning (EMMA-ML) models were developed in order to predict CSH after perturbations,of all input variables. EMMA-ML is a Perturbation Theory method that combines the ideas of Expected Measurement, Box Jenkins Operators/Moving Average, and Time Series Analysis. Seven regression methods have been tested: Multiple Linear regression, Generalized Linear Model with Stepwise Feature Selection, Partial Least Squares regression, Lasso regression, Elastic Net regression, Neural Networks regression, and Random Forests (RF). The best regression performance has been obtained with RF (EMMA-RF model) with an R-squared of 0.992. The model analysis has shown that CSH values were highly dependent on the in vitro fermentation parameters of detergent fibre digestibility, ammonia - nitrogen concentration, and the expected values of cell surface hydrophobicity in the first time scale. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:306 / 316
页数:11
相关论文
共 50 条
  • [1] Optimized Random Forest Model for Remaining Useful Life Prediction of Experimental Bearings
    Alfarizi, Muhammad Gibran
    Tajiani, Bahareh
    Vatn, Jorn
    Yin, Shen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (06) : 7771 - 7779
  • [2] Signatures of Mucosal Microbiome in Oral Squamous Cell Carcinoma Identified Using a Random Forest Model
    Zhou, Jianhua
    Wang, Lili
    Yuan, Rongtao
    Yu, Xinjuan
    Chen, Zhenggang
    Yang, Fang
    Sun, Guirong
    Dong, Quanjiang
    CANCER MANAGEMENT AND RESEARCH, 2020, 12 : 5353 - 5363
  • [3] A Bankruptcy Prediction Model Using Random Forest
    Joshi, Shreya
    Ramesh, Rachana
    Tahsildar, Shagufta
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1722 - 1727
  • [4] Experimental study of population density using an optimized random forest model
    LI Lingling
    LIU Jinsong
    LI Zhi
    WEN Peizhang
    LI Yancheng
    LIU Yi
    Journal of Geographical Sciences, 2024, 34 (08) : 1636 - 1656
  • [5] Experimental study of population density using an optimized random forest model
    Li L.
    Liu J.
    Li Z.
    Wen P.
    Li Y.
    Liu Y.
    Dili Xuebao/Acta Geographica Sinica, 2023, 78 (05): : 1304 - 1320
  • [6] Experimental study of population density using an optimized random forest model
    Li, Lingling
    Liu, Jinsong
    Li, Zhi
    Wen, Peizhang
    Li, Yancheng
    Liu, Yi
    JOURNAL OF GEOGRAPHICAL SCIENCES, 2024, 34 (08) : 1636 - 1656
  • [7] Comparison of Random Survival Forest and Cox Model for Prediction Performance: A Case Study
    Oliveira, Tiago A.
    Silva, Pedro Augusto F.
    Martins, Hiago Jose A. A.
    Pereira, Lucas C.
    Brito, Alisson L.
    Mendonca, Edndrio B.
    SIGMAE, 2019, 8 (02): : 490 - 508
  • [8] Study of cardiovascular disease prediction model based on random forest in eastern China
    Li Yang
    Haibin Wu
    Xiaoqing Jin
    Pinpin Zheng
    Shiyun Hu
    Xiaoling Xu
    Wei Yu
    Jing Yan
    Scientific Reports, 10
  • [9] Study on prediction model of liquid hold up based on random forest algorithm
    Liu, Jianyi
    Jiang, Lu
    Chen, Yizhao
    Liu, Zhibin
    Yuan, Hua
    Wen, Yimin
    CHEMICAL ENGINEERING SCIENCE, 2023, 268
  • [10] Study of cardiovascular disease prediction model based on random forest in eastern China
    Yang, Li
    Wu, Haibin
    Jin, Xiaoqing
    Zheng, Pinpin
    Hu, Shiyun
    Xu, Xiaoling
    Yu, Wei
    Yan, Jing
    SCIENTIFIC REPORTS, 2020, 10 (01)