A comparative evaluation of machine learning algorithms for predicting syngas fermentation outcomes

被引:7
|
作者
Roell, Garrett W. [1 ]
Sathish, Ashik [2 ,3 ]
Wan, Ni [1 ]
Cheng, Qianshun [4 ]
Wen, Zhiyou [2 ,3 ]
Tang, Yinjie J. [1 ]
Bao, Forrest Sheng [5 ]
机构
[1] Washington Univ St Louis, DOE Environm & Chem Engn, St Louis, MO 63130 USA
[2] Iowa State Univ, Dept Agr & Biosyst Engn, Ames, IA 50011 USA
[3] Iowa State Univ, Dept Food Sci & Human Nutr, Ames, IA 50011 USA
[4] Univ Illinois, Dept Math Stat & Comp Sci, Chicago, IL 60607 USA
[5] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
Clostridium carboxidivorans; Neural network; Random forest; Support vector machine; Data transformation; Model predictive control; HOLLOW-FIBER MEMBRANE; MASS-TRANSFER; BUTANOL; P7;
D O I
10.1016/j.bej.2022.108578
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Clostridium carboxidivorans can use syngas to produce acids and alcohols. However, simulating gas fermentation dynamics remains challenging. This study employed data transformation and machine learning (ML) approaches to predict syngas fermentation behavior. Syngas composition and fermentative metabolite concentrations (features) were paired with the production rates (prediction targets) of acetate, ethanol, butyrate, and butanol at each time point. This transformation avoided the use of time as a feature. Data augmentation by polynomial smoothing of experimental measurements was used to create a database for supervised learning of 836 rate instances from 10 gas compositions. Seven families of ML algorithms were compared, including neural networks, support vector machines, random forests, elastic nets, lasso regressors, k-nearest neighbors, and Bayesian ridge regressors. These algorithms predicted production rates for training data with Pearson correlation coefficients (R-2 > 0.9), but they showed poorer performance for predicting unseen test data. Among the algorithms, random forests and support vector machines produced the most accurate predictions for the test data, which could regenerate product concentration curves (R-2 asymptotic to 0.85). In contrast, neural networks had a higher risk of over-fitting. Additionally, ML-based feature importance analysis highlighted the significant impacts of CO and H-2 on alcohol production, which offersguidance for model predictive control. Together, these findings can help direct future applications of ML algorithms to complex bioprocesses with limited data.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Performance Evaluation and Comparative Analysis of Different Machine Learning Algorithms in Predicting Cardiovascular Disease
    Asif, Md Asfi-Ar-Raihan
    Nishat, Mirza Muntasir
    Faisal, Fahim
    Dip, Rezuanur Rahman
    Udoy, Mahmudul Hasan
    Shikder, Md Fahim
    Ahsan, Ragib
    ENGINEERING LETTERS, 2021, 29 (02) : 731 - 741
  • [2] A comparative analysis of machine learning algorithms for predicting wave runup
    Ahmet Durap
    Anthropocene Coasts, 6
  • [3] A comparative analysis of machine learning algorithms for predicting wave runup
    Durap, Ahmet
    ANTHROPOCENE COASTS, 2023, 6 (01)
  • [4] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang Fangchao
    Tong Lingling
    Shi Chen
    Zuo Rui
    Wang Liwei
    Wang Yan
    母胎医学杂志(英文), 2024, 06 (03)
  • [5] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang, Fangchao
    Tong, Lingling
    Shi, Chen
    Zuo, Rui
    Wang, Liwei
    Wang, Yan
    MATERNAL-FETAL MEDICINE, 2024, 6 (03) : 141 - 146
  • [6] Evaluation of machine learning algorithms for prediction of trabeculectomy outcomes
    Zanabli, Ahmed Amer
    Ul Banna, Hasan
    McMillan, Brian
    Lehmann, Maria
    Gupta, Sumeet
    Palko, Joel R.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)
  • [7] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
    Saxena, Surabhi
    Mohapatra, Debashish
    Padhee, Subhransu
    Sahoo, Goutam Kumar
    EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 587 - 603
  • [8] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
    Surabhi Saxena
    Debashish Mohapatra
    Subhransu Padhee
    Goutam Kumar Sahoo
    Evolutionary Intelligence, 2023, 16 : 587 - 603
  • [9] A Comparative Study of Machine Learning Algorithms for Predicting Weight Range of Neonate
    Adeeba, Saleem
    Banujan, Kuhaneswaran
    Kumara, B. T. G. S.
    Prasanth, Senthan
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 869 - 873
  • [10] Comparative Analysis of Machine Learning Algorithms for Predicting Air Quality Index
    Kekulanadara, K.M.O.V.K.
    Kumara, B.T.G.S.
    Kuhaneswaran, Banujan
    2021 From Innovation To Impact, FITI 2021, 2021,