A comparative evaluation of machine learning algorithms for predicting syngas fermentation outcomes

被引:7
|
作者
Roell, Garrett W. [1 ]
Sathish, Ashik [2 ,3 ]
Wan, Ni [1 ]
Cheng, Qianshun [4 ]
Wen, Zhiyou [2 ,3 ]
Tang, Yinjie J. [1 ]
Bao, Forrest Sheng [5 ]
机构
[1] Washington Univ St Louis, DOE Environm & Chem Engn, St Louis, MO 63130 USA
[2] Iowa State Univ, Dept Agr & Biosyst Engn, Ames, IA 50011 USA
[3] Iowa State Univ, Dept Food Sci & Human Nutr, Ames, IA 50011 USA
[4] Univ Illinois, Dept Math Stat & Comp Sci, Chicago, IL 60607 USA
[5] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
Clostridium carboxidivorans; Neural network; Random forest; Support vector machine; Data transformation; Model predictive control; HOLLOW-FIBER MEMBRANE; MASS-TRANSFER; BUTANOL; P7;
D O I
10.1016/j.bej.2022.108578
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Clostridium carboxidivorans can use syngas to produce acids and alcohols. However, simulating gas fermentation dynamics remains challenging. This study employed data transformation and machine learning (ML) approaches to predict syngas fermentation behavior. Syngas composition and fermentative metabolite concentrations (features) were paired with the production rates (prediction targets) of acetate, ethanol, butyrate, and butanol at each time point. This transformation avoided the use of time as a feature. Data augmentation by polynomial smoothing of experimental measurements was used to create a database for supervised learning of 836 rate instances from 10 gas compositions. Seven families of ML algorithms were compared, including neural networks, support vector machines, random forests, elastic nets, lasso regressors, k-nearest neighbors, and Bayesian ridge regressors. These algorithms predicted production rates for training data with Pearson correlation coefficients (R-2 > 0.9), but they showed poorer performance for predicting unseen test data. Among the algorithms, random forests and support vector machines produced the most accurate predictions for the test data, which could regenerate product concentration curves (R-2 asymptotic to 0.85). In contrast, neural networks had a higher risk of over-fitting. Additionally, ML-based feature importance analysis highlighted the significant impacts of CO and H-2 on alcohol production, which offersguidance for model predictive control. Together, these findings can help direct future applications of ML algorithms to complex bioprocesses with limited data.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Comparative analysis of machine learning algorithms for predicting standard time in a manufacturing environment
    Cakit, Erman
    Dagdeviren, Metin
    AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2023, 37
  • [22] Comparative analysis of machine learning algorithms for predicting live weight of Hereford cows
    Ruchay, Alexey
    Kober, Vitaly
    Dorofeev, Konstantin
    Kolpakov, Vladimir
    Dzhulamanov, Kinispay
    Kalschikov, Vsevolod
    Guo, Hao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 195
  • [23] A comparative study of machine learning and deep learning algorithms for predicting student's academic performance
    Bhushan, Megha
    Vyas, Satyam
    Mall, Shrey
    Negi, Arun
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023, 14 (06) : 2674 - 2683
  • [24] A comparative study of machine learning and deep learning algorithms for predicting student’s academic performance
    Megha Bhushan
    Satyam Vyas
    Shrey Mall
    Arun Negi
    International Journal of System Assurance Engineering and Management, 2023, 14 : 2674 - 2683
  • [25] Comparative Evaluation of Machine Learning Algorithms for Credit Card Fraud Detection
    Singh, Kiran Jot
    Thakur, Khushal
    Kapoor, Divneet Singh
    Sharma, Anshul
    Bajpai, Sakshi
    Sirawag, Neeraj
    Mehta, Riya
    Chaudhary, Chitransh
    Singh, Utkarsh
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 69 - 78
  • [26] Evaluation of Machine Learning Algorithms in Predicting the Next SQL Query from the Future
    Meduri, Venkata Vamsikrishna
    Chowdhury, Kanchan
    Sarwat, Mohamed
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2021, 46 (01):
  • [27] A comparative evaluation of machine learning and deep learning algorithms for question categorization of VQA datasets
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 57829 - 57859
  • [28] Comparative studies of different machine learning algorithms in predicting the compressive strength of geopolymer concrete
    Paruthi, Sagar
    Rahman, Ibadur
    Husain, Asif
    COMPUTERS AND CONCRETE, 2023, 32 (06): : 607 - 613
  • [29] Predicting the clothing insulation through machine learning algorithms: A comparative analysis and a practical approach
    Pablo Aparicio-Ruiz
    Elena Barbadilla-Martín
    José Guadix
    Jesús Muñuzuri
    Building Simulation, 2024, 17 : 839 - 855
  • [30] A Comparative Study of Machine Learning Algorithms in Predicting Severe Complications after Bariatric Surgery
    Cao, Yang
    Fang, Xin
    Ottosson, Johan
    Naslund, Erik
    Stenberg, Erik
    JOURNAL OF CLINICAL MEDICINE, 2019, 8 (05)