Investigating the Performance of Machine Learning Methods in Predicting Functional Properties of the Hydrogenase Variants

Cited by: 0
Authors
Gyucheol Choi
Wonjun Kim
Jamin Koo
Affiliation
[1] Hongik University, Department of Chemical Engineering
Keywords
hydrogenase; metalloprotein; protein engineering; O2 sensitivity; machine learning
DOI
Not available
Abstract
Improving a functional property of an enzyme through mutagenesis remains challenging because the search space is vast and the effects of mutations are difficult to predict. Machine learning has proven effective at solving similar problems quickly, owing to recent advances in computing power and analytical algorithms. In this study, we investigate the performance of machine learning methods in predicting the H2 production activity and O2 tolerance of hydrogenase variants. Experimentally measured activities and tolerances of 377 variants carrying single or double amino acid replacements are used to train and test seven types of machine learning models. Each variant is represented either by a binary encoding of its amino acid sequence or by VHSE, a series of vectors quantifying the physicochemical properties of amino acids. The results show that VHSE features yield higher performance, particularly in the correlation coefficient and coefficient of determination, as well as in the root mean square error. We then analyze how model performance changes with data size and heterogeneity to provide insight into designing effective mutagenesis libraries for machine learning. The best performance was obtained when a support vector machine or ridge regression model was trained on a large, homogeneous data set. Our study thus reveals factors that affect the performance of machine learning in identifying enzyme variants with enhanced function.
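As an illustration of the binary sequence representation and the three evaluation metrics named in the abstract, the following is a minimal Python sketch. It is not the authors' code. The helper names (`apply_substitutions`, `one_hot_encode`, `rmse`, `r_squared`, `pearson_r`), the 1-based position convention, and the toy sequence are assumptions made for this example, and VHSE descriptor values are not reproduced here.

```python
import math

# The 20 standard amino acids, in an arbitrary but fixed order (assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def apply_substitutions(parent, substitutions):
    """Return a variant sequence; substitutions are (1-based position, new residue)."""
    residues = list(parent)
    for position, new_residue in substitutions:
        residues[position - 1] = new_residue
    return "".join(residues)


def one_hot_encode(sequence):
    """Binary representation: 20 indicator values per residue, concatenated."""
    features = []
    for residue in sequence:
        features.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return features


def rmse(y_true, y_pred):
    """Root mean square error between measured and predicted values."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between measured and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Toy usage: a hypothetical three-residue parent with one substitution.
variant = apply_substitutions("ACD", [(2, "W")])
features = one_hot_encode(variant)
```

A physicochemical encoding such as VHSE would replace the 20 indicator values per residue with that residue's descriptor values, leaving the rest of the pipeline unchanged.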
Pages: 143-151
Number of pages: 8
Related papers
50 in total
  • [1] Investigating the Performance of Machine Learning Methods in Predicting Functional Properties of the Hydrogenase Variants
    Choi, Gyucheol
    Kim, Wonjun
    Koo, Jamin
    [J]. BIOTECHNOLOGY AND BIOPROCESS ENGINEERING, 2023, 28 (01) : 143 - 151
  • [2] Predicting functional effects of ion channel variants using new phenotypic machine learning methods
    Bosselmann, Christian Malte
    Hedrich, Ulrike B. S.
    Lerche, Holger
    Pfeifer, Nico
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (03)
  • [3] Application of machine learning methods for predicting the mechanical properties of rubbercrete
    Miladirad, Kaveh
    Golafshani, Emadaldin Mohammadi
    Safehian, Majid
    Sarkar, Alireza
    [J]. ADVANCES IN CONCRETE CONSTRUCTION, 2022, 14 (01) : 15 - 34
  • [4] In silico methods for predicting functional synonymous variants
    Brian C. Lin
    Upendra Katneni
    Katarzyna I. Jankowska
    Douglas Meyer
    Chava Kimchi-Sarfaty
    [J]. Genome Biology, 24
  • [5] In silico methods for predicting functional synonymous variants
    Lin, Brian C. C.
    Katneni, Upendra
    Jankowska, Katarzyna I.
    Meyer, Douglas
    Kimchi-Sarfaty, Chava
    [J]. GENOME BIOLOGY, 2023, 24 (01)
  • [6] Predicting the Performance of Retail Market Firms: Regression and Machine Learning Methods
    Vukovic, Darko B.
    Spitsina, Lubov
    Gribanova, Ekaterina
    Spitsin, Vladislav
    Lyzin, Ivan
    [J]. MATHEMATICS, 2023, 11 (08)
  • [7] Machine Learning methods in predicting electroencephalogram
    Lin, Zizhao
    Ma, Yijiang
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
  • [8] Highly versatile and accurate machine learning methods for predicting perovskite properties
    Chen, Ziming
    Wang, Jing
    Li, Canjie
    Liu, Baiquan
    Luo, Dongxiang
    Min, Yonggang
    Fu, Nianqing
    Xue, Qifan
    [J]. JOURNAL OF MATERIALS CHEMISTRY C, 2024, 12 (38) : 15444 - 15453
  • [9] Perspective: Predicting and optimizing thermal transport properties with machine learning methods
    Wei, Han
    Bao, Hua
    Ruan, Xiulin
    [J]. ENERGY AND AI, 2022, 8
  • [10] A Comprehensive Study on Predicting Functional Role of Metagenomes Using Machine Learning Methods
    Wassan, Jyotsna Talreja
    Wang, Haiying
    Browne, Fiona
    Zheng, Huiru
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 751 - 763