hydrogenase;
metalloprotein;
protein engineering;
O;
sensitivity;
machine learning;
D O I:
暂无
中图分类号:
学科分类号:
摘要:
Improving a functional property of an enzyme via mutagenesis is still a challenging problem due to vast search space and difficulty of predicting the effects of mutation(s). Machine learning has proven to be proficient in solving similar problems with unprecedented speed owing to the latest advances in computing power and analytical algorithms. In this study, we investigate the performance of machine learning methods in predicting the H2 production activity and O2 tolerance of the hydrogenase variants. Experimentally measured activities and tolerance of 377 variants having single or double amino acid replacements are used to train and test seven types of machine learning models. Binary representation of amino acid sequence as well as the series of vectors quantifying physicochemical properties of amino acids, namely VHSE, are employed as features representing each variant. The results show that the VHSE enable higher performance, especially with respect to correlation coefficient and coefficient of determination in addition to the root mean square error. Next, the analysis of model performance with respect to changes in the data size and heterogeneity is conducted to provide insights on designing effective mutagenesis library for applying machine learning. The best performance was obtained when support vector machine or ridge regression was trained using a large, homogeneous data. In this manner, our study reveals the factors affecting the performance of machine learning in identifying the enzyme variants with enhanced function.
机构:Center for Biologics Evaluation and Research,Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products
Brian C. Lin
Upendra Katneni
论文数: 0引用数: 0
h-index: 0
机构:Center for Biologics Evaluation and Research,Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products
Upendra Katneni
Katarzyna I. Jankowska
论文数: 0引用数: 0
h-index: 0
机构:Center for Biologics Evaluation and Research,Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products
Katarzyna I. Jankowska
Douglas Meyer
论文数: 0引用数: 0
h-index: 0
机构:Center for Biologics Evaluation and Research,Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products
Douglas Meyer
Chava Kimchi-Sarfaty
论文数: 0引用数: 0
h-index: 0
机构:Center for Biologics Evaluation and Research,Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products
机构:
St Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
Geog Inst Jovan Cvij SASA, Djure Jaks 9, Belgrade 11000, SerbiaSt Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
Vukovic, Darko B.
Spitsina, Lubov
论文数: 0引用数: 0
h-index: 0
机构:
Natl Res Tomsk Polytech Univ, Sch Engn Educ, Div Social Sci & Humanities, Lenina Ave 30, Tomsk 634050, RussiaSt Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
Spitsina, Lubov
Gribanova, Ekaterina
论文数: 0引用数: 0
h-index: 0
机构:
Natl Res Tomsk Polytech Univ, Sch Engn Educ, Div Social Sci & Humanities, Lenina Ave 30, Tomsk 634050, RussiaSt Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
Gribanova, Ekaterina
Spitsin, Vladislav
论文数: 0引用数: 0
h-index: 0
机构:
Natl Res Tomsk Polytech Univ, Sch Engn Entrepreneurship, Lenina Ave 30, Tomsk 634050, RussiaSt Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
Spitsin, Vladislav
Lyzin, Ivan
论文数: 0引用数: 0
h-index: 0
机构:
Natl Res Tomsk Polytech Univ, Sch Informat Technol & Robot Engn, Lenina Ave, 30, Tomsk 634050, RussiaSt Petersburg State Univ, Grad Sch Management, Volkhovskiy Pereulok 3, St Petersburg 199004, Russia
机构:
South China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R ChinaSouth China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R China
Chen, Ziming
Wang, Jing
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Mat & Energy, Guangzhou 510006, Peoples R ChinaSouth China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R China
Wang, Jing
Li, Canjie
论文数: 0引用数: 0
h-index: 0
机构:
South China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R ChinaSouth China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R China
Li, Canjie
Liu, Baiquan
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510275, Peoples R ChinaSouth China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R China
Liu, Baiquan
论文数: 引用数:
h-index:
机构:
Luo, Dongxiang
论文数: 引用数:
h-index:
机构:
Min, Yonggang
Fu, Nianqing
论文数: 0引用数: 0
h-index: 0
机构:
South China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R ChinaSouth China Univ Technol, Inst Polymer Optoelect Mat & Devices, Sch Mat Sci & Engn, State Key Lab Luminescent Mat & Devices, Guangzhou 510640, Peoples R China