Convolution Neural Network-Based Prediction of Protein Thermostability

被引:20
|
作者
Fang, Xingrong [1 ]
Huang, Jinsha [1 ]
Zhang, Rui [2 ]
Wang, Fei [1 ]
Zhang, Qiuyu [1 ]
Li, Guanlin [1 ]
Yan, Jinyong [1 ]
Zhang, Houjin [1 ]
Yan, Yunjun [1 ]
Xu, Li [1 ]
机构
[1] Huazhong Univ Sci & Technol, Coll Life Sci & Technol, Key Lab Mol Biophys, Minist Educ, Wuhan 430074, Hubei, Peoples R China
[2] Wuhan Inst Technol, Journal Wuhan Inst Technol, Editorial Board, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
STABILITY CHANGES; THERMODYNAMIC DATABASE; DIRECTED EVOLUTION; SEQUENCE; MUTATIONS; LIPASE; FOLDX; TOOLS;
D O I
10.1021/acs.jcim.9b00220
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Most natural proteins exhibit poor thermostability, which limits their industrial application. Computer-aided rational design is an efficient purpose-oriented method that can improve protein thermostability. Numerous machine-learning-based methods have been designed to predict the changes in protein thermostability induced by mutations. However, all of these methods have certain limitations due to existing mutation coding methods that overlook protein sequence features. Here we propose a method to predict protein thermostability using convolutional neural networks based on an in-depth study of thermostability-related protein properties. This method comprises a three-dimensional coding algorithm, including protein mutation information and a strategy to extract neighboring features at protein mutation sites based on multiscale convolution. The accuracies on the S1615 and 5388 data sets, which are widely used for protein thermostability predictions, reached 86.4 and 87%, respectively. The Matthews correlation coefficient was nearly double those produced using other methods. Furthermore, a model was constructed to predict the thermostability of Rhizomucor miehei lipase mutants based on the S3661 data set, a single amino acid mutation data set screened from the ProTherm protein thermodynamics database. Compared with the RIF strategy, which consists of three algorithms, i.e., Rosetta ddg monomer, I Mutant 3.0, and FoldX, the accuracy of the proposed method was higher (75.0 vs 66.7%), and the negative sample resolution was simultaneously enhanced. These results indicate that our prediction method more effectively assessed the protein thermostability and distinguished its features, making it a powerful tool to devise mutations that enhance the thermostability of proteins, particularly enzymes.
引用
收藏
页码:4833 / 4843
页数:11
相关论文
共 50 条
  • [1] Convolution Neural Network-based Lane Change Intention Prediction of Surrounding Vehicles for ACC
    Lee, Donghan
    Kwon, Youngwook Paul
    McMains, Sara
    Hedrick, J. Karl
    [J]. 2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [2] A Convolution Neural Network-Based Seed Classification System
    Gulzar, Yonis
    Hamid, Yasir
    Soomro, Arjumand Bano
    Alwan, Ali A.
    Journaux, Ludovic
    [J]. SYMMETRY-BASEL, 2020, 12 (12): : 1 - 18
  • [3] Prediction of Stock Based on Convolution Neural Network
    Zhang, Rui
    Wu, Zi'ang
    Wang, Siqi
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3175 - 3178
  • [4] Network-based prediction of protein interactions
    Kovacs, Istvan A.
    Luck, Katja
    Spirohn, Kerstin
    Wang, Yang
    Pollis, Carl
    Schlabach, Sadie
    Bian, Wenting
    Kim, Dae-Kyum
    Kishore, Nishka
    Hao, Tong
    Calderwood, Michael A.
    Vidal, Marc
    Barabasi, Albert-Laszlo
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [5] Network-based prediction of protein function
    Sharan, Roded
    Ulitsky, Igor
    Shamir, Ron
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2007, 3 : 1 - 13
  • [6] Network-based prediction of protein interactions
    István A. Kovács
    Katja Luck
    Kerstin Spirohn
    Yang Wang
    Carl Pollis
    Sadie Schlabach
    Wenting Bian
    Dae-Kyum Kim
    Nishka Kishore
    Tong Hao
    Michael A. Calderwood
    Marc Vidal
    Albert-László Barabási
    [J]. Nature Communications, 10
  • [7] Graph Neural Network-Based Diagnosis Prediction
    Li, Yang
    Qian, Buyue
    Zhang, Xianli
    Liu, Hui
    [J]. BIG DATA, 2020, 8 (05) : 379 - 390
  • [8] Neural network-based prediction of solar activities
    Qahwaji, Rarni S. R.
    Colak, Tufan
    [J]. 3RD INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS, AND APPLICAT/4TH INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2006, : 192 - +
  • [9] Prediction of protein interaction with neural network-based feature association rule mining
    Eom, Jae-Hong
    Zhang, Byoung-Tak
    [J]. NEURAL INFORMATION PROCESSING, PT 3, PROCEEDINGS, 2006, 4234 : 30 - 39
  • [10] A Neural Network-Based Multi-Label Classifier for Protein Function Prediction
    Tahzeeb, Shahab
    Hasan, Shehzad
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2022, 12 (01) : 7974 - 7981