Protein Flexibility Prediction Based on Point Cloud Convolutional Neural Network

被引:0
|
作者
Zhang Xiao-Hui [1 ]
Gu Hao-Sheng [1 ]
Wang Zhi-Ren [1 ]
机构
[1] Yanshan Univ, Sch Sci, Qinhuangdao 066004, Hebei, Peoples R China
关键词
protein flexibility; PointNet; point cloud; pooling operations; spatial transformation network; mini-batch; B-factor; DYNAMICS; SITES; MODEL;
D O I
10.16476/j.pibb.2020.0383
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Objective Protein flexibility plays important roles in various biochemical processes in the living organisms, such as enzyme catalysis, signal transduction, substance transport and storage, etc. Prediction of the intrinsic flexible motions based on the tertiary structure of proteins is helpful for our better understanding of the mechanism of protein functions, which is an important scientific problem in the research field of protein structure-function relationship. Convolutional neural network (CNN), one of the mainstream algorithms in deep learning, has been successfully applied in the study of protein structure-function relationship. Methods In the present work, based on the idea of PointNet method developed in the computer vision research, a CNN model was proposed to predict the protein flexibility. In this model, protein structures were treated as three-dimensional point clouds, where the atomic coordinates of proteins were directly inputted into the model, and the permutation invariance and global rotation invariance of the point cloud were delt with by using the pooling operations and a spatial transformation network, respectively. In addition, considering the varied sizes of different proteins, a new mini-batch optimization strategy was proposed, where the model was trained by using the mini-batches of protein structures with different sizes as input. The Pearson correlation coefficient was used as the evaluation function for the training of the model. Besides that, in order to further enhance the performance of the network, an improved model was constructed based on the PointNet-based CNN model, in which the max-pooling and the average-pooling were concatenated to better extract the global features of protein structures. Then the PointNet-based CNN model and the improved model were trained and tested by using the temperature factors (B-factors) of 243 non-redundant proteins. Results The results show that the average Pearson correlation coefficient between the predicted and the experimental temperature factors predicted by the PointNet-based model and the improved model were 0.64 and 0.65, respectively. The prediction accuracy of our models is better than that of the Gaussian network model that has been widely used in investigating protein flexibility. Especially, for the 74 relatively loose natural disordered proteins from the Disbind website, the average Pearson correlation coefficient predicted by our models were 0.62 and 0.64, respectively, which were significantly better than GNM. Conclusion Our studies provide an effective model for the effective prediction of the intrinsic flexibility encoded in protein structures.
引用
收藏
页码:607 / 616
页数:10
相关论文
共 23 条
  • [1] 3D Morphable Models as Spatial Transformer Networks
    Bas, Anil
    Huber, Patrik
    Smith, William A. P.
    Awais, Muhammad
    Kittler, Josef
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 895 - 903
  • [2] Machine Learning to Identify Flexibility Signatures of Class A GPCR Inhibition
    Bemister-Buffington, Joseph
    Wolf, Alex J.
    Raschka, Sebastian
    Kuhn, Leslie A.
    [J]. BIOMOLECULES, 2020, 10 (03)
  • [3] Botalb A., 2018, 2018 INT C INTELLIGE, DOI [DOI 10.1109/ICIAS.2018, DOI 10.1109/ICIAS.2018.8540626]
  • [4] Blind prediction of protein B-factor and flexibility
    Bramer, David
    Wei, Guo-Wei
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2018, 149 (13):
  • [5] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [6] Effects of local repositioning of charged surface residues on the kinetics of protein dimerization probed by Brownian dynamics simulations
    Fisher, EW
    Rojnuckarin, A
    Kim, S
    [J]. JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2002, 592 : 37 - 45
  • [7] Fout A, 2017, ADV NEUR IN, V30
  • [8] A fast method to predict protein interaction sites from sequences
    Gallet, X
    Charloteaux, B
    Thomas, A
    Brasseur, R
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 302 (04) : 917 - 926
  • [9] Gaussian dynamics of folded proteins
    Haliloglu, T
    Bahar, I
    Erman, B
    [J]. PHYSICAL REVIEW LETTERS, 1997, 79 (16) : 3090 - 3093
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778