GraphGPSM: a global scoring model for protein structure using graph neural networks

被引:5
|
作者
He, Guangxing [1 ]
Liu, Jun [1 ]
Liu, Dong [1 ]
Zhang, Guijun [1 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China
基金
国家重点研发计划;
关键词
protein structures; scoring model; graph neural network; protein modeling; STRUCTURE PREDICTION; QUALITY; SEQUENCE; SPACE;
D O I
10.1093/bib/bbad219
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The scoring models used for protein structure modeling and ranking are mainly divided into unified field and protein-specific scoring functions. Although protein structure prediction has made tremendous progress since CASP14, the modeling accuracy still cannot meet the requirements to a certain extent. Especially, accurate modeling of multi-domain and orphan proteins remains a challenge. Therefore, an accurate and efficient protein scoring model should be developed urgently to guide the protein structure folding or ranking through deep learning. In this work, we propose a protein structure global scoring model based on equivariant graph neural network (EGNN), named GraphGPSM, to guide protein structure modeling and ranking. We construct an EGNN architecture, and a message passing mechanism is designed to update and transmit information between nodes and edges of the graph. Finally, the global score of the protein model is output through a multilayer perceptron. Residue-level ultrafast shape recognition is used to describe the relationship between residues and the overall structure topology, and distance and direction encoded by Gaussian radial basis functions are designed to represent the overall topology of the protein backbone. These two features are combined with Rosetta energy terms, backbone dihedral angles and inter-residue distance and orientations to represent the protein model and embedded into the nodes and edges of the graph neural network. The experimental results on the CASP13, CASP14 and CAMEO test sets show that the scores of our developed GraphGPSM have a strong correlation with the TM-score of the models, which are significantly better than those of the unified field score function REF2015 and the state-of-the-art local lDDT-based scoring models ModFOLD8, ProQ3D and DeepAccNet, etc. The modeling experimental results on 484 test proteins demonstrate that GraphGPSM can greatly improve the modeling accuracy. GraphGPSM is further used to model 35 orphan proteins and 57 multi-domain proteins. The results show that the average TM-score of the models predicted by GraphGPSM is 13.2 and 7.1% higher than that of the models predicted by AlphaFold2. GraphGPSM also participates in CASP15 and achieves competitive performance in global accuracy estimation.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A dual graph neural networks model using sequence embedding as graph nodes for vulnerability detection
    Ling, Miaogui
    Tang, Mingwei
    Bian, Deng
    Lv, Shixuan
    Tang, Qi
    Information and Software Technology, 2025, 177
  • [32] Predicting the functional state of protein kinases using interpretable graph neural networks
    Ravichandran, Ashwin
    Araque, Juan
    Lawson, John
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 321A - 321A
  • [33] PANDA2: protein function prediction using graph neural networks
    Zhao, Chenguang
    Liu, Tong
    Wang, Zheng
    NAR GENOMICS AND BIOINFORMATICS, 2022, 4 (01)
  • [34] GNNfam: Utilizing Sparsity in Protein Family Predictions using Graph Neural Networks
    Godase, Anuj
    Rahman, Md Khaledur
    Azad, Ariful
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [35] Decoding the protein-ligand interactions using parallel graph neural networks
    Knutson, Carter
    Bontha, Mridula
    Bilbrey, Jenna A.
    Kumar, Neeraj
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [36] IGPRED: Combination of convolutional neural and graph convolutional networks for protein secondary structure prediction
    Gormez, Yasin
    Sabzekar, Mostafa
    Aydin, Zafer
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (10) : 1277 - 1288
  • [37] Hierarchical Model Selection for Graph Neural Networks
    Oishi, Yuga
    Kaneiwa, Ken
    IEEE ACCESS, 2023, 11 : 16974 - 16983
  • [38] Predicting residue-specific qualities of individual protein models using residual neural networks and graph neural networks
    Zhao, Chenguang
    Liu, Tong
    Wang, Zheng
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2022, 90 (12) : 2091 - 2102
  • [39] Protein secondary structure prediction using three neural networks and a segmental semi Markov model
    Malekpour, Seyed Amir
    Naghizaideh, Sima
    Pezeshk, Hamid
    Sadeghi, Mehdi
    Eslahchi, Changiz
    MATHEMATICAL BIOSCIENCES, 2009, 217 (02) : 145 - 150
  • [40] Modeling and prediction of protein structure using deep neural networks
    Lamoureux, Guillaume
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256