GraphGPSM: a global scoring model for protein structure using graph neural networks

被引:5
|
作者
He, Guangxing [1 ]
Liu, Jun [1 ]
Liu, Dong [1 ]
Zhang, Guijun [1 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China
基金
国家重点研发计划;
关键词
protein structures; scoring model; graph neural network; protein modeling; STRUCTURE PREDICTION; QUALITY; SEQUENCE; SPACE;
D O I
10.1093/bib/bbad219
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The scoring models used for protein structure modeling and ranking are mainly divided into unified field and protein-specific scoring functions. Although protein structure prediction has made tremendous progress since CASP14, the modeling accuracy still cannot meet the requirements to a certain extent. Especially, accurate modeling of multi-domain and orphan proteins remains a challenge. Therefore, an accurate and efficient protein scoring model should be developed urgently to guide the protein structure folding or ranking through deep learning. In this work, we propose a protein structure global scoring model based on equivariant graph neural network (EGNN), named GraphGPSM, to guide protein structure modeling and ranking. We construct an EGNN architecture, and a message passing mechanism is designed to update and transmit information between nodes and edges of the graph. Finally, the global score of the protein model is output through a multilayer perceptron. Residue-level ultrafast shape recognition is used to describe the relationship between residues and the overall structure topology, and distance and direction encoded by Gaussian radial basis functions are designed to represent the overall topology of the protein backbone. These two features are combined with Rosetta energy terms, backbone dihedral angles and inter-residue distance and orientations to represent the protein model and embedded into the nodes and edges of the graph neural network. The experimental results on the CASP13, CASP14 and CAMEO test sets show that the scores of our developed GraphGPSM have a strong correlation with the TM-score of the models, which are significantly better than those of the unified field score function REF2015 and the state-of-the-art local lDDT-based scoring models ModFOLD8, ProQ3D and DeepAccNet, etc. The modeling experimental results on 484 test proteins demonstrate that GraphGPSM can greatly improve the modeling accuracy. GraphGPSM is further used to model 35 orphan proteins and 57 multi-domain proteins. The results show that the average TM-score of the models predicted by GraphGPSM is 13.2 and 7.1% higher than that of the models predicted by AlphaFold2. GraphGPSM also participates in CASP15 and achieves competitive performance in global accuracy estimation.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Protein-ligand scoring with convolutional neural networks
    Koes, David
    Ragoza, Matthew
    Idrobo, Elisa
    Hochuli, Joshua
    Sunseri, Jocelyn
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [22] Convolutional neural networks for protein-ligand scoring
    Ragoza, Matt
    Collins, Jasmine
    Koes, David
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [23] Protein-Ligand Scoring with Convolutional Neural Networks
    Ragoza, Matthew
    Hochuli, Joshua
    Idrobo, Elisa
    Sunseri, Jocelyn
    Koes, David Ryan
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (04) : 942 - 957
  • [24] GRAPH NEURAL NETWORKS FOR PREDICTING PROTEIN FUNCTIONS
    Ioannidis, Vassilis N.
    Marques, Antonio G.
    Giannakis, Georgios B.
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 221 - 225
  • [25] Scoring Summaries Using Recurrent Neural Networks
    Ruseti, Stefan
    Dascalu, Mihai
    Johnson, Amy M.
    McNamara, Danielle S.
    Balyan, Renu
    McCarthy, Kathryn S.
    Trausan-Matu, Stefan
    INTELLIGENT TUTORING SYSTEMS, ITS 2018, 2018, 10858 : 191 - 201
  • [26] Sleep scoring using artificial neural networks
    Ronzhina, Marina
    Janousek, Oto
    Kolarova, Jana
    Novakova, Marie
    Honzik, Petr
    Provaznik, Ivo
    SLEEP MEDICINE REVIEWS, 2012, 16 (03) : 251 - 263
  • [27] Automatic Text Scoring Using Neural Networks
    Alikaniotis, Dimitrios
    Yannakoudakis, Helen
    Rei, Marek
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 715 - 725
  • [28] Graph structure and homophily for label propagation in Graph Neural Networks
    Vandromme, Maxence
    Petiton, Serge G.
    2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 194 - 201
  • [29] Graph Neural Networks for the Global Economy with Microsoft DeepGraph
    Yang, Jaewon
    Shi, Baoxu
    Samylkin, Alex
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1655 - 1655
  • [30] Towards the Characterization of Realistic Model Generators using Graph Neural Networks
    Hernandez Lopez, Jose Antonio
    Sanchez Cuadrado, Jesus
    24TH INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS 2021), 2021, : 58 - 69