Rapid prediction of protein natural frequencies using graph neural networks

被引:16
|
作者
Guo, Kai [1 ,2 ]
Buehler, Markus J. [1 ,3 ,4 ]
机构
[1] MIT, Lab Atomist & Mol Mech LAMM, 77 Massachusetts Ave 1-165, Cambridge, MA 02139 USA
[2] ASTAR, Inst High Performance Comp, Singapore 138632, Singapore
[3] MIT, Schwarzman Coll Comp, Ctr Computat Sci & Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] Ctr Mat Sci & Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
DIGITAL DISCOVERY | 2022年 / 1卷 / 03期
关键词
DESIGN; DISCOVERY; MODES;
D O I
10.1039/d1dd00007a
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Natural vibrational frequencies of proteins help to correlate functional shifts with sequence or geometric variations that lead to negligible changes in protein structures, such as point mutations related to disease lethality or medication effectiveness. Normal mode analysis is a well-known approach to accurately obtain protein natural frequencies. However, it is not feasible when high-resolution protein structures are not available or time consuming to obtain. Here we provide a machine learning model to directly predict protein frequencies from primary amino acid sequences and low-resolution structural features such as contact or distance maps. We utilize a graph neural network called principal neighborhood aggregation, trained with the structural graphs and normal mode frequencies of more than 34 000 proteins from the protein data bank. combining with existing contact/distance map prediction tools, this approach enables an end-to-end prediction of the frequency spectrum of a protein given its primary sequence. We present a computational framework based on graph neural networks (GNNs) to predict the natural frequencies of proteins from primary amino acid sequences and contact/distance maps.
引用
收藏
页码:277 / 285
页数:9
相关论文
共 50 条
  • [41] Equivariant Graph Neural Networks for Toxicity Prediction
    Cremer, Julian
    Medrano Sandonas, Leonardo
    Tkatchenko, Alexandre
    Clevert, Djork-Arne
    De Fabritiis, Gianni
    CHEMICAL RESEARCH IN TOXICOLOGY, 2023, 36 (10) : 1561 - 1573
  • [42] Disease Prediction via Graph Neural Networks
    Sun, Zhenchao
    Yin, Hongzhi
    Chen, Hongxu
    Chen, Tong
    Cui, Lizhen
    Yang, Fan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (03) : 818 - 826
  • [43] Line Graph Neural Networks for Link Prediction
    Cai, Lei
    Li, Jundong
    Wang, Jie
    Ji, Shuiwang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5103 - 5113
  • [44] Software bug prediction using graph neural networks and graph-based text representations
    Siachos, Ilias
    Kanakaris, Nikos
    Karacapilidis, Nikos
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [45] Decoding the protein–ligand interactions using parallel graph neural networks
    Carter Knutson
    Mridula Bontha
    Jenna A. Bilbrey
    Neeraj Kumar
    Scientific Reports, 12
  • [46] Fast and Flexible Protein Design Using Deep Graph Neural Networks
    Strokach, Alexey
    Becerra, David
    Corbi-Verge, Carles
    Perez-Riba, Albert
    Kim, Philip M.
    CELL SYSTEMS, 2020, 11 (04) : 402 - +
  • [47] CoPriNet: graph neural networks provide accurate and rapid compound price prediction for molecule prioritisation
    Sanchez-Garcia, Ruben
    Havasi, David
    Takacs, Gergely
    Robinson, Matthew C.
    Lee, Alpha
    von Delft, Frank
    Deane, Charlotte M.
    DIGITAL DISCOVERY, 2023, 2 (01): : 103 - 111
  • [48] Implementing link prediction in protein networks via feature fusion models based on graph neural networks
    Zhang, Chi
    Gao, Qian
    Li, Ming
    Yu, Tianfei
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 108
  • [49] IGPRED: Combination of convolutional neural and graph convolutional networks for protein secondary structure prediction
    Gormez, Yasin
    Sabzekar, Mostafa
    Aydin, Zafer
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (10) : 1277 - 1288
  • [50] Three-stage prediction of protein β-sheets by neural networks, alignments and graph algorithms
    Cheng, JL
    Baldi, P
    BIOINFORMATICS, 2005, 21 : I75 - I84