GPCR-BERT: Interpreting Sequential Design of G Protein-Coupled Receptors Using Protein Language Models

被引:3
|
作者
Kim, Seongwon [1 ]
Mollaei, Parisa [2 ]
Antony, Akshay [2 ]
Magar, Rishikesh [2 ]
Barati Farimani, Amir [2 ,3 ,4 ]
机构
[1] Carnegie Mellon Univ, Dept Chem Engn, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Dept Mech Engn, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Dept Biomed Engn, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
基金
美国安德鲁·梅隆基金会;
关键词
DRUG DISCOVERY; PREDICTION; TARGETS;
D O I
10.1021/acs.jcim.3c01706
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
With the rise of transformers and large language models (LLMs) in chemistry and biology, new avenues for the design and understanding of therapeutics have been opened up to the scientific community. Protein sequences can be modeled as language and can take advantage of recent advances in LLMs, specifically with the abundance of our access to the protein sequence data sets. In this letter, we developed the GPCR-BERT model for understanding the sequential design of G protein-coupled receptors (GPCRs). GPCRs are the target of over one-third of Food and Drug Administration-approved pharmaceuticals. However, there is a lack of comprehensive understanding regarding the relationship among amino acid sequence, ligand selectivity, and conformational motifs (such as NPxxY, CWxP, and E/DRY). By utilizing the pretrained protein model (Prot-Bert) and fine-tuning with prediction tasks of variations in the motifs, we were able to shed light on several relationships between residues in the binding pocket and some of the conserved motifs. To achieve this, we took advantage of attention weights and hidden states of the model that are interpreted to extract the extent of contributions of amino acids in dictating the type of masked ones. The fine-tuned models demonstrated high accuracy in predicting hidden residues within the motifs. In addition, the analysis of embedding was performed over 3D structures to elucidate the higher-order interactions within the conformations of the receptors.
引用
收藏
页码:1134 / 1144
页数:11
相关论文
共 50 条
  • [31] The conformational transition during G protein-coupled receptor (GPCR) and G protein interaction
    Ahn, Donghoon
    Ham, Donghee
    Chung, Ka Young
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2021, 69 : 117 - 123
  • [32] GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble
    Muhammad Naveed
    Asif Ullah Khan
    Amino Acids, 2012, 42 : 1809 - 1823
  • [33] Reconstitution of G protein-coupled receptors with recombinant G protein α and βγ subunits
    McIntire, WE
    Myung, CS
    MacCleery, G
    Wang, Q
    Garrison, JC
    G PROTEIN PATHWAYS, PT A, RECEPTORS, 2002, 343 : 372 - 393
  • [34] Olfactory receptors: G protein-coupled receptors and beyond
    Spehr, Marc
    Munger, Steven D.
    JOURNAL OF NEUROCHEMISTRY, 2009, 109 (06) : 1570 - 1583
  • [35] G protein-coupled receptor (GPCR) regulation of cardiac fibrosis
    Snead, Aaron Nathan
    He, Steven
    Insel, Paul A.
    FASEB JOURNAL, 2012, 26
  • [36] Purification of G protein-coupled receptors (GPCRs) using nanodiscs
    Liu, Yuting
    Mitra, Nivedita
    Yan, Elsa C. Y.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 245
  • [37] Classification of G protein-coupled receptors using attention mechanism
    Paki, Reza
    Nourani, Esmaeil
    Farajzadeh, Davoud
    GENE REPORTS, 2020, 21
  • [38] GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble
    Naveed, Muhammad
    Khan, Asif Ullah
    AMINO ACIDS, 2012, 42 (05) : 1809 - 1823
  • [39] G protein-coupled receptors that influence lifespan of human and animal models
    Francisco Alejandro Lagunas-Rangel
    Biogerontology, 2022, 23 : 1 - 19
  • [40] G protein-coupled receptor (GPCR) regulation of cardiac fibrosis
    Snead, Aaron Nathan
    Insel, Paul A.
    FASEB JOURNAL, 2011, 25