Accurate prediction of functional effect of single amino acid variants with deep learning

被引:4
|
作者
Derbel, Houssemeddine [1 ]
Zhao, Zhongming [2 ]
Liu, Qian [1 ,3 ]
机构
[1] Univ Nevada, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, McWilliams Sch Biomed Informat, Houston, TX 77030 USA
[3] Univ Nevada, Coll Sci, Sch Life Sci, Las Vegas, NV 89154 USA
基金
美国国家卫生研究院;
关键词
Functional effect; Deep learning; Single amino acid variant; Precise estimation; High-throughput experiments; PROTEIN; LANDSCAPE; SEQUENCE; FITNESS; MUTATIONS; DOMAIN;
D O I
10.1016/j.csbj.2023.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assessment of functional effect of amino acid variants is a critical biological problem in proteomics for clinical medicine and protein engineering. Although natively occurring variants offer insights into deleterious variants, high-throughput deep mutational experiments enable comprehensive investigation of amino acid variants for a given protein. However, these mutational experiments are too expensive to dissect millions of variants on thousands of proteins. Thus, computational approaches have been proposed, but they heavily rely on hand-crafted evolutionary conservation, limiting their accuracy. Recent advancement in transformers provides a promising solution to precisely estimate the functional effects of protein variants on high-throughput experimental data. Here, we introduce a novel deep learning model, namely Rep2Mut-V2, which leverages learned representation from transformer models. Rep2Mut-V2 significantly enhances the prediction accuracy for 27 types of measurements of functional effects of protein variants. In the evaluation of 38 protein datasets with 118,933 single amino acid variants, Rep2Mut-V2 achieved an average Spearman's correlation coefficient of 0.7. This surpasses the performance of six state-of-the-art methods, including the recently released methods ESM, DeepSequence and EVE. Even with limited training data, Rep2Mut-V2 outperforms ESM and DeepSequence, showing its potential to extend high-throughput experimental analysis for more protein variants to reduce experimental cost. In conclusion, Rep2Mut-V2 provides accurate predictions of the functional effects of single amino acid variants of protein coding sequences. This tool can significantly aid in the interpretation of variants in human disease studies.
引用
收藏
页码:5776 / 5784
页数:9
相关论文
共 50 条
  • [21] Deep metric learning for accurate protein secondary structure prediction
    Yang, Wei
    Liu, Yang
    Xiao, Chunjing
    KNOWLEDGE-BASED SYSTEMS, 2022, 242
  • [22] Navigating the amino acid sequence space between functional proteins using a deep learning framework
    Bitard-Feildel T.
    PeerJ Computer Science, 2021, 7 : 1 - 22
  • [23] Navigating the amino acid sequence space between functional proteins using a deep learning framework
    Bitard-Feildel, Tristan
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [24] Prediction of the functional consequences of single amino acid substitution in human cytochrome P450
    Wang, Yufang
    Zhou, Qiang
    Dai, Hao
    Zhang, Tao
    Wei, Dong-Qing
    MOLECULAR SIMULATION, 2012, 38 (14-15) : 1297 - 1307
  • [25] Functional analysis of BRCA regulatory variants with deep learning models
    Troyanskaya, Olga
    CLINICAL CANCER RESEARCH, 2021, 27 (05)
  • [26] Prediction of Functional Effects of Protein Amino Acid Mutations
    Alvarez-Machancoses, Oscar
    Faraggi, Eshel
    de Andres-Galiana, Enrique J.
    Fernandez-Martinez, Juan Luis
    Kloczkowski, Andrzej
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2023, PT II, 2023, 13920 : 59 - 71
  • [27] Accurate Prediction of Stability Changes in Bacteriophage T4 Lysozyme Upon Single Amino Acid Replacements
    Masso, Majid
    Alsheddi, Tariq
    Vaisman, Iosif I.
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 26 - 30
  • [28] iAmideV-Deep: Valine Amidation Site Prediction in Proteins Using Deep Learning and Pseudo Amino Acid Compositions
    Naseer, Sheraz
    Ali, Rao Faizan
    Muneer, Amgad
    Fati, Suliman Mohamed
    SYMMETRY-BASEL, 2021, 13 (04):
  • [29] In silico Prediction of Deleterious Single Amino Acid Polymorphisms from Amino Acid Sequence
    Li, Shuyan
    Xi, Lili
    Li, Jiazhong
    Wang, Chengqi
    Lei, Beilei
    Shen, Yulin
    Liu, Huanxiang
    Yao, Xiaojun
    Li, Biao
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (07) : 1211 - 1216
  • [30] Effect of sequence padding on the performance of deep learning models in archaeal protein functional prediction
    Angela Lopez-del Rio
    Maria Martin
    Alexandre Perera-Lluna
    Rabie Saidi
    Scientific Reports, 10