Accurate prediction of functional effect of single amino acid variants with deep learning

被引:4
|
作者
Derbel, Houssemeddine [1 ]
Zhao, Zhongming [2 ]
Liu, Qian [1 ,3 ]
机构
[1] Univ Nevada, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, McWilliams Sch Biomed Informat, Houston, TX 77030 USA
[3] Univ Nevada, Coll Sci, Sch Life Sci, Las Vegas, NV 89154 USA
基金
美国国家卫生研究院;
关键词
Functional effect; Deep learning; Single amino acid variant; Precise estimation; High-throughput experiments; PROTEIN; LANDSCAPE; SEQUENCE; FITNESS; MUTATIONS; DOMAIN;
D O I
10.1016/j.csbj.2023.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assessment of functional effect of amino acid variants is a critical biological problem in proteomics for clinical medicine and protein engineering. Although natively occurring variants offer insights into deleterious variants, high-throughput deep mutational experiments enable comprehensive investigation of amino acid variants for a given protein. However, these mutational experiments are too expensive to dissect millions of variants on thousands of proteins. Thus, computational approaches have been proposed, but they heavily rely on hand-crafted evolutionary conservation, limiting their accuracy. Recent advancement in transformers provides a promising solution to precisely estimate the functional effects of protein variants on high-throughput experimental data. Here, we introduce a novel deep learning model, namely Rep2Mut-V2, which leverages learned representation from transformer models. Rep2Mut-V2 significantly enhances the prediction accuracy for 27 types of measurements of functional effects of protein variants. In the evaluation of 38 protein datasets with 118,933 single amino acid variants, Rep2Mut-V2 achieved an average Spearman's correlation coefficient of 0.7. This surpasses the performance of six state-of-the-art methods, including the recently released methods ESM, DeepSequence and EVE. Even with limited training data, Rep2Mut-V2 outperforms ESM and DeepSequence, showing its potential to extend high-throughput experimental analysis for more protein variants to reduce experimental cost. In conclusion, Rep2Mut-V2 provides accurate predictions of the functional effects of single amino acid variants of protein coding sequences. This tool can significantly aid in the interpretation of variants in human disease studies.
引用
收藏
页码:5776 / 5784
页数:9
相关论文
共 50 条
  • [31] Effect of sequence padding on the performance of deep learning models in archaeal protein functional prediction
    Lopez-del Rio, Angela
    Martin, Maria
    Perera-Lluna, Alexandre
    Saidi, Rabie
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [32] Deep Learning Approaches for the Prediction of Protein Functional Sites
    Pitarch, Borja
    Pazos, Florencio
    MOLECULES, 2025, 30 (02):
  • [33] Functional Connectivity Prediction With Deep Learning for Graph Transformation
    Etemadyrad, Negar
    Gao, Yuyang
    Li, Qingzhe
    Guo, Xiaojie
    Krueger, Frank
    Lin, Qixiang
    Qiu, Deqiang
    Zhao, Liang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4862 - 4875
  • [34] A Fast Accurate Deep Learning Framework for Prediction of All Cancer Types
    Fadel, Magdy M.
    Elseddeq, Nadia G.
    Arnous, Reham
    Ali, Zainab H.
    Eldesouky, Ali I.
    IEEE Access, 2022, 10 : 122586 - 122600
  • [35] Hybrid deep learning and evolutionary algorithms for accurate cloud workload prediction
    Ali, Tassawar
    Khan, Hikmat Ullah
    Alarfaj, Fawaz Khaled
    Alreshoodi, Mohammed
    COMPUTING, 2024, 106 (12) : 3905 - 3944
  • [36] A Novel Hybrid Deep Learning Method for Accurate Exchange Rate Prediction
    Iqbal, Farhat
    Koutmos, Dimitrios
    Ahmed, Eman A.
    Al-Essa, Lulwah M.
    RISKS, 2024, 12 (09)
  • [37] Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning
    Li, Yiming
    Zeng, Min
    Wu, Yifan
    Li, Yaohang
    Li, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3263 - 3271
  • [38] A Deep Multimodal Representation Learning Framework for Accurate Molecular Properties Prediction
    Yang, Yuxin
    Wang, Zixu
    Ahadian, Pegah
    Jerger, Abby
    Zucker, Jeremy
    Feng, Song
    Cheng, Feixiong
    Guan, Qiang
    PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 760 - 765
  • [39] Technical Study of Deep Learning in Cloud Computing for Accurate Workload Prediction
    Ahamed, Zaakki
    Khemakhem, Maher
    Eassa, Fathy
    Alsolami, Fawaz
    Al-Ghamdi, Abdullah S. Al-Malaise
    ELECTRONICS, 2023, 12 (03)
  • [40] RNAdegformer: accurate prediction of mRNA degradation at nucleotide resolution with deep learning
    He, Shujun
    Gao, Baizhen
    Sabnis, Rushant
    Sun, Qing
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)