Accurate prediction of functional effect of single amino acid variants with deep learning

被引:4
|
作者
Derbel, Houssemeddine [1 ]
Zhao, Zhongming [2 ]
Liu, Qian [1 ,3 ]
机构
[1] Univ Nevada, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, McWilliams Sch Biomed Informat, Houston, TX 77030 USA
[3] Univ Nevada, Coll Sci, Sch Life Sci, Las Vegas, NV 89154 USA
基金
美国国家卫生研究院;
关键词
Functional effect; Deep learning; Single amino acid variant; Precise estimation; High-throughput experiments; PROTEIN; LANDSCAPE; SEQUENCE; FITNESS; MUTATIONS; DOMAIN;
D O I
10.1016/j.csbj.2023.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assessment of functional effect of amino acid variants is a critical biological problem in proteomics for clinical medicine and protein engineering. Although natively occurring variants offer insights into deleterious variants, high-throughput deep mutational experiments enable comprehensive investigation of amino acid variants for a given protein. However, these mutational experiments are too expensive to dissect millions of variants on thousands of proteins. Thus, computational approaches have been proposed, but they heavily rely on hand-crafted evolutionary conservation, limiting their accuracy. Recent advancement in transformers provides a promising solution to precisely estimate the functional effects of protein variants on high-throughput experimental data. Here, we introduce a novel deep learning model, namely Rep2Mut-V2, which leverages learned representation from transformer models. Rep2Mut-V2 significantly enhances the prediction accuracy for 27 types of measurements of functional effects of protein variants. In the evaluation of 38 protein datasets with 118,933 single amino acid variants, Rep2Mut-V2 achieved an average Spearman's correlation coefficient of 0.7. This surpasses the performance of six state-of-the-art methods, including the recently released methods ESM, DeepSequence and EVE. Even with limited training data, Rep2Mut-V2 outperforms ESM and DeepSequence, showing its potential to extend high-throughput experimental analysis for more protein variants to reduce experimental cost. In conclusion, Rep2Mut-V2 provides accurate predictions of the functional effects of single amino acid variants of protein coding sequences. This tool can significantly aid in the interpretation of variants in human disease studies.
引用
收藏
页码:5776 / 5784
页数:9
相关论文
共 50 条
  • [41] UFold: fast and accurate RNA secondary structure prediction with deep learning
    Fu, Laiyi
    Cao, Yingxin
    Wu, Jie
    Peng, Qinke
    Nie, Qing
    Xie, Xiaohui
    NUCLEIC ACIDS RESEARCH, 2022, 50 (03) : E14
  • [42] Deep Learning Enables Accurate Prediction of Interplay Between lncRNA and Disease
    Hu, Jialu
    Gao, Yiqun
    Li, Jing
    Shang, Xuequn
    FRONTIERS IN GENETICS, 2019, 10
  • [43] Investigation and Highly Accurate Prediction of Missed Tryptic Cleavages by Deep Learning
    Sun, Bo
    Smialowski, Pawel
    Straub, Tobias
    Imhof, Axel
    JOURNAL OF PROTEOME RESEARCH, 2021, 20 (07) : 3749 - 3757
  • [44] Accurate solar power prediction with advanced hybrid deep learning approach
    Song, Dongran
    Rehman, Muhammad Shams Ur
    Deng, Xiaofei
    Xiao, Zhao
    Noor, Javeria
    Yang, Jian
    Dong, Mi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [45] Accurate Prediction of Required Virtual Resources via Deep Reinforcement Learning
    Huang, Haojun
    Li, Zhaoxi
    Tian, Jialin
    Min, Geyong
    Miao, Wang
    Wu, Dapeng Oliver
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (02) : 920 - 933
  • [46] Optimising Deep Learning at the Edge for Accurate Hourly Air Quality Prediction
    Wardana, I. Nyoman Kusuma
    Gardner, Julian W.
    Fahmy, Suhaib A.
    SENSORS, 2021, 21 (04) : 1 - 28
  • [47] A Fast Accurate Deep Learning Framework for Prediction of All Cancer Types
    Fadel, Magdy M.
    Elseddeq, Nadia G.
    Arnous, Reham
    Ali, Zainab H.
    Eldesouky, Ali I.
    IEEE ACCESS, 2022, 10 : 122586 - 122600
  • [48] Accurate Prediction of Network Distance via Federated Deep Reinforcement Learning
    Huang, Haojun
    Cai, Yiming
    Min, Geyong
    Wang, Haozhe
    Liu, Gaoyang
    Wu, Dapeng Oliver
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (04) : 3301 - 3314
  • [49] Comparing machine learning and deep learning regression frameworks for accurate prediction of dielectrophoretic force
    Ajala, Sunday
    Jalajamony, Harikrishnan Muraleedharan
    Nair, Midhun
    Marimuthu, Pradeep
    Fernandez, Renny Edwin
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [50] Comparing machine learning and deep learning regression frameworks for accurate prediction of dielectrophoretic force
    Sunday Ajala
    Harikrishnan Muraleedharan Jalajamony
    Midhun Nair
    Pradeep Marimuthu
    Renny Edwin Fernandez
    Scientific Reports, 12