Accurate prediction of functional effect of single amino acid variants with deep learning

被引:4
|
作者
Derbel, Houssemeddine [1 ]
Zhao, Zhongming [2 ]
Liu, Qian [1 ,3 ]
机构
[1] Univ Nevada, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, McWilliams Sch Biomed Informat, Houston, TX 77030 USA
[3] Univ Nevada, Coll Sci, Sch Life Sci, Las Vegas, NV 89154 USA
基金
美国国家卫生研究院;
关键词
Functional effect; Deep learning; Single amino acid variant; Precise estimation; High-throughput experiments; PROTEIN; LANDSCAPE; SEQUENCE; FITNESS; MUTATIONS; DOMAIN;
D O I
10.1016/j.csbj.2023.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assessment of functional effect of amino acid variants is a critical biological problem in proteomics for clinical medicine and protein engineering. Although natively occurring variants offer insights into deleterious variants, high-throughput deep mutational experiments enable comprehensive investigation of amino acid variants for a given protein. However, these mutational experiments are too expensive to dissect millions of variants on thousands of proteins. Thus, computational approaches have been proposed, but they heavily rely on hand-crafted evolutionary conservation, limiting their accuracy. Recent advancement in transformers provides a promising solution to precisely estimate the functional effects of protein variants on high-throughput experimental data. Here, we introduce a novel deep learning model, namely Rep2Mut-V2, which leverages learned representation from transformer models. Rep2Mut-V2 significantly enhances the prediction accuracy for 27 types of measurements of functional effects of protein variants. In the evaluation of 38 protein datasets with 118,933 single amino acid variants, Rep2Mut-V2 achieved an average Spearman's correlation coefficient of 0.7. This surpasses the performance of six state-of-the-art methods, including the recently released methods ESM, DeepSequence and EVE. Even with limited training data, Rep2Mut-V2 outperforms ESM and DeepSequence, showing its potential to extend high-throughput experimental analysis for more protein variants to reduce experimental cost. In conclusion, Rep2Mut-V2 provides accurate predictions of the functional effects of single amino acid variants of protein coding sequences. This tool can significantly aid in the interpretation of variants in human disease studies.
引用
收藏
页码:5776 / 5784
页数:9
相关论文
共 50 条
  • [1] Accurate Prediction of Transcriptional Activity of Single Missense Variants in HIV Tat with Deep Learning
    Derbel, Houssemeddine
    Giacoletto, Christopher J. J.
    Benjamin, Ronald
    Chen, Gordon R.
    Schiller, Martin R. R.
    Liu, Qian
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (07)
  • [2] Accurate prediction of somatic variants using deep learning model.
    Zhang, Peng
    Wang, Kai
    Yao, Ming
    Wang, Aodi
    Chen, Lijuan
    Liu, Angen
    Shi, Xiaoliang
    Zhang, Shiyue
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (15)
  • [3] Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies
    Wu, Tzu-Hsuan
    Lin, Peng-Chan
    Chou, Hsin-Hung
    Shen, Meng-Ru
    Hsieh, Sun-Yuan
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 606 - 615
  • [4] SAAVpedia: Identification, Functional Annotation, and Retrieval of Single Amino Acid Variants for Proteogenomic Interpretation
    Lee, Soo Youn
    Hwang, Heeyoun
    Kang, Young-Mook
    Kim, Hyejin
    Kim, Dong Geun
    Jeong, Ji Eun
    Kim, Jin Young
    Yoo, Jong Shin
    JOURNAL OF PROTEOME RESEARCH, 2019, 18 (12) : 4133 - 4142
  • [5] DLPacker: Deep learning for prediction of amino acid side chain conformations in proteins
    Misiura, Mikita
    Shroff, Raghav
    Thyer, Ross
    Kolomeisky, Anatoly B.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2022, 90 (06) : 1278 - 1290
  • [6] FunSAV: Predicting the Functional Effect of Single Amino Acid Variants Using a Two-Stage Random Forest Model
    Wang, Mingjun
    Zhao, Xing-Ming
    Takemoto, Kazuhiro
    Xu, Haisong
    Li, Yuan
    Akutsu, Tatsuya
    Song, Jiangning
    PLOS ONE, 2012, 7 (08):
  • [7] The Impact of Amino Acid Encoding on the Prediction of Antigenic Variants
    Forghani, Majid
    Khachay, Michael
    AlyanNezhadi, Mohammad M.
    2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,
  • [8] Deep Learning for the Accurate Prediction of Triggered Drug Delivery
    Husseini, Ghaleb A.
    Sabouni, Rana
    Puzyrev, Vladimir
    Ghommem, Mehdi
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2025, 24 (01) : 102 - 112
  • [9] Robust Deep Learning for Accurate Landslide Identification and Prediction
    Bhuvaneswari, T.
    Sekar, R. Chandra Guru
    Selvi, M. Chengathir
    Rubavathi, J. Jemima
    Kaviyaa, V.
    DOKLADY EARTH SCIENCES, 2024, 518 (02) : 1700 - 1708
  • [10] MCNN-AAPT: accurate classification and functional prediction of amino acid and peptide transporters in secondary active transporters using protein language models and multi-window deep learning
    Malik, Muhammad Shahid
    Le, Van The
    Shah, Syed Muazzam Ali
    Ou, Yu-Yen
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024,