Sitetack: a deep learning model that improves PTM prediction by using known PTMs

被引:0
|
作者
Gutierrez, Clair S. [1 ,2 ]
Kassim, Alia A. [1 ]
Gutierrez, Benjamin D. [2 ]
Raines, Ronald T. [1 ,3 ]
机构
[1] MIT, Dept Chem, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Broad Inst MIT & Harvard, Cambridge, MA 02143 USA
[3] MIT, Koch Inst Integrated Canc Res, Cambridge, MA 02139 USA
基金
美国国家卫生研究院;
关键词
POSTTRANSLATIONAL MODIFICATIONS; CD-HIT; PROTEIN;
D O I
10.1093/bioinformatics/btae602
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Post-translational modifications (PTMs) increase the diversity of the proteome and are vital to organismal life and therapeutic strategies. Deep learning has been used to predict PTM locations. Still, limitations in datasets and their analyses compromise success. Results: We evaluated the use of known PTM sites in prediction via sequence-based deep learning algorithms. For each PTM, known locations of that PTM were encoded as a separate amino acid before sequences were encoded via word embedding and passed into a convolutional neural network that predicts the probability of that PTM at a given site. Without labeling known PTMs, our models are on par with others. With labeling, however, we improved significantly upon extant models. Moreover, knowing PTM locations can increase the predictability of a different PTM. Our findings highlight the importance of PTMs for the installation of additional PTMs. We anticipate that including known PTM locations will enhance the performance of other proteomic machine learning algorithms.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] SiteTack: A Deep Learning Model for PTM Site Prediction that Improves Accuracy by Tacking on PTM Information
    Gutierrez, Clair
    Kassim, Alia
    Raines, Ronald
    PROTEIN SCIENCE, 2023, 32 (12)
  • [2] Prediction of Stroke Using Deep Learning Model
    Chantamit-o-pas, Pattanapong
    Goyal, Madhu
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 774 - 781
  • [4] Deep learning model improves COPD risk prediction and gene discovery
    Cosentino, Justin
    Hormozdiari, Farhad
    NATURE GENETICS, 2023, 55 (05) : 738 - 739
  • [5] Prediction Model for Coronavirus Pandemic Using Deep Learning
    Humayun M.
    Alsayat A.
    Computer Systems Science and Engineering, 2021, 40 (03): : 947 - 961
  • [6] Prediction Model for Coronavirus Pandemic Using Deep Learning
    Humayun, Mamoona
    Alsayat, Ahmed
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 40 (03): : 947 - 961
  • [7] Chicken pox prediction using deep learning model
    Lee M.
    Kim J.W.
    Jang B.
    Jang, Beakcheol (bjang@smu.ac.kr), 2020, Korean Institute of Electrical Engineers (69): : 127 - 137
  • [8] An Efficient Deep Learning Model for Disease Prediction Using Cnn
    Sathishkumar, P.
    Swath, R.
    Balaji, K. Yukesh
    Kumar, V. Ruthra
    JOURNAL OF POPULATION THERAPEUTICS AND CLINICAL PHARMACOLOGY, 2023, 30 (08): : E262 - E267
  • [9] A smart model for prediction of viscosity of nanofluids using deep learning
    Changdar, Satyasaran
    Saha, Susmita
    De, Soumen
    SMART SCIENCE, 2020, 8 (04) : 242 - 256
  • [10] Prediction of chemical compounds properties using a deep learning model
    Galushka, Mykola
    Swain, Chris
    Browne, Fiona
    Mulvenna, Maurice D.
    Bond, Raymond
    Gray, Darren
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (20): : 13345 - 13366