A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites

被引:8
|
作者
Thapa, Niraj [1 ]
Chaudhari, Meenal [1 ]
Iannetta, Anthony A. [2 ]
White, Clarence [1 ]
Roy, Kaushik [3 ]
Newman, Robert H. [4 ]
Hicks, Leslie M. [2 ]
Kc, Dukka B. [5 ]
机构
[1] North Carolina A&T State Univ, Dept Computat Data Sci & Engn, Greensboro, NC USA
[2] Univ N Carolina, Dept Chem, Chapel Hill, NC 27515 USA
[3] North Carolina A&T State Univ, Dept Comp Sci, Greensboro, NC USA
[4] North Carolina A&T State Univ, Dept Biol, Greensboro, NC USA
[5] Wichita State Univ, Elect Engn & Comp Sci Dept, Wichita, KS 67260 USA
基金
美国国家科学基金会;
关键词
RIBOSOMAL-PROTEIN S6; PHOSPHOPROTEOME; METHYLATION; FLAGELLA; REVEALS;
D O I
10.1038/s41598-021-91840-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein phosphorylation, which is one of the most important post-translational modifications (PTMs), is involved in regulating myriad cellular processes. Herein, we present a novel deep learning based approach for organism-specific protein phosphorylation site prediction in Chlamydomonas reinhardtii, a model algal phototroph. An ensemble model combining convolutional neural networks and long short-term memory (LSTM) achieves the best performance in predicting phosphorylation sites in C. reinhardtii. Deemed Chlamy-EnPhosSite, the measured best AUC and MCC are 0.90 and 0.64 respectively for a combined dataset of serine (S) and threonine (T) in independent testing higher than those measures for other predictors. When applied to the entire C. reinhardtii proteome (totaling 1,809,304 S and T sites), Chlamy-EnPhosSite yielded 499,411 phosphorylated sites with a cut-off value of 0.5 and 237,949 phosphorylated sites with a cut-off value of 0.7. These predictions were compared to an experimental dataset of phosphosites identified by liquid chromatography-tandem mass spectrometry (LC-MS/MS) in a blinded study and approximately 89.69% of 2,663 C. reinhardtii S and T phosphorylation sites were successfully predicted by Chlamy-EnPhosSite at a probability cut-off of 0.5 and 76.83% of sites were successfully identified at a more stringent 0.7 cut-off. Interestingly, Chlamy-EnPhosSite also successfully predicted experimentally confirmed phosphorylation sites in a protein sequence (e.g., RPS6 S245) which did not appear in the training dataset, highlighting prediction accuracy and the power of leveraging predictions to identify biologically relevant PTM sites. These results demonstrate that our method represents a robust and complementary technique for high-throughput phosphorylation site prediction in C. reinhardtii. It has potential to serve as a useful tool to the community. Chlamy-EnPhosSite will contribute to the understanding of how protein phosphorylation influences various biological processes in this important model microalga.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
    Niraj Thapa
    Meenal Chaudhari
    Anthony A. Iannetta
    Clarence White
    Kaushik Roy
    Robert H. Newman
    Leslie M. Hicks
    Dukka B. KC
    Scientific Reports, 11
  • [2] In silico prediction of mRNA poly(A) sites in Chlamydomonas reinhardtii
    Wu, Xiaohui
    Ji, Guoli
    Zeng, Yong
    MOLECULAR GENETICS AND GENOMICS, 2012, 287 (11-12) : 895 - 907
  • [3] In silico prediction of mRNA poly(A) sites in Chlamydomonas reinhardtii
    Xiaohui Wu
    Guoli Ji
    Yong Zeng
    Molecular Genetics and Genomics, 2012, 287 : 895 - 907
  • [4] DeepPhos: prediction of protein phosphorylation sites with deep learning
    Luo, Fenglin
    Wang, Minghui
    Liu, Yu
    Zhao, Xing-Ming
    Li, Ao
    BIOINFORMATICS, 2019, 35 (16) : 2766 - 2773
  • [5] Predicting Protein Phosphorylation Sites Based on Deep Learning
    Long, Haixia
    Sun, Zhao
    Li, Manzhi
    Fu, Hai Yan
    Lin, Ming Cai
    CURRENT BIOINFORMATICS, 2020, 15 (04) : 300 - 308
  • [6] PHOSPHORYLATION OF AXONEMAL PROTEINS IN CHLAMYDOMONAS-REINHARDTII
    PIPERNO, G
    LUCK, DJ
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1976, 251 (07) : 2161 - 2167
  • [7] DeepRMethylSite: a deep learning based approach for prediction of arginine methylation sites in proteins
    Chaudhari, Meenal
    Thapa, Niraj
    Roy, Kaushik
    Newman, Robert H.
    Saigo, Hiroto
    Dukka, B. K. C.
    MOLECULAR OMICS, 2020, 16 (05) : 448 - 454
  • [8] Active Learning for the Prediction of Phosphorylation Sites
    Jiang, Jun
    Ip, Horace H. S.
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3158 - +
  • [9] DeepNphos: A deep-learning architecture for prediction of N-phosphorylation sites
    Chang, Xulin
    Zhu, Yafei
    Chen, Yu
    Li, Lei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [10] CNNPSP: Pseudouridine sites prediction based on deep learning
    Fan, Yongxian
    Li, Yongzhen
    Yang, Huihua
    Pan, Xiaoyong
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, 11871 LNCS : 291 - 301