MetODeep: A Deep Learning Approach for Prediction of Methionine Oxidation Sites in Proteins

被引:0
|
作者
Lopez-Garcia, Guillermo [1 ]
Jerez, Jose M. [1 ]
Urda, Daniel [2 ]
Veredas, Francisco J. [1 ]
机构
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, Malaga, Spain
[2] Univ Cadiz, Dept Ingn Informat, Cadiz, Spain
关键词
Deep learning; convolutional neural network; transfer-learning; bioinformatics; proteomics application; post-translational modification; methionine oxidation; PHOSPHORYLATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
After being synthesized by ribosomes in the cells, proteins can suffer from post-translational modifications (PTM) that affect their functionality. One of the most studied PTMs is phosphorylation. Mass-spectrometry methods aimed at identifying phosphorylation sites in proteins are arduous and expensive. For these reasons, numerous studies propose the use of machine leaning techniques to predict this PTM. Like phosphorylation, methionine oxidation is another important PTM. Recently, we have proposed a machine learning approach that extracts a set of features from the primary and tertiary structure of the proteins to predict methionine oxidation sites. However, this work had an important limitation that impairs feature extraction, since the 3D structure of many proteins is not fully resolved. In this study, we present MetODeep, a deep learning approach to predict methionine oxidation. Unlike phosphorylation, for which datasets with several hundred thousands samples are available to train effective predictive models, methionine oxidation counts on small datasets, which could lead a deep neural network to experiment over-fitting issues. The recently evidenced existence of a cross-talk between phosphorylation and methionine oxidation, has motivated our transfer-learning approach. Thus, on the basis of a deep convolutional neural network (CNN) pre-trained with phosphorylation data, MetODeep is fine-tuned to predict methionine oxidation. The resulting CNN architecture allows us to omit manual feature extraction, since it accepts raw protein sequences as input data. The final model gives performance results (AUC 0.8267 +/- 0.0174) that surpass state-of-art of computational models for the prediction of methionine oxidation.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
    Thapa, Niraj
    Chaudhari, Meenal
    Iannetta, Anthony A.
    White, Clarence
    Roy, Kaushik
    Newman, Robert H.
    Hicks, Leslie M.
    Kc, Dukka B.
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [12] Prediction of transport proteins from sequence information with the deep learning approach
    Wang, Qian
    Xu, Teng
    Xu, Kai
    Lu, Zhongqiu
    Ying, Jianchao
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 160
  • [13] Prediction of Transcription Factor Binding Sites Using a Combined Deep Learning Approach
    Cao, Linan
    Liu, Pei
    Chen, Jialong
    Deng, Lei
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12
  • [14] Improved sequence-based prediction of interaction sites in α-helical transmembrane proteins by deep learning
    Sun, Jianfeng
    Frishman, Dmitrij
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 1512 - 1530
  • [15] Methionine oxidation and reduction in proteins
    Kim, Geumsoo
    Weiss, Stephen J.
    Levine, Rodney L.
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2014, 1840 (02): : 901 - 905
  • [16] Deep learning in prediction of intrinsic disorder in proteins
    Zhao, Bi
    Kurgan, Lukasz
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 1286 - 1294
  • [17] Rainfall Prediction: A Deep Learning Approach
    Hernandez, Emilcy
    Sanchez-Anguix, Victor
    Julian, Vicente
    Palanca, Javier
    Duque, Nestor
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, 2016, 9648 : 151 - 162
  • [18] CNNPSP: Pseudouridine sites prediction based on deep learning
    Fan, Yongxian
    Li, Yongzhen
    Yang, Huihua
    Pan, Xiaoyong
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, 11871 LNCS : 291 - 301
  • [19] DeepPhos: prediction of protein phosphorylation sites with deep learning
    Luo, Fenglin
    Wang, Minghui
    Liu, Yu
    Zhao, Xing-Ming
    Li, Ao
    [J]. BIOINFORMATICS, 2019, 35 (16) : 2766 - 2773
  • [20] CNNPSP: Pseudouridine Sites Prediction Based on Deep Learning
    Fan, Yongxian
    Li, Yongzhen
    Yang, Huihua
    Pan, Xiaoyong
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 : 291 - 301