MetODeep: A Deep Learning Approach for Prediction of Methionine Oxidation Sites in Proteins

Cited by: 0
Authors
Lopez-Garcia, Guillermo [1 ]
Jerez, Jose M. [1 ]
Urda, Daniel [2 ]
Veredas, Francisco J. [1 ]
Affiliations
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, Malaga, Spain
[2] Univ Cadiz, Dept Ingn Informat, Cadiz, Spain
Keywords
Deep learning; convolutional neural network; transfer learning; bioinformatics; proteomics application; post-translational modification; methionine oxidation; phosphorylation
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
After being synthesized by ribosomes in the cell, proteins can undergo post-translational modifications (PTMs) that affect their functionality. One of the most studied PTMs is phosphorylation. Mass-spectrometry methods aimed at identifying phosphorylation sites in proteins are arduous and expensive. For these reasons, numerous studies propose the use of machine learning techniques to predict this PTM. Like phosphorylation, methionine oxidation is another important PTM. Recently, we proposed a machine learning approach that extracts a set of features from the primary and tertiary structure of proteins to predict methionine oxidation sites. However, that work had an important limitation that impairs feature extraction: the 3D structure of many proteins is not fully resolved. In this study, we present MetODeep, a deep learning approach to predict methionine oxidation. Unlike phosphorylation, for which datasets with several hundred thousand samples are available to train effective predictive models, methionine oxidation has only small datasets available, which could cause a deep neural network to overfit. The recently evidenced cross-talk between phosphorylation and methionine oxidation has motivated our transfer-learning approach. Thus, starting from a deep convolutional neural network (CNN) pre-trained on phosphorylation data, MetODeep is fine-tuned to predict methionine oxidation. The resulting CNN architecture allows us to omit manual feature extraction, since it accepts raw protein sequences as input data. The final model gives performance results (AUC 0.8267 ± 0.0174) that surpass those of state-of-the-art computational models for the prediction of methionine oxidation.
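To illustrate what "raw protein sequences as input data" could look like in practice, the following is a minimal sketch of one common way to prepare sequence input for a CNN: cut a fixed-length window around each methionine and one-hot encode it. The abstract does not specify the encoding or window size used by MetODeep, so the `flank` value, the padding symbol, and the function names `methionine_windows` and `one_hot` are all assumptions made for illustration.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "-"  # hypothetical padding symbol for positions beyond the sequence ends


def methionine_windows(sequence, flank=5):
    """Return (position, window) pairs, one per methionine (M) in the sequence.

    Each window has length 2 * flank + 1 and is centered on the methionine.
    Positions that fall outside the sequence are filled with PAD.
    The flank size is an assumption, not a value taken from the paper.
    """
    padded = PAD * flank + sequence + PAD * flank
    windows = []
    for i, residue in enumerate(sequence):
        if residue == "M":
            # Index i in the sequence sits at i + flank in the padded string,
            # so the window starting at i is centered on the methionine.
            windows.append((i, padded[i:i + 2 * flank + 1]))
    return windows


def one_hot(window):
    """Encode a window as a list of 20-dimensional one-hot rows.

    Padding and non-standard residues map to an all-zero row.
    """
    rows = []
    for residue in window:
        row = [0] * len(AMINO_ACIDS)
        idx = AMINO_ACIDS.find(residue)
        if idx >= 0:
            row[idx] = 1
        rows.append(row)
    return rows


if __name__ == "__main__":
    for position, window in methionine_windows("MKTAM", flank=2):
        print(position, window, len(one_hot(window)))
```

Each encoded window is a (2 * flank + 1) by 20 matrix that a 1D convolutional network could consume directly. In a transfer-learning setting, the same encoding would be applied to phosphorylation windows (centered on the phosphorylated residue) during pre-training, so that the pre-trained weights remain compatible when fine-tuning on methionine windows.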
Pages: 8