Identification of Protein Lysine Crotonylation Sites by a Deep Learning Framework with Convolutional Neural Networks

被引:0
|
作者
Zhao Y. [1 ]
He N. [2 ]
Chen Z. [2 ]
Li L. [1 ,2 ,3 ]
机构
[1] School of Data Science and Software Engineering, Qingdao University, Qingdao
[2] School of Basic Medicine, Qingdao University, Qingdao
[3] Qingdao Cancer Institute, Qingdao University, Qingdao
来源
Chen, Zhen (zhenchen@qdu.edu.cn) | 1600年 / Institute of Electrical and Electronics Engineers Inc., United States卷 / 08期
基金
中国国家自然科学基金;
关键词
convolutional neural network; Feature extraction; lysine crotonylation; word embedding;
D O I
10.1109/aCCESS.2020.2966592
中图分类号
学科分类号
摘要
Protein lysine crotonylation (Kcr) is an important type of post-translational modification that regulates various activities. The experimental approaches to identify the Kcr sites are time-consuming and it is necessary to develop computational prediction approaches. Previously, a few classifiers were based on over 100 Kcr sites from histone proteins. Recently, thousands of Kcr sites have been experimentally verified on non-histone proteins from the plant species Papaya. We found that the previous classifiers fail to identify non-histone Kcr sites. Therefore, it is necessary to develop classifiers for non-histone proteins. accordingly, we constructed 11 different classifiers to recognize non-histone Kcr sites by combining different features and algorithms (such as random forest and convolutional neural network (CNN)). They were compared using both ten-fold cross validation and independent test dataset. The classifier based on CNN and the word embedding approach, dubbed as pKcr, performed better than other classifiers. pKcr obtained aUC value of 0.855 and 0.853 for ten-fold cross-validation and independent data test, respectively. No statistical difference of its performances on these two tests indicates that pKcr does not overfit. In the pKcr framework, a peptide is cleaved into biological characters followed by transformation into digital vectors. These vectors are input into the CNN with participation of multiple convolution kernels to automatically extract various features and pooling layers to perform feature selection. The superior performance of pKcr suggests that this algorithm is well suited for the Kcr prediction and may be applied broadly to predicting other types of PTM sites. pKcr can be available at http://www.bioinfogo.org/pkcr. © 2013 IEEE.
引用
收藏
页码:14244 / 14252
页数:8
相关论文
共 50 条
  • [1] y Identification of Protein Lysine Crotonylation Sites by a Deep Learning Framework With Convolutional Neural Networks
    Zhao, Yiming
    He, Ningning
    Chen, Zhen
    Li, Lei
    IEEE ACCESS, 2020, 8 : 14244 - 14252
  • [2] Improved Prediction Model of Protein Lysine Crotonylation Sites Using Bidirectional Recurrent Neural Networks
    Tng, Sian Soo
    Le, Nguyen Quoc Khanh
    Yeh, Hui-Yuan
    Chua, Matthew Chin Heng
    JOURNAL OF PROTEOME RESEARCH, 2022, 21 (01) : 265 - 273
  • [3] Deep Learning Convolutional Neural Networks for Radio Identification
    Riyaz, Shamnaz
    Sankhe, Kunal
    Ioannidis, Stratis
    Chowdhury, Kaushik
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (09) : 146 - 152
  • [4] DeepGly: A Deep Learning Framework With Recurrent and Convolutional Neural Networks to Identify Protein Glycation Sites From Imbalanced Data
    Chen, Jingui
    Yang, Runtao
    Zhang, Chengjin
    Zhang, Lina
    Zhang, Qian
    IEEE ACCESS, 2019, 7 : 142368 - 142378
  • [5] Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method
    Lv, Hao
    Dao, Fu-Ying
    Guan, Zheng-Xing
    Yang, Hui
    Li, Yan-Wen
    Lin, Hao
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [6] Adapt-Kcr: a novel deep learning framework for accurate prediction of lysine crotonylation sites based on learning embedding features and attention architecture
    Li, Zutan
    Fang, Jingya
    Wang, Shining
    Zhang, Liangyun
    Chen, Yuanyuan
    Pian, Cong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [7] Characterization and identification of lysine crotonylation sites based on machine learning method on both plant and mammalian
    Wang, Rulan
    Wang, Zhuo
    Wang, Hongfei
    Pang, Yuxuan
    Lee, Tzong-Yi
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [8] Characterization and identification of lysine crotonylation sites based on machine learning method on both plant and mammalian
    Rulan Wang
    Zhuo Wang
    Hongfei Wang
    Yuxuan Pang
    Tzong-Yi Lee
    Scientific Reports, 10
  • [9] Deep learning networks for olive cultivar identification: A comprehensive analysis of convolutional neural networks
    Mendes, Joao
    Lima, Jose
    Costa, Lino
    Rodrigues, Nuno
    Pereira, Ana I.
    SMART AGRICULTURAL TECHNOLOGY, 2024, 8
  • [10] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks
    Huu-Thiet Nguyen
    Li, Sitan
    Cheah, Chien Chern
    IEEE ACCESS, 2022, 10 : 14270 - 14287