Characterization and identification of lysine crotonylation sites based on machine learning method on both plant and mammalian

被引:0
|
作者
Rulan Wang
Zhuo Wang
Hongfei Wang
Yuxuan Pang
Tzong-Yi Lee
机构
[1] The Chinese University of Hong Kong (Shenzhen),School of Science and Engineering
[2] The Chinese University of Hong Kong (Shenzhen),Warshel Institute for Computational Biology
[3] University of Science and Technology of China,School of Life Sciences
[4] The University of Hong Kong,Department of Orthopaedics and Traumatology
[5] The Chinese University of Hong Kong (Shenzhen),School of Life and Health Sciences
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Lysine crotonylation (Kcr) is a type of protein post-translational modification (PTM), which plays important roles in a variety of cellular regulation and processes. Several methods have been proposed for the identification of crotonylation. However, most of these methods can predict efficiently only on histone or non-histone protein. Therefore, this work aims to give a more balanced performance in different species, here plant (non-histone) and mammalian (histone) are involved. SVM (support vector machine) and RF (random forest) were employed in this study. According to the results of cross-validations, the RF classifier based on EGAAC attribute achieved the best predictive performance which performs competitively good as existed methods, meanwhile more robust when dealing with imbalanced datasets. Moreover, an independent test was carried out, which compared the performance of this study and existed methods based on the same features or the same classifier. The classifiers of SVM and RF could achieve best performances with 92% sensitivity, 88% specificity, 90% accuracy, and an MCC of 0.80 in the mammalian dataset, and 77% sensitivity, 83% specificity, 70% accuracy and 0.54 MCC in a relatively small dataset of mammalian and a large-scaled plant dataset respectively. Moreover, a cross-species independent testing was also carried out in this study, which has proved the species diversity in plant and mammalian.
引用
收藏
相关论文
共 50 条
  • [41] CapsNh-Kcr: Capsule network-based prediction of lysine crotonylation sites in human non-histone proteins
    Khanal, Jhabindra
    Kandel, Jeevan
    Tayara, Hilal
    Chong, Kil To
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 120 - 127
  • [42] Applying Extreme Learning Machine to Plant Species Identification
    Zhai, Chuan-Min
    Du, Ji-Xiang
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 879 - 884
  • [43] LSTMCNNsucc: A Bidirectional LSTM and CNN-Based Deep Learning Method for Predicting Lysine Succinylation Sites
    Huang, Guohua
    Shen, Qingfeng
    Zhang, Guiyang
    Wang, Pan
    Yu, Zu-Guo
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [44] A Machine Learning Technique for Identification of Plant Diseases in Leaves
    Deepa
    Rashmi, N.
    Shetty, Chinmai
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 481 - 484
  • [45] nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning
    Chen, Yong-Zi
    Wang, Zhuo-Zhi
    Wang, Yanan
    Ying, Guoguang
    Chen, Zhen
    Song, Jiangning
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [46] DeepDN_iGlu: prediction of lysine glutarylation sites based on attention residual learning method and DenseNet
    Jia, Jianhua
    Sun, Mingwei
    Wu, Genqiang
    Qiu, Wangren
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (02) : 2815 - 2830
  • [47] Prediction Method for Lysine Acetylation Sites Based on LSTM Network
    Xiu, Qingxiao
    Li, Dancheng
    Li, Hailong
    Wang, Ning
    Ding, Chen
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 179 - 182
  • [48] Identification and Characterization of Propionylation at Histone H3 Lysine 23 in Mammalian Cells
    Liu, Bo
    Lin, Yihui
    Darwanto, Agus
    Song, Xuehui
    Xu, Guoliang
    Zhang, Kangling
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2009, 284 (47) : 32288 - 32295
  • [49] Flow field characterization and evaluation method based on unsupervised machine learning
    Li, Shanshan
    Feng, Qihong
    Zhang, Xianmin
    Liu, Haicheng
    Liu, Lijie
    Huang, Yingsong
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 215
  • [50] Machine learning-based identification and characterization of mast cells in eosinophilic esophagitis
    Zhang, Simin
    Caldwell, Julie M.
    Rochman, Mark
    Collins, Margaret H.
    Rothenberg, Marc E.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2024, 153 (05) : 1381 - 1391.e6