Characterization and identification of lysine crotonylation sites based on machine learning method on both plant and mammalian

被引:0
|
作者
Rulan Wang
Zhuo Wang
Hongfei Wang
Yuxuan Pang
Tzong-Yi Lee
机构
[1] The Chinese University of Hong Kong (Shenzhen),School of Science and Engineering
[2] The Chinese University of Hong Kong (Shenzhen),Warshel Institute for Computational Biology
[3] University of Science and Technology of China,School of Life Sciences
[4] The University of Hong Kong,Department of Orthopaedics and Traumatology
[5] The Chinese University of Hong Kong (Shenzhen),School of Life and Health Sciences
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Lysine crotonylation (Kcr) is a type of protein post-translational modification (PTM), which plays important roles in a variety of cellular regulation and processes. Several methods have been proposed for the identification of crotonylation. However, most of these methods can predict efficiently only on histone or non-histone protein. Therefore, this work aims to give a more balanced performance in different species, here plant (non-histone) and mammalian (histone) are involved. SVM (support vector machine) and RF (random forest) were employed in this study. According to the results of cross-validations, the RF classifier based on EGAAC attribute achieved the best predictive performance which performs competitively good as existed methods, meanwhile more robust when dealing with imbalanced datasets. Moreover, an independent test was carried out, which compared the performance of this study and existed methods based on the same features or the same classifier. The classifiers of SVM and RF could achieve best performances with 92% sensitivity, 88% specificity, 90% accuracy, and an MCC of 0.80 in the mammalian dataset, and 77% sensitivity, 83% specificity, 70% accuracy and 0.54 MCC in a relatively small dataset of mammalian and a large-scaled plant dataset respectively. Moreover, a cross-species independent testing was also carried out in this study, which has proved the species diversity in plant and mammalian.
引用
收藏
相关论文
共 50 条
  • [31] A Novel Spatial Feature For Predicting Lysine Malonylation Sites Using Machine Learning
    Liu, Yuan
    Yan, Changhui
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 76 - 79
  • [32] A smartphone based machine learning method for the fine characterization of snoring
    Holzwarth, M.
    Bricout, A.
    Colas, D.
    JOURNAL OF SLEEP RESEARCH, 2018, 27
  • [33] Machine Learning Method for Paraphrase Identification
    Marchenko, Oleksandr
    Anisimov, Anatoly
    Nykonenko, Andrii
    Rossada, Tetiana
    Melnikov, Egor
    FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2017, 2017, 10333 : 164 - 173
  • [34] Identification of the ε-(γ-glutamyl)lysine cross-linking sites in α-lactalbumin polymerized by mammalian and microbial transglutaminases
    Lee, DS
    Matsumoto, S
    Matsumura, Y
    Mori, T
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2002, 50 (25) : 7412 - 7419
  • [35] Machine learning-based identification of general transcriptional predictors for plant disease
    Sia, Jayson
    Zhang, Wei
    Cheng, Mingxi
    Bogdan, Paul
    Cook, David E.
    NEW PHYTOLOGIST, 2024,
  • [36] Lithology Identification Method Based on Machine Learning and Geophysical Well Logging
    Chen, Sisi
    Yu, Hongyan
    Liu, Wenhui
    Wang, Xiaofeng
    Zhang, Dongdong
    Wang, Lei
    JOURNAL OF ENERGY ENGINEERING, 2025, 151 (01)
  • [37] Experimental researches on an UWB NLOS identification method based on machine learning
    Li, Weijie
    Zhang, Tingting
    Zhang, Qinyu
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 473 - 477
  • [38] Identification Method of Power Internet Attack Information Based on Machine Learning
    Niu, Yitong
    Andrei, Korneev
    Iraqi Journal for Computer Science and Mathematics, 2022, 3 (02): : 1 - 8
  • [39] A method based on rules and machine learning for logic form identification in Spanish
    Martinez-Santiago, F.
    Diaz-Galiano, M. C.
    Garcia-Cumbreras, M. A.
    Montejo-Raez, A.
    NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 131 - 153
  • [40] Computational Identification of Lysine Glutarylation Sites Using Positive-Unlabeled Learning
    Ju, Zhe
    Wang, Shi-Yun
    CURRENT GENOMICS, 2020, 21 (03) : 204 - 211