Detecting Succinylation sites from protein sequences using ensemble support vector machine

被引:34
|
作者
Ning, Qiao [1 ]
Zhao, Xiaosa [1 ]
Bao, Lingling [1 ]
Ma, Zhiqiang [1 ]
Zhao, Xiaowei [2 ]
机构
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun 130117, Jilin, Peoples R China
[2] Northeast Normal Univ, Key Lab Intelligent Informat Proc Jilin Univ, Changchun 130117, Jilin, Peoples R China
来源
BMC BIOINFORMATICS | 2018年 / 19卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Predict succinylation sites; Multiple features; Grey pseudo amino acid composition; Information gain; SVM; Ensemble learning algorithm; AMINO-ACID-COMPOSITION; PREDICTING SUBCELLULAR-LOCALIZATION; S-NITROSYLATION SITES; LYSINE SUCCINYLATION; PSEUDO COMPONENTS; DIFFERENT MODES; GENERAL-FORM; IDENTIFICATION; PSEAAC; TOOL;
D O I
10.1186/s12859-018-2249-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Lysine succinylation is a new kind of post-translational modification which plays a key role in protein conformation regulation and cellular function control. To understand the mechanism of succinylation profoundly, it is necessary to identify succinylation sites in proteins accurately. However, traditional methods, experimental approaches, are labor-intensive and time-consuming. Computational prediction methods have been proposed recent years, and they are popular because of their convenience and high speed. In this study, we developed a new method to predict succinylation sites in protein combining multiple features, including amino acid composition, binary encoding, physicochemical property and grey pseudo amino acid composition, with a feature selection scheme (information gain). And then, it was trained using SVM (Support Vector Machine) and an ensemble learning algorithm. Results: The performance of this method was measured with an accuracy of 89.14% and a MCC (Matthew Correlation Coefficient) of 0.79 using 10-fold cross validation on training dataset and an accuracy of 84.5% and a MCC of 0.2 on independent dataset. Conclusions: The conclusions made from this study can help to understand more of the succinylation mechanism. These results suggest that our method was very promising for predicting succinylation sites.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Detecting Succinylation sites from protein sequences using ensemble support vector machine
    Qiao Ning
    Xiaosa Zhao
    Lingling Bao
    Zhiqiang Ma
    Xiaowei Zhao
    [J]. BMC Bioinformatics, 19
  • [2] Lysine acetylation sites prediction using an ensemble of support vector machine classifiers
    Xu, Yan
    Wang, Xiao-Bo
    Ding, Jun
    Wu, Ling-Yun
    Deng, Nai-Yang
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2010, 264 (01) : 130 - 135
  • [3] Predicting protein–protein interaction sites using modified support vector machine
    Hong Guo
    Bingjing Liu
    Danli Cai
    Tun Lu
    [J]. International Journal of Machine Learning and Cybernetics, 2018, 9 : 393 - 398
  • [4] Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines
    Chen, Wei
    Xing, Pengwei
    Zou, Quan
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [5] Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines
    Wei Chen
    Pengwei Xing
    Quan Zou
    [J]. Scientific Reports, 7
  • [6] Predict prokaryotic proteins through detecting N-formylmethionine residues in protein sequences using support vector machine
    Yang, Zheng Rong
    [J]. BIOSYSTEMS, 2009, 97 (03) : 141 - 145
  • [7] Predicting protein-protein interaction sites using modified support vector machine
    Guo, Hong
    Liu, Bingjing
    Cai, Danli
    Lu, Tun
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 393 - 398
  • [8] Using Support Vector Machine and Evolutionary Profiles to Predict Antifreeze Protein Sequences
    Zhao, Xiaowei
    Ma, Zhiqiang
    Yin, Minghao
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2012, 13 (02) : 2196 - 2207
  • [9] Imbalanced classification using support vector machine ensemble
    Tian, Jiang
    Gu, Hong
    Liu, Wenqi
    [J]. NEURAL COMPUTING & APPLICATIONS, 2011, 20 (02): : 203 - 209
  • [10] Imbalanced classification using support vector machine ensemble
    Jiang Tian
    Hong Gu
    Wenqi Liu
    [J]. Neural Computing and Applications, 2011, 20 : 203 - 209