SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites

被引:17
|
作者
Al-barakati, Hussam J. [1 ]
McConnell, Evan W. [2 ]
Hicks, Leslie M. [2 ]
Poole, Leslie B. [3 ]
Newman, Robert H. [4 ]
Kc, Dukka B. [1 ]
机构
[1] North Carolina A&T State Univ, Dept Computat Sci & Engn, Greensboro, NC 27411 USA
[2] Univ North Carolina Chapel Hill, Dept Chem, Chapel Hill, NC 27599 USA
[3] Wake Forest Univ, Bowman Gray Sch Med, Dept Biochem, Winston Salem, NC 27157 USA
[4] North Carolina A&T State Univ, Dept Biol, Greensboro, NC 27411 USA
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
AMINO-ACID-COMPOSITION; CLEAVABLE LINKER; SULFENIC ACID; RANDOM FOREST; TOOL; CYSTEINES; PROTEINS; THIOLS;
D O I
10.1038/s41598-018-29126-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew's correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
    Hussam J. AL-barakati
    Evan W. McConnell
    Leslie M. Hicks
    Leslie B. Poole
    Robert H. Newman
    Dukka B. KC
    Scientific Reports, 8
  • [2] A support vector machine (SVM) based voltage stability
    Dosano, Rodel D.
    Song, Hwachang
    Lee, Byongjun
    PROCEEDINGS OF THE SEVENTH IASTED INTERNATIONAL CONFERENCE ON POWER AND ENERGY SYSTEMS, 2007, : 265 - +
  • [3] Prediction of S-sulfenylation sites using mRMR feature selection and fuzzy support vector machine algorithm
    Ju, Zhe
    Wang, Shi-Yun
    JOURNAL OF THEORETICAL BIOLOGY, 2018, 457 : 6 - 13
  • [4] 'pDHS-SVM: A prediction method for plant DNase I hypersensitive sites based on support vector machine
    Zhang, Shanxin
    Zhou, Zhiping
    Chen, Xinmeng
    Hu, Yong
    Yang, Lindong
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 426 : 126 - 133
  • [5] β_SVM a new Support Vector Machine kernel
    Hamdani, TM
    Alimi, AM
    ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, PROCEEDINGS, 2003, : 63 - 68
  • [6] Evaluation of waste recycling of fruits based on Support Vector Machine (SVM)
    Farjami, Javad
    Dehyouri, Sahar
    Mohamadi, Mohamad
    COGENT ENVIRONMENTAL SCIENCE, 2020, 6 (01):
  • [7] Fault diagnosis of FDM process based on support vector machine (SVM)
    Hu, Huaqing
    He, Ketai
    Zhong, Tianlin
    Hong, Yili
    RAPID PROTOTYPING JOURNAL, 2020, 26 (02) : 330 - 348
  • [8] Optimization and application of support vector machine based on SVM algorithm parameters
    Yan, Hui-Feng
    Wang, Wei-Feng
    Liu, Jie
    Journal of Digital Information Management, 2013, 11 (02): : 165 - 169
  • [9] Quantitative analysis of multiple components based on support vector machine (SVM)
    Yu, Yinshan
    Shao, Mingzhen
    Jiang, Lingjie
    Ke, Yongbin
    Wei, Dandan
    Zhang, Dongyang
    Jiang, Mingxin
    Yang, Yudong
    OPTIK, 2021, 237
  • [10] Prediction of phosphorylation sites based on granular support vector machine
    Gong Cheng
    Qingfeng Chen
    Ruchang Zhang
    Granular Computing, 2021, 6 : 107 - 117