SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites

被引:18
|
作者
Al-barakati, Hussam J. [1 ]
McConnell, Evan W. [2 ]
Hicks, Leslie M. [2 ]
Poole, Leslie B. [3 ]
Newman, Robert H. [4 ]
Kc, Dukka B. [1 ]
机构
[1] North Carolina A&T State Univ, Dept Computat Sci & Engn, Greensboro, NC 27411 USA
[2] Univ North Carolina Chapel Hill, Dept Chem, Chapel Hill, NC 27599 USA
[3] Wake Forest Univ, Bowman Gray Sch Med, Dept Biochem, Winston Salem, NC 27157 USA
[4] North Carolina A&T State Univ, Dept Biol, Greensboro, NC 27411 USA
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
AMINO-ACID-COMPOSITION; CLEAVABLE LINKER; SULFENIC ACID; RANDOM FOREST; TOOL; CYSTEINES; PROTEINS; THIOLS;
D O I
10.1038/s41598-018-29126-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew's correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Support Vector Machine (SVM) based prestack AVO inversion and its applications
    Li, Guangcai
    You, Jiachun
    Liu, Xuewei
    JOURNAL OF APPLIED GEOPHYSICS, 2015, 120 : 60 - 68
  • [22] Private colleges teachers evaluation system based on support vector machine (SVM)
    Liu, Xiao-Yan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1918 - 1921
  • [23] Support Vector Machine (SVM) based compression artifact-reduction technique
    Biswas, Mainak
    Kumar, Sanjeev
    Nguyen, T. Q.
    Balram, Nikhil
    JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2007, 15 (08) : 625 - 634
  • [24] Support vector machine (SVM) based liver classification: fibrosis, steatosis, and inflammation
    Baek, Jihye
    Swanson, Terri A.
    Tuthill, Theresa
    Parker, Kevin J.
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2020,
  • [25] A fuzzy regression based support vector machine (SVM) approach to fuzzy classification
    Chen, Yu
    Pedrycz, Witold
    Watada, Junzo
    ICIC Express Letters, 2010, 4 (6 B): : 2355 - 2362
  • [26] Performance evaluation of support vector machine (SVM)-based predictors in genomic selection
    Kasnavi, Seyed Amir
    Afshar, Mahdi Amin
    Shariati, Mohammad Mahdi
    Kashan, Nasser Emam Jomeh
    Honarvar, Mahmood
    INDIAN JOURNAL OF ANIMAL SCIENCES, 2017, 87 (10): : 1226 - 1231
  • [27] Analysis of dengue infection based on Raman spectroscopy and support vector machine (SVM)
    Khan, Saranjam
    Ullah, Rahat
    Khan, Asifullah
    Wahab, Noorul
    Bilal, Muhammad
    Ahmed, Mushtaq
    BIOMEDICAL OPTICS EXPRESS, 2016, 7 (06): : 2249 - 2256
  • [28] If-SVM: Iterative factoring support vector machine
    Yuqing Pan
    Wenpeng Zhai
    Wei Gao
    Xiangjun Shen
    Multimedia Tools and Applications, 2020, 79 : 25441 - 25461
  • [29] Parallel randomized sampling for support vector machine (SVM) and support vector regression (SVR)
    Yumao Lu
    Vwani Roychowdhury
    Knowledge and Information Systems, 2008, 14 : 233 - 247
  • [30] Support Vector Machine (SVM) Based Sybil Attack Detection in Vehicular Networks
    Gu, Pengwenlong
    Khatoun, Rida
    Begriche, Youcef
    Serhrouchni, Ahmed
    2017 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2017,