Prediction of human disease-associated phosphorylation sites with combined feature selection approach and support vector machine

被引:11
|
作者
Xu, Xiaoyi [1 ]
Li, Ao [1 ,2 ]
Wang, Minghui [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, AH-230027 Hefei, Peoples R China
[2] Univ Sci & Technol China, Ctr Biomed Engn, AH-230027 Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
proteins; cellular biophysics; diseases; support vector machines; feature selection; filtering theory; medical computing; bioinformatics; forward feature selection process; minimum-redundancy-maximum-relevance filtering process; cellular process; post-translational modification; support vector machine; human disease-associated phosphorylation sites; PROTEIN-PHOSPHORYLATION; PATTERN-RECOGNITION; IDENTIFICATION; SEQUENCE;
D O I
10.1049/iet-syb.2014.0051
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Phosphorylation is a crucial post-translational modification, which regulates almost all cellular processes in life. It has long been recognised that protein phosphorylation has close relationship with diseases, and therefore many researches are undertaken to predict phosphorylation sites for disease treatment and drug design. However, despite the success achieved by these approaches, no method focuses on disease-associated phosphorylation sites prediction. Herein, for the first time the authors propose a novel approach that is specially designed to identify associations between phosphorylation sites and human diseases. To take full advantage of local sequence information, a combined feature selection method-based support vector machine (CFS-SVM) that incorporates minimum-redundancy-maximum-relevance filtering process and forward feature selection process is developed. Performance evaluation shows that CFS-SVM is significantly better than the widely used classifiers including Bayesian decision theory, k nearest neighbour and random forest. With the extremely high specificity of 99%, CFS-SVM can still achieve a high sensitivity. Besides, tests on extra data confirm the effectiveness and general applicability of CFS-SVM approach on a variety of diseases. Finally, the analysis of selected features and corresponding kinases also help the understanding of the potential mechanism of disease-phosphorylation relationships and guide further experimental validations.
引用
收藏
页码:155 / 163
页数:9
相关论文
共 50 条
  • [41] Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods
    Polat, Huseyin
    Mehr, Homay Danaei
    Cetin, Aydin
    JOURNAL OF MEDICAL SYSTEMS, 2017, 41 (04)
  • [42] A feature selection using improved dragonfly algorithm with support vector machine for breast cancer prediction
    Mary, S. Roselin
    Prasad, R. Murali
    Suguna, R.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (05): : 2039 - 2049
  • [43] A feature selection Newton method for support vector machine classification
    Fung, GM
    Mangasarian, OL
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2004, 28 (02) : 185 - 202
  • [44] A novel feature selection method for twin support vector machine
    Bai, Lan
    Wang, Zhen
    Shao, Yuan-Hai
    Deng, Nai-Yang
    KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 1 - 8
  • [45] Sparse Support Vector Machine with Lp Penalty for Feature Selection
    Lan Yao
    Feng Zeng
    Dong-Hui Li
    Zhi-Gang Chen
    Journal of Computer Science and Technology, 2017, 32 : 68 - 77
  • [46] A memetic algorithm with support vector machine for feature selection and classification
    Nekkaa, Messaouda
    Boughaci, Dalila
    MEMETIC COMPUTING, 2015, 7 (01) : 59 - 73
  • [47] Feature Selection for Cancer Classification Based on Support Vector Machine
    Luo, Wei
    Wang, Lipo
    Sun, Jingjing
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 422 - +
  • [48] A Feature Selection Method for Projection Twin Support Vector Machine
    A. Rui Yan
    B. Qiaolin Ye
    C. Liyan Zhang
    D. Ning Ye
    E. Xiangbo Shu
    Neural Processing Letters, 2018, 47 : 21 - 38
  • [49] On domain knowledge and feature selection using a support vector machine
    Barzilay, O
    Brailovsky, VL
    PATTERN RECOGNITION LETTERS, 1999, 20 (05) : 475 - 484
  • [50] A Feature Selection Method for Projection Twin Support Vector Machine
    Yan, A. Rui
    Ye, B. Qiaolin
    Zhang, C. Liyan
    Ye, D. Ning
    Shu, E. Xiangbo
    NEURAL PROCESSING LETTERS, 2018, 47 (01) : 21 - 38