Feature Selection and Granular SVM Classification for Protein Arginine Methylation Identification

被引:2
|
作者
Ding, Zejin [1 ]
Zhang, Yan-Qing [1 ]
Zheng, Yujun George [2 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[2] Georgia State Univ, Dept Chem, Atlanta, GA 30303 USA
关键词
Protein Methylation; Imbalanced Data Mining; Granular Support Vector Machines (GSVM); Methylation Prediction; Feature Selction; PREDICTION;
D O I
10.1109/ICSMC.2009.5345973
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Protein methylation modification has been discovered for half a century but still far less been studied than other modifications. Computational analysis is recently introduced to discover other unknown methylation sites based on few known ones. To effectively predict possible methylation, sophisticated classification strategy should be well devised. In this paper, we first extracted informative features from methylated fragments in many protein sequences, including the physicochemical properties, secondary structure information, evolutionary profiles, and solvent accessibility of surrounding residues. Then, an efficient feature selection method (mRMR) is applied to eliminate redundant features but keep important ones. Since methylated residues are far less than non-methylated, the collected data is relatively imbalanced. Thus, we propose to use the granular support vector machine (GSVM) which is specially designed for imbalanced classification problems. A 7-fold cross validation shows that our strategy generates comparable predication accuracy with many current methods or even better. Meanwhile, our method provides insights to identify the underlying mechanisms of protein methylation.
引用
收藏
页码:2979 / +
页数:3
相关论文
共 50 条
  • [11] Beam search for feature selection in automatic SVM defect classification
    Gupta, P
    Doermann, D
    DeMenthon, D
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 212 - 215
  • [12] Genetic Algorithm Assisted by a SVM for Feature Selection in Gait Classification
    Yeoh, TzeWei
    Zapotecas-Martinez, Saul
    Akimoto, Youhei
    Aguirre, Hernan
    Tanaka, Kiyoshi
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2014, : 191 - 195
  • [13] Feature Selection Based on the SVM Weight Vector for Classification of Dementia
    Bron, Esther E.
    Smits, Marion
    Niessen, Wiro J.
    Klein, Stefan
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (05) : 1617 - 1626
  • [14] sEMG feature selection and classification using SVM-RFE
    Tosin, Mauricio C.
    Majolo, Mariano
    Chedid, Raissan
    Cene, Vinicius H.
    Balbinot, Alexandre
    [J]. 2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 390 - 393
  • [15] Neighborhood based sample and feature selection for SVM classification learning
    He, Qiang
    Xie, Zongxia
    Hu, Qinghua
    Wu, Congxin
    [J]. NEUROCOMPUTING, 2011, 74 (10) : 1585 - 1594
  • [16] AdaBoost for Feature Selection, Classification and Its Relation with SVM*, A Review
    Wang, Ruihu
    [J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 800 - 807
  • [17] Feature Selection Based on SVM Significance Maps for Classification of Dementia
    Bron, Esther
    Smits, Marion
    van Swieten, John
    Niessen, Wiro
    Klein, Stefan
    [J]. MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2014), 2014, 8679 : 272 - 279
  • [18] Feature selection and classification improvement of Kinnow using SVM classifier
    Singh, Sukhpreet
    Malik, Kamal
    [J]. Measurement: Sensors, 2022, 24
  • [19] penalizedSVM: a R-package for feature selection SVM classification
    Becker, Natalia
    Werft, Wiebke
    Toedt, Grischa
    Lichter, Peter
    Benner, Axel
    [J]. BIOINFORMATICS, 2009, 25 (13) : 1711 - 1712
  • [20] Feature selection and classification of polarimetric SAR images using SVM
    Wu, Yong-Hui
    Ji, Ke-Feng
    Li, Yu
    Yu, Wen-Xian
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2008, 30 (10): : 2347 - 2351