Feature Selection and Granular SVM Classification for Protein Arginine Methylation Identification

Cited by: 2
Authors
Ding, Zejin [1]
Zhang, Yan-Qing [1]
Zheng, Yujun George [2]
Affiliations
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[2] Georgia State Univ, Dept Chem, Atlanta, GA 30303 USA
Keywords
Protein Methylation; Imbalanced Data Mining; Granular Support Vector Machines (GSVM); Methylation Prediction; Feature Selection; PREDICTION
DOI
10.1109/ICSMC.2009.5345973
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Protein methylation was discovered half a century ago but has been studied far less than other modifications. Computational analysis has recently been introduced to discover unknown methylation sites from the few known ones. Predicting possible methylation sites effectively requires a carefully devised classification strategy. In this paper, we first extract informative features from methylated fragments in many protein sequences, including the physicochemical properties, secondary structure information, evolutionary profiles, and solvent accessibility of surrounding residues. An efficient feature selection method, minimum redundancy maximum relevance (mRMR), is then applied to eliminate redundant features while keeping important ones. Because methylated residues are far fewer than non-methylated ones, the collected data is relatively imbalanced. We therefore propose using the granular support vector machine (GSVM), which is specially designed for imbalanced classification problems. A 7-fold cross-validation shows that our strategy achieves prediction accuracy comparable to, or even better than, many current methods. Our method also provides insights into the underlying mechanisms of protein methylation.
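The feature selection step mentioned in the abstract can be illustrated with a minimal sketch of greedy mRMR. The abstract does not say which mRMR variant or discretization the authors used, so this sketch assumes discrete features and the mutual-information difference criterion (relevance to the label minus mean redundancy with already selected features). The function names `mutual_information` and `mrmr` and the toy data are hypothetical, chosen only for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    return sum((c / n) * log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


def mrmr(columns, labels, k):
    """Greedily pick k feature indices: maximize relevance minus mean redundancy."""
    relevance = [mutual_information(col, labels) for col in columns]
    selected = []
    remaining = set(range(len(columns)))

    def score(j):
        if not selected:
            return relevance[j]
        redundancy = sum(mutual_information(columns[j], columns[s])
                         for s in selected) / len(selected)
        return relevance[j] - redundancy

    while remaining and len(selected) < k:
        # sorted() makes tie-breaking deterministic (lowest index wins)
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (assumed, not from the paper): f1 duplicates f0, f2 is weaker but different.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
f0 = [0, 0, 0, 1, 1, 1, 1, 1]
f1 = list(f0)
f2 = [1, 0, 0, 0, 0, 1, 1, 1]
chosen = mrmr([f0, f1, f2], labels, k=2)
print(chosen)
```

In this toy case the duplicate feature is passed over in favor of the weaker but non-redundant one, which is the behavior the abstract describes as eliminating redundant features while keeping important ones. The selected columns would then be passed to the classifier; the GSVM itself is not sketched here.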
Pages: 2979 / +
Page count: 3