Feature Selection and Granular SVM Classification for Protein Arginine Methylation Identification

被引:2
|
作者
Ding, Zejin [1 ]
Zhang, Yan-Qing [1 ]
Zheng, Yujun George [2 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[2] Georgia State Univ, Dept Chem, Atlanta, GA 30303 USA
关键词
Protein Methylation; Imbalanced Data Mining; Granular Support Vector Machines (GSVM); Methylation Prediction; Feature Selction; PREDICTION;
D O I
10.1109/ICSMC.2009.5345973
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Protein methylation modification has been discovered for half a century but still far less been studied than other modifications. Computational analysis is recently introduced to discover other unknown methylation sites based on few known ones. To effectively predict possible methylation, sophisticated classification strategy should be well devised. In this paper, we first extracted informative features from methylated fragments in many protein sequences, including the physicochemical properties, secondary structure information, evolutionary profiles, and solvent accessibility of surrounding residues. Then, an efficient feature selection method (mRMR) is applied to eliminate redundant features but keep important ones. Since methylated residues are far less than non-methylated, the collected data is relatively imbalanced. Thus, we propose to use the granular support vector machine (GSVM) which is specially designed for imbalanced classification problems. A 7-fold cross validation shows that our strategy generates comparable predication accuracy with many current methods or even better. Meanwhile, our method provides insights to identify the underlying mechanisms of protein methylation.
引用
收藏
页码:2979 / +
页数:3
相关论文
共 50 条
  • [31] Feature Selection for Improved Classification of Protein Structures
    Mirceva, G.
    Ivanoska, I.
    Naumoski, A.
    Kulakov, A.
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1013 - 1018
  • [32] A Method for Large-scale Identification of Protein Arginine Methylation
    Uhlmann, Thomas
    Geoghegan, Vincent L.
    Thomas, Benjamin
    Ridlova, Gabriela
    Trudgian, David C.
    Acuto, Oreste
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (11) : 1489 - 1499
  • [33] Feature selection for identification and classification of power quality disturbances
    Chen, S
    [J]. 2005 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS, 1-3, 2005, : 2301 - 2306
  • [34] Tumor CE Image Classification Using SVM-Based Feature Selection
    Li, Baopu
    Meng, Max Q-H
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010,
  • [35] Representative terrn based feature selection method for SVM based document classification
    Kang, YH
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 56 - 61
  • [36] Laplacian SVM based Feature Selection Improves Medical Event Reports Classification
    Fodeh, Samah Jamal
    Miller, Perry
    Brandt, Cynthia
    Benin, Andrea L.
    Lee, Kyle
    Koss, Michele
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 449 - 454
  • [37] An iterative SVM approach to feature selection and classification in high-dimensional datasets
    Liu, Dehua
    Qian, Hui
    Dai, Guang
    Zhang, Zhihua
    [J]. PATTERN RECOGNITION, 2013, 46 (09) : 2531 - 2537
  • [38] User Daily Activity Classification from Accelerometry Using Feature Selection and SVM
    Parera, Jordi
    Angulo, Cecilio
    Rodriguez-Molinero, A.
    Cabestany, Joan
    [J]. BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 1137 - +
  • [39] Information-theoretic approaches to SVM feature selection for metagenome read classification
    Garbarine, Elaine
    DePasquale, Joseph
    Gadia, Vinay
    Polikar, Robi
    Rosen, Gail
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (03) : 199 - 209
  • [40] Multiclass Classification of Cardiac Arrhythmia Using Improved Feature Selection and SVM Invariants
    Mustaqeem, Anam
    Anwar, Syed Muhammad
    Majid, Muahammad
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2018, 2018