Multi-class protein subcellular localization classification using support vector machines

Cited by: 0
Authors
Meng, PW [1]
Rajapakse, JC [1]
Affiliations
[1] Temasek Polytech, Sch Engn, Singapore 529757, Singapore
Keywords
protein subcellular localization; multi-class classification; support vector machines; amino acid composition; amino acid side-chain
DOI
Not available
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Prediction of protein subcellular localization from amino acid sequence is an important step towards elucidating the function of a protein. Here, we present an approach for predicting the subcellular localization of eukaryotic proteins from their sequences using support vector machines. In addition to amino acid composition, our approach considers the biochemical characteristics of amino acids and their distribution patterns along the primary sequence of the query protein. As a result, improved predictive accuracy has been achieved on the Reinhardt and Hubbard dataset. For the four subcellular localizations of eukaryotic proteins, the total prediction accuracy obtained using the leave-one-out cross-validation test is 88.88%. To the best of our knowledge, our approach obtained by far the best prediction accuracy for mitochondrial proteins, which are notoriously difficult to predict among eukaryotic proteins. Performance comparisons also showed that our approach outperformed existing protein subcellular localization prediction methods based solely on amino acid composition.
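
As a rough illustration of two ingredients the abstract names, the sketch below computes a 20-dimensional amino acid composition vector and enumerates leave-one-out splits. It is not the authors' implementation: the function names and toy sequences are hypothetical, the side-chain and distribution-pattern features are omitted, and the support vector machine training step is left out entirely.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(sequence):
    """Return the fraction of each standard residue as a 20-element list."""
    seq = sequence.upper()
    # Non-standard symbols (e.g. X, B, Z) are ignored in this sketch.
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def leave_one_out_splits(n_samples):
    """Yield (train_indices, test_index), holding out one sample at a time."""
    for i in range(n_samples):
        train = [j for j in range(n_samples) if j != i]
        yield train, i


# Hypothetical toy sequences, for illustration only (not from the dataset).
toy_sequences = ["MKVLAAGA", "MSTNPKPQ", "MLLAVLYC"]
features = [amino_acid_composition(s) for s in toy_sequences]
splits = list(leave_one_out_splits(len(features)))
```

In a full pipeline, each split would train a classifier on the `train` indices and score the single held-out sample; the total accuracy is then the fraction of held-out samples predicted correctly.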
Pages: 526-533
Page count: 8