PairProSVM: Protein subcellular localization based on local pairwise profile alignment and SVM

被引:55
|
作者
Mak, Man-Wai [1 ]
Guo, Jian [2 ]
Kung, Sun-Yuan [3 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Hong Kong, Hong Kong, Peoples R China
[2] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[3] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
关键词
protein subcellular localization; sequence alignment; profile alignment; kernel methods; support vector machines;
D O I
10.1109/TCBB.2007.70256
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The subcellular locations of proteins are important functional annotations. An effective and reliable subcellular localization method is necessary for proteomics research. This paper introduces a new method-PairProSVM-to automatically predict the subcellular locations of proteins. The profiles of all protein sequences in the training set are constructed by PSI-BLAST, and the pairwise profile alignment scores are used to form feature vectors for training a support vector machine (SVM) classifier. It was found that PairProSVM outperforms the methods that are based on sequence alignment and amino acid compositions even if most of the homologous sequences have been removed. PairProSVM was evaluated on Huang and Li's and Gardy et al.'s protein data sets. The overall accuracies on these data sets reach 75.3 percent and 91.9 percent, respectively, which are higher than or comparable to those obtained by sequence alignment and composition-based methods.
引用
收藏
页码:416 / 422
页数:7
相关论文
共 50 条
  • [1] PROTEIN SUBCELLULAR LOCALIZATION PREDICTION BASED ON PROFILE ALIGNMENT AND GENE ONTOLOGY
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. 2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [2] Truncation of Protein Sequences for Fast Profile Alignment with Application to Subcellular Localization
    Mak, Man-Wai
    Wang, Wei
    Kung, Sun-Yuan
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 115 - 120
  • [3] Fusion of cleavage site detection and pairwise alignment for fast subcellular localization
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 573 - +
  • [4] Feature Fusion Based SVM Classifier for Protein Subcellular Localization Prediction
    Rahman, Julia
    Mondal, Md. Nazrul Islam
    Ben Islam, Md. Khaled
    Al Mehedi, Md.
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (01) : 288
  • [5] The Research of Algorithm for Protein Subcellular Localization Prediction Based on SVM-RFE
    Liu, Wenhao
    Zhai, Junjun
    Ding, Hongwei
    He, Xinlong
    [J]. 2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [6] GOASVM: PROTEIN SUBCELLULAR LOCALIZATION PREDICTION BASED ON GENE ONTOLOGY ANNOTATION AND SVM
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2229 - 2232
  • [7] Automated Protein Subcellular Localization Based on Local Invariant Features
    Li, Chao
    Wang, Xue-hong
    Zheng, Li
    Huang, Ji-feng
    [J]. PROTEIN JOURNAL, 2013, 32 (03): : 230 - 237
  • [8] Automated Protein Subcellular Localization Based on Local Invariant Features
    Chao Li
    Xue-hong Wang
    Li Zheng
    Ji-feng Huang
    [J]. The Protein Journal, 2013, 32 : 230 - 237
  • [9] Prediction of subcellular localization of proteins using pairwise sequence alignment and support vector machine
    Kim, Jong Kyoung
    Raghava, G. P. S.
    Bang, Sung-Yang
    Choi, Seungjin
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (09) : 996 - 1001
  • [10] pSLIP: SVM based protein subcellular localization prediction using multiple physicochemical properties
    Sarda, D
    Chua, GH
    Li, KB
    Krishnan, A
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)