Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition

被引:121
|
作者
Shi, J.-Y. [1 ]
Zhang, S.-W. [1 ]
Pan, Q. [1 ]
Cheng, Y.-M. [1 ]
Xie, J. [1 ]
机构
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
关键词
multi-scale energy; Wavelet transform; support vector machines; Chou's pseudo amino acid composition; protein subcellular localizations;
D O I
10.1007/s00726-006-0475-y
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As more and more genomes have been discovered in recent years, there is an urgent need to develop a reliable method to predict the subcellular localization for the explosion of newly found proteins. However, many well-known prediction methods based on amino acid composition have problems utilizing the sequence-order information. Here, based on the concept of Chou's pseudo amino acid composition (PseAA), a new feature extraction method, the multi-scale energy ( MSE) approach, is introduced to incorporate the sequence-order information. First, a protein sequence was mapped to a digital signal using the amino acid index. Then, by wavelet transform, the mapped signal was broken down into several scales in which the energy factors were calculated and further formed into an MSE feature vector. Following this, combining this MSE feature vector with amino acid composition ( AA), we constructed a series of MSEPseAA feature vectors to represent the protein subcellular localization sequences. Finally, according to a new kind of normalization approach, the MSEPseAA feature vectors were normalized to form the improved MSEPseAA vectors, named as IEPseAA. Using the technique of IEPseAA, C-support vector machine (C-SVM) and three multi-class SVMs strategies, quite promising results were obtained, indicating that MSE is quite effective in reflecting the sequence-order effects and might become a useful tool for predicting the other attributes of proteins as well.
引用
收藏
页码:69 / 74
页数:6
相关论文
共 50 条
  • [1] Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition
    J.-Y. Shi
    S.-W. Zhang
    Q. Pan
    Y.-M. Cheng
    J. Xie
    Amino Acids, 2007, 33 : 69 - 74
  • [2] Prediction of Subcellular Localization of Apoptosis Protein Using Chou’s Pseudo Amino Acid Composition
    Hao Lin
    Hao Wang
    Hui Ding
    Ying-Li Chen
    Qian-Zhong Li
    Acta Biotheoretica, 2009, 57 : 321 - 330
  • [3] Prediction of Subcellular Localization of Apoptosis Protein Using Chou's Pseudo Amino Acid Composition
    Lin, Hao
    Wang, Hao
    Ding, Hui
    Chen, Ying-Li
    Li, Qian-Zhong
    ACTA BIOTHEORETICA, 2009, 57 (03) : 321 - 330
  • [4] Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs
    Park, KJ
    Kanehisa, M
    BIOINFORMATICS, 2003, 19 (13) : 1656 - 1663
  • [5] Protein subcellular localization prediction for Gram-negative bacteria using amino acid subalphabets and a combination of multiple support vector machines
    Jiren Wang
    Wing-Kin Sung
    Arun Krishnan
    Kuo-Bin Li
    BMC Bioinformatics, 6
  • [6] Protein subcellular localization prediction for Gram-negative bacteria using amino acid subalphabets and a combination of multiple support vector machines
    Wang, JR
    Sung, WK
    Krishnan, A
    Li, KB
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [7] Multi-class protein subcellular localization classification using support vector machines
    Meng, PW
    Rajapakse, JC
    PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 526 - 533
  • [8] Using functional domain composition and support vector machines for prediction of protein subcellular location
    Chou, KC
    Cai, YD
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (48) : 45765 - 45769
  • [9] Prediction of protein subcellular locations using support vector machines
    Li, NN
    Niu, XH
    Shi, F
    Li, XY
    ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 1047 - 1051
  • [10] Prediction of Protein Subcellular Multi-Localization Based on the General form of Chou's Pseudo Amino Acid Composition
    Li, Li-Qi
    Zhang, Yuan
    Zou, Ling-Yun
    Zhou, Yue
    Zheng, Xiao-Qi
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (04): : 375 - 387