A novel approach for protein subcellular location prediction using amino acid exposure

被引:10
|
作者
Mer, Arvind Singh [1 ]
Andrade-Navarro, Miguel A. [1 ]
机构
[1] Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
SOLVENT ACCESSIBILITY; SECONDARY STRUCTURE; LOCALIZATION; SEQUENCE;
D O I
10.1186/1471-2105-14-342
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. Results: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. Conclusions: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A novel approach for protein subcellular location prediction using amino acid exposure
    Arvind Singh Mer
    Miguel A Andrade-Navarro
    BMC Bioinformatics, 14
  • [2] Prediction of protein subcellular location using hydrophobic patterns of amino acid sequence
    Zhang, Tongliang
    Ding, Yongsheng
    Chou, Kuo-Chen
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2006, 30 (05) : 367 - 371
  • [3] Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition
    Chen, Ying-Li
    Li, Qian-Zhong
    JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) : 377 - 381
  • [4] Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition
    Su, Wenxia
    Deng, Shuyi
    Gu, Zhifeng
    Yang, Keli
    Ding, Hui
    Chen, Hui
    Zhang, Zhaoyue
    FRONTIERS IN GENETICS, 2023, 14
  • [5] An Ensemble Classifier for Eukaryotic Protein Subcellular Location Prediction Using Gene Ontology Categories and Amino Acid Hydrophobicity
    Li, Liqi
    Zhang, Yuan
    Zou, Lingyun
    Li, Changqing
    Yu, Bo
    Zheng, Xiaoqi
    Zhou, Yue
    PLOS ONE, 2012, 7 (01):
  • [6] A Novel Ensemble Technique for Protein Subcellular Location Prediction
    Rozza, Alessandro
    Lombardi, Gabriele
    Re, Matteo
    Casiraghi, Elena
    Valentini, Giorgio
    Campadelli, Paola
    ENSEMBLES IN MACHINE LEARNING APPLICATIONS, 2011, 373 : 151 - 167
  • [7] Predicting protein subcellular location using Chou's pseudo amino acid composition and improved hybrid approach
    Li, Feng-Min
    Li, Qian-Zhong
    PROTEIN AND PEPTIDE LETTERS, 2008, 15 (06): : 612 - 616
  • [8] Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution
    Shi, J. -Y.
    Zhang, S. -W.
    Pan, Q.
    Zhou, G. -P.
    AMINO ACIDS, 2008, 35 (02) : 321 - 327
  • [9] Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution
    J.-Y. Shi
    S.-W. Zhang
    Q. Pan
    G.-P. Zhou
    Amino Acids, 2008, 35 : 321 - 327
  • [10] Protein subcellular location prediction based on pseudo amino acid composition and immune genetic algorithm
    Zhang, Tongliang
    Ding, Yongsheng
    Shao, Shihuang
    COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 534 - 542