A new hybrid approach to predict subcellular localization by incorporating protein evolutionary conservation information

被引:1
|
作者
Zhang, ShaoWu [1 ]
Zhang, YunLong [2 ]
Li, JunHui
Yang, HuiFeng [1 ]
Cheng, YongMei [1 ]
Zhou, GuoPing [3 ]
机构
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
[2] First Aeronaut Inst Air Force, Dept Comp Sci, Henan 4640000, Peoples R China
[3] Harvard Med Sch, Dept Biol Chem& Mol Pharmacol, Boston, MA 02115 USA
基金
中国国家自然科学基金;
关键词
D O I
10.1007/978-3-540-74771-0_20
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The rapidly increasing number of sequence entering into the genome databank has created the need for fully automated methods to analyze them. Knowing the cellular location of a protein is a key step towards understanding its function. The development in statistical prediction of protein attributes generally consists of two cores: one is to construct a training dataset and the other is to formulate a predictive algorithm. The latter can be further separated into two subcores: one is how to give a mathematical expression to effectively represent a protein and the other is how to find a powerful algorithm to accurately perform the prediction. Here, an improved evolutionary conservation algorithm was proposed to calculate per residue conservation score. Then, each protein can be represented as a feature vector created with multi-scale energy (MSE). In addition, the protein can be represented as other feature vectors based on amino acid composition (AAC), weighted auto-correlation function and Moment descriptor methods. Finally, a novel hybrid approach was developed by fusing the four kinds of feature classifiers through a product rule system to predict 12 subcellular locations. Compared with existing methods, this new approach provides better predictive performance. High success accuracies were obtained in both jackknife cross-validation test and independent dataset test, suggesting that introducing protein evolutionary information and the concept of fusing multifeatures classifiers are quite promising, and might also hold a great potential as a useful vehicle for the other areas of molecular biology.
引用
收藏
页码:172 / +
页数:4
相关论文
共 50 条
  • [1] A new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology
    Chou, KC
    Cai, YD
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 311 (03) : 743 - 747
  • [2] Using the concept of Chou’s pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies
    Shao-Wu Zhang
    Yun-Long Zhang
    Hui-Fang Yang
    Chun-Hui Zhao
    Quan Pan
    [J]. Amino Acids, 2008, 34 : 565 - 572
  • [3] Using the concept of Chou's pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies
    Zhang, Shao-Wu
    Zhang, Yun-Long
    Yang, Hui-Fang
    Zhao, Chun-Hui
    Pan, Quan
    [J]. AMINO ACIDS, 2008, 34 (04) : 565 - 572
  • [4] Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information
    Biswas, Ashis Kumer
    Noman, Nasimul
    Sikder, Abdur Rahman
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [5] Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information
    Ashis Kumer Biswas
    Nasimul Noman
    Abdur Rahman Sikder
    [J]. BMC Bioinformatics, 11
  • [6] Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution
    Jin, Danyu
    Zhu, Ping
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [7] Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution
    Jin, Danyu
    Zhu, Ping
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [8] Improving prediction of protein subcellular localization using evolutionary information and sequence-order information
    Wang, Minghui
    Li, Ao
    Xie, Dan
    Fan, Zhewen
    Jiang, Zhaohui
    Feng, Huanqing
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4434 - 4436
  • [9] A algorithm for identifying disease genes by incorporating the subcellular localization information into the protein-protein interaction networks
    Tang, Xiwei
    Hu, Xiaohua
    Yang, Xuejun
    Sun, Yuan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 308 - 311
  • [10] Evolutionary conservation of Rab protein localization
    Calero, M
    Janeczko, SD
    Whittaker, GR
    Collins, RN
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2001, 12 : 86A - 86A