Prediction protein structural classes with pseudo-amino acid composition: Approximate entropy and hydrophobicity pattern

被引:158
|
作者
Zhang, Tong-Liang [1 ]
Ding, Yong-Sheng [1 ]
Chou, Kuo-Chen [2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
[2] Gordon Life Sci Inst, San Diego, CA 92130 USA
基金
高等学校博士学科点专项科研基金;
关键词
protein structure classes; pseudo-amino acid composition; approximate entropy; hydrophobicity pattern; fuzzy KNN classifier;
D O I
10.1016/j.jtbi.2007.09.014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Compared with the conventional amino acid (AA) composition, the pseudo-amino acid (PseAA) composition as originally introduced for protein subcellular location prediction can incorporate much more information of a protein sequence, so as to remarkably enhance the power of using a discrete model to predict various attributes of a protein. In this study, based on the concept of PseAA composition, the approximate entropy and hydrophobicity pattern of a protein sequence are used to characterize the PseAA components. Also, the immune genetic algorithm (IGA) is applied to search the optimal weight factors in generating the PseAA composition. Thus, for a given protein sequence sample, a 27-D (dimensional), PseAA composition is generated as its descriptor. The fuzzy K nearest neighbors (FKNN) classifier is adopted as the prediction engine. The results thus obtained in predicting protein structural classification are quite encouraging, indicating that the current approach may also be used to improve the prediction quality of other protein attributes, or at least can play a complimentary role to the existing methods in the relevant areas. Our algorithm is written in Matlab that is available by contacting the corresponding author. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:186 / 193
页数:8
相关论文
共 50 条
  • [41] Predicting subcellular localization of proteins by hybridizing functional domain composition and pseudo-amino acid composition
    Chou, KC
    Cai, YD
    JOURNAL OF CELLULAR BIOCHEMISTRY, 2004, 91 (06) : 1197 - 1203
  • [42] Some remarks on protein attribute prediction and pseudo amino acid composition
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) : 236 - 247
  • [43] Weighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition
    Wang, M
    Yang, J
    Liu, GP
    Xu, ZJ
    Chou, KC
    PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (06): : 509 - 516
  • [44] Prediction of protein structural classes based on correlations of amino acid residues
    Wang, SQ
    Liu, H
    Du, QS
    Wei, DQ
    ACTA PHYSICO-CHIMICA SINICA, 2004, 20 (05) : 498 - 502
  • [45] Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet
    Chen, Ying-Li
    Li, Qian-Zhong
    Zhang, Li-Qing
    AMINO ACIDS, 2012, 42 (04) : 1309 - 1316
  • [46] Prediction of GABAA receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine
    Mohabatkar, Hassan
    Beigi, Majid Mohammad
    Esmaeili, Abolghasem
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 281 (01) : 18 - 23
  • [47] Predicting protein structural classes with pseudo amino acid composition: An approach using geometric moments of cellular automaton image
    Xiao, Xuan
    Wang, Pu
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2008, 254 (03) : 691 - 696
  • [48] Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes
    Zhang, T. -L.
    Ding, Y. -S.
    AMINO ACIDS, 2007, 33 (04) : 623 - 629
  • [49] Prediction of protein structural class by amino acid and polypeptide composition
    Luo, RY
    Feng, ZP
    Liu, JK
    EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (17): : 4219 - 4225
  • [50] Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes
    T.-L. Zhang
    Y.-S. Ding
    Amino Acids, 2007, 33 : 623 - 629