Is it possible to predict the average surface hydrophobicity of a protein using only its amino acid composition?

被引:25
|
作者
Salgado, JC
Rapaport, I
Asenjo, JA
机构
[1] Univ Chile, Ctr Biochem Engn & Biotechnol, Dept Chem Engn & Biotechnol, Millennium Inst Adv Studies Cell Biol & Biotechno, Santiago, Chile
[2] Univ Chile, Dept Engn Math, Ctr Modelamiento Matemat, Santiago, Chile
关键词
mathematical modeling; hydrophobicity; protein hydrophobicity prediction; neural network; proteins; aminoacidic composition;
D O I
10.1016/j.chroma.2005.04.014
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hydrophobicity is one of the most important physicochemical properties of proteins. Moreover, it plays a fundamental role in hydrophobic interaction chromatography, a separation technique that, at present time, is used in most industrial processes for protein purification as well as in laboratory scale applications. Although there are many ways of assessing the hydrophobicity value of a protein, recently, it has been shown that the average surface hydrophobicity (ASH) is an important tool in the area of protein separation and purification particularly in protein chromatography. The ASH is calculated based on the hydrophobic characteristics of each class of amino acid present on the protein surface. The hydrophobic characteristics of the amino acids are determined by a scale of aminoacidic hydrophobicity. In this work, the scales of Cowan-Whittaker and Berggren were studied. However, to calculate the ASH, it is necessary to have the three-dimensional protein structure. Frequently this data does not exist, and the only information available is the amino acid sequence. In these cases it would be desirable to estimate the ASH based only on properties extracted from the protein sequence. It was found that it is possible to predict the ASH from a protein to an acceptable level for many practical applications (correlation coefficient > 0.8) using only the aminoacidic composition. Two predictive tools were built: one based on a simple linear model and the other on a neural network. Both tools were constructed starting from the analysis of a set of 1982 non-redundant proteins. The linear model was able to predict the ASH for an independent subset with a correlation coefficient of 0.769 for the case of Cowan-Whittaker and 0.803 for the case of Berggren. On the other hand, the neural model improved the results shown by the linear model obtaining correlation coefficients of 0.831 and 0.836, respectively. The neural model was somewhat more robust than the linear model particularly as it gave similar correlation coefficients for both hydrophobicity scales tested, moreover, the observed variabilities did not overcome 6.1% of the mean square error. Finally, we tested our models in a set of nine proteins with known retention time in hydrophobic interaction chromatography. We found that both models can predict this retention time with correlation coefficients only slightly inferior (11.5% and 5.5% for the linear and the neural network models, respectively) than models that use the information about the three-dimensional structure of proteins. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 50 条
  • [31] Amino acid composition and protein dimension
    Carugo, Oliviero
    PROTEIN SCIENCE, 2008, 17 (12) : 2187 - 2191
  • [32] COLD HARDINESS OF PLANTS - CORRELATION WITH CHANGES IN ELECTROPHORETIC MOBILITY, COMPOSITION OF AMINO-ACIDS AND AVERAGE HYDROPHOBICITY OF FRACTION-1-PROTEIN
    SHOMERILAN, A
    WAISEL, Y
    PHYSIOLOGIA PLANTARUM, 1975, 34 (01) : 90 - 96
  • [33] THE AMINO ACID COMPOSITION OF ENAMEL PROTEIN
    HESS, WC
    LEE, CY
    NEIDIG, BA
    JOURNAL OF DENTAL RESEARCH, 1953, 32 (04) : 585 - 587
  • [34] Using the concept of Chou's pseudo amino acid composition to predict protein solubility: An approach with entropies in information theory
    Niu Xiaohui
    Li Nana
    Xia Jingbo
    Chen Dingyan
    Peng Yuehua
    Xiao Yang
    Wei Weiquan
    Wang Dongming
    Wang Zengzhen
    JOURNAL OF THEORETICAL BIOLOGY, 2013, 332 : 211 - 217
  • [35] Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes
    Zhang, T. -L.
    Ding, Y. -S.
    AMINO ACIDS, 2007, 33 (04) : 623 - 629
  • [36] Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity
    Y. Diao
    D. Ma
    Z. Wen
    J. Yin
    J. Xiang
    M. Li
    Amino Acids, 2008, 34 : 111 - 117
  • [37] Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes
    T.-L. Zhang
    Y.-S. Ding
    Amino Acids, 2007, 33 : 623 - 629
  • [38] Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity
    Diao, Y.
    Ma, D.
    Wen, Z.
    Yin, J.
    Xiang, J.
    Li, M.
    AMINO ACIDS, 2008, 34 (01) : 111 - 117
  • [39] SURFACE-TENSION OF AMINO-ACID SOLUTIONS - HYDROPHOBICITY SCALE OF AMINO-ACID RESIDUES
    BULL, HB
    BREESE, K
    ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 1974, 161 (02) : 665 - 670
  • [40] Predicting protein folding rates using pseudo amino acid composition
    Guo, Jianxiu
    Rao, Nini
    Xu, Shanglei
    Wang, Yunhe
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,