Identifying natural substrates for chaperonins using a sequence-based approach

被引:23
|
作者
Stan, G
Brooks, BR
Lorimer, GH
Thirumalai, D [1 ]
机构
[1] Univ Maryland, Inst Phys Sci & Technol, Biol Sci Program, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Chem & Biochem, Biol Sci Program, College Pk, MD 20742 USA
[3] NHLBI, Lab Computat Biol, Natl Inst Hlth, Bethesda, MD 20892 USA
关键词
chaperonins; protein recognition; E; coli; yeast genomes;
D O I
10.1110/ps.04933205
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Escherichia coli chaperonin machinery, GroEL, assists the folding of a number of proteins. We describe a sequence-based approach to identify the natural substrate proteins (SPs) for GroEL. Our method is based on the hypothesis that natural SPs are those that contain patterns of residues similar to those found in either GroES mobile loop and/or strongly binding peptide in complex with GroEL. The method is validated by comparing the predicted results with experimentally determined natural SPs for GroEL. We have searched for such patterns in five genomes. In the E. coli genome, we identify 1422 (about one-third) Sequences that are putative natural SPs. In Saccharomyces cerevisiae, 2885 (32%) of sequences can be natural substrates for Hsp60, which is the analog of GroEL. The precise number of natural SPs is shown to be a function of the number of contacts an SP makes with the apical domain (N-C) and the number of binding sites (N-B) in the oligomer with which it interacts. For known SPs for GroEL we find similar to4 < N-C < 5 and 2 less than or equal to N-B less than or equal to 4. A limited analysis of the predicted binding sequences shows that they do not adopt any preferred secondary structure. Our method also predicts the putative binding regions in the identified SPs. The results of our study show that a variety of SPs. associated with diverse functions, can interact with GroEL.
引用
收藏
页码:193 / 201
页数:9
相关论文
共 50 条
  • [1] Mapping Ds insertions in barley using a sequence-based approach
    Cooper, LD
    Marquez-Cedillo, L
    Singh, J
    Sturbaum, AK
    Zhang, S
    Edwards, V
    Johnson, K
    Kleinhofs, A
    Rangel, S
    Carollo, V
    Bregitzer, P
    Lemaux, PG
    Hayes, PM
    MOLECULAR GENETICS AND GENOMICS, 2004, 272 (02) : 181 - 193
  • [2] Mapping Ds insertions in barley using a sequence-based approach
    L. D. Cooper
    L. Marquez-Cedillo
    J. Singh
    A. K. Sturbaum
    S. Zhang
    V. Edwards
    K. Johnson
    A. Kleinhofs
    S. Rangel
    V. Carollo
    P. Bregitzer
    P. G. Lemaux
    P. M. Hayes
    Molecular Genetics and Genomics, 2004, 272 : 181 - 193
  • [3] IACP: a sequence-based tool for identifying anticancer peptides
    Chen, Wei
    Ding, Hui
    Feng, Pengmian
    Lin, Hao
    Chou, Kuo-Chen
    ONCOTARGET, 2016, 7 (13) : 16895 - 16909
  • [4] Sequence-Based Pronunciation Modeling Using a Noisy-Channel Approach
    Hofmann, Hansjoerg
    Sakti, Sakriani
    Isotani, Ryosuke
    Kawai, Hisashi
    Nakamura, Satoshi
    Minker, Wolfgang
    SPOKEN DIALOGUE SYSTEMS FOR AMBIENT ENVIRONMENTS, 2010, 6392 : 156 - 162
  • [5] Hybrid sequence-based Android malware detection using natural language processing
    Zhang, Nan
    Xue, Jingfeng
    Ma, Yuxi
    Zhang, Ruyun
    Liang, Tiancai
    Tan, Yu-an
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (10) : 5770 - 5784
  • [6] A Novel Hybrid Sequence-Based Model for Identifying Anticancer Peptides
    Xu, Lei
    Liang, Guangmin
    Wang, Longjie
    Liao, Changrui
    GENES, 2018, 9 (03)
  • [7] !Sentiment Classification: A Topic Sequence-Based Approach
    Song, Xuliang
    Liang, Jiguang
    Hu, Chengcheng
    JOURNAL OF COMPUTERS, 2016, 11 (01) : 1 - 9
  • [8] SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks
    Pati, Suchita
    Aga, Shaizeen
    Sinclair, Matthew D.
    Jayasena, Nuwan
    2020 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2020, : 69 - 80
  • [9] A Prufer sequence-based approach for schema matching
    Algergawy, Alsayed
    Schallehn, Eike
    Saake, Gunter
    DATABASES AND INFORMATION SYSTEMS, 2008, : 205 - 216
  • [10] Identifying non-coding somatic cancer driver mutations using sequence-based models
    Urzua-Traslavina, Carlos
    van Lieshout, Tijs
    Barbadilla-Martinez, Lucia
    Klaassen, Noud
    Franceschini-Santos, Vinicius
    de Ridder, Jeroen
    van Steensel, Bas
    Franke, Lude
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1659 - 1659