Identifying natural substrates for chaperonins using a sequence-based approach

被引:23
|
作者
Stan, G
Brooks, BR
Lorimer, GH
Thirumalai, D [1 ]
机构
[1] Univ Maryland, Inst Phys Sci & Technol, Biol Sci Program, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Chem & Biochem, Biol Sci Program, College Pk, MD 20742 USA
[3] NHLBI, Lab Computat Biol, Natl Inst Hlth, Bethesda, MD 20892 USA
关键词
chaperonins; protein recognition; E; coli; yeast genomes;
D O I
10.1110/ps.04933205
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Escherichia coli chaperonin machinery, GroEL, assists the folding of a number of proteins. We describe a sequence-based approach to identify the natural substrate proteins (SPs) for GroEL. Our method is based on the hypothesis that natural SPs are those that contain patterns of residues similar to those found in either GroES mobile loop and/or strongly binding peptide in complex with GroEL. The method is validated by comparing the predicted results with experimentally determined natural SPs for GroEL. We have searched for such patterns in five genomes. In the E. coli genome, we identify 1422 (about one-third) Sequences that are putative natural SPs. In Saccharomyces cerevisiae, 2885 (32%) of sequences can be natural substrates for Hsp60, which is the analog of GroEL. The precise number of natural SPs is shown to be a function of the number of contacts an SP makes with the apical domain (N-C) and the number of binding sites (N-B) in the oligomer with which it interacts. For known SPs for GroEL we find similar to4 < N-C < 5 and 2 less than or equal to N-B less than or equal to 4. A limited analysis of the predicted binding sequences shows that they do not adopt any preferred secondary structure. Our method also predicts the putative binding regions in the identified SPs. The results of our study show that a variety of SPs. associated with diverse functions, can interact with GroEL.
引用
收藏
页码:193 / 201
页数:9
相关论文
共 50 条
  • [31] Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach
    Hofmann, Hansjoerg
    Sakti, Sakriani
    Hori, Chiori
    Kashioka, Hideki
    Nakamura, Satoshi
    Minker, Wolfgang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (08) : 2084 - 2093
  • [32] Sequence-based genomics
    Andrew JG Simpson
    Genome Biology, 3 (9)
  • [33] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Yuqing Qian
    Limin Jiang
    Yijie Ding
    Jijun Tang
    Fei Guo
    BMC Bioinformatics, 22
  • [34] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Qian, Yuqing
    Jiang, Limin
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 3)
  • [35] Generalized Sequence-Based and Reverse Sequence-Based Models for Broadcasting Hot Videos
    Yu, Hsiang-Fu
    Ho, Pin-Han
    Yang, Hung-Chang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (01) : 152 - 165
  • [36] SeqSVM: A Sequence-Based Support Vector Machine Method for Identifying Antioxidant Proteins
    Xu, Lei
    Liang, Guangmin
    Shi, Shuhua
    Liao, Changrui
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (06):
  • [37] Using genomic databases for sequence-based biological discovery
    Baxevanis, AD
    MOLECULAR MEDICINE, 2003, 9 (9-12) : 185 - 192
  • [38] Using Genomic Databases for Sequence-Based Biological Discovery
    Andreas D Baxevanis
    Molecular Medicine, 2003, 9 : 185 - 192
  • [39] Multilocus phylogeography and phylogenetics using sequence-based markers
    Brito, Patricia H.
    Edwards, Scott V.
    GENETICA, 2009, 135 (03) : 439 - 455
  • [40] DNA Sequence-Based Approach for Classifying the Mold Status of Buildings
    Hegarty, Bridget
    Pan, Annabelle
    Haverinen-Shaughnessy, Ulla
    Shaughnessy, Richard
    Peccia, Jordan
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (24) : 15968 - 15975