Patterns of protein-fold usage in fight microbial genomes: A comprehensive structural census

被引:0
|
作者
Gerstein, M [1 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
关键词
structure databank; superfold; protein structure;
D O I
10.1002/(SICI)1097-0134(19981201)33:4<518::AID-PROT5>3.0.CO;2-J
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Eight microbial genomes are compared in terms of protein structure. Specifically, yeast, H. influenzae, M. genitalium, M. jannaschii, Synechocystis, M. pneumoniae, II. pylori, and E. coli are compared in terms of patterns of fold usage-whether a given fold occurs in a particular organism. Of the similar to 340 soluble protein folds currently in the structure databank (PDB), 240 occur in at least one of the eight genomes, and 30 are shared amongst all eight. The shared folds are depleted in all-helical structure and enriched in mixed helix-sheet structure compared to the folds in the PDB. The top-10 most common of the shared 30 are enriched in superfolds, uniting many nonhomologous sequence families, and are especially similar in overall architecture-eight having helices packed onto a central sheet. They are also very different from the common folds in the PBD, highlighting databank biases, Folds can be ranked in terms of expression as well as genome duplication. In yeast the top-10 most highly expressed folds are considerably different from the most highly duplicated folds, A tree can be constructed grouping genomes in terms of their shared folds, This has a remarkably similar topology to more conventional classifications, based on very different measures of relatedness. Finally, folds of membrane proteins can be analyzed through transmembrane-helix (TM) prediction. All the genomes appear to have similar usage patterns for these folds, with the occurrence of a particular fold falling off rapidly with increasing numbers of TM-elements, according to a "Zipf-like" law. This implies there are no marked preferences for proteins with particular numbers of TM-helices (e.g. 7-TM) in microbial genomes. Further information pertinent to this analysis is available at http://bioinfo.mbb.yale.edu/genome. (C) 1998 Wiley-liss, Inc.
引用
收藏
页码:518 / 534
页数:17
相关论文
共 12 条
  • [1] Fold usage on genomes and protein fold evolution
    Abeln, S
    Deane, CM
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (04) : 690 - 700
  • [3] Structural proteomics of minimal organisms: Conservation of protein fold usage and evolutionary implications
    Chandonia, John-Marc
    Kim, Sung-Hou
    BMC STRUCTURAL BIOLOGY, 2006, 6
  • [4] Decoding Codon Usage Patterns in High-Risk Human Papillomavirus Genomes: A Comprehensive Analysis
    Ren, Jiahuan
    Li, Qijia
    Shen, Weifeng
    Tan, Xiaochun
    CURRENT MICROBIOLOGY, 2025, 82 (04)
  • [5] Comprehensive analysis of distinctive polyketide and nonribosomal peptide structural motifs encoded in microbial genomes
    Minowa, Yohsuke
    Araki, Michihiro
    Kanehisa, Minoru
    JOURNAL OF MOLECULAR BIOLOGY, 2007, 368 (05) : 1500 - 1517
  • [6] Seven GC-rich microbial genomes adopt similar codon usage patterns regardless of their phylogenetic lineages
    Chen, LL
    Zhang, CT
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 306 (01) : 310 - 317
  • [7] Mitochondrial genomes of Macropsini (Hemiptera: Cicadellidae: Eurymelinae): Structural features, codon usage patterns, and phylogenetic implications
    Guo, Meishu
    Wang, Jiajia
    Li, Hu
    Yu, Kai
    Yang, Yanqiong
    Li, Min
    Smagghe, Guy
    Dai, Renhuai
    ECOLOGY AND EVOLUTION, 2024, 14 (09):
  • [8] A comprehensive analysis of Usutu virus (USUV) genomes revealed lineage-specific codon usage patterns and host adaptations
    Zhou, Jianglin
    Xing, Yaling
    Zhou, Zhe
    Wang, Shengqi
    FRONTIERS IN MICROBIOLOGY, 2023, 13
  • [9] Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space
    Marsden, RL
    Lee, D
    Maibaum, M
    Yeats, C
    Orengo, CA
    NUCLEIC ACIDS RESEARCH, 2006, 34 (03) : 1066 - 1080
  • [10] Comprehensive Analysis of Codon Usage Patterns in Chinese Porcine Circoviruses Based on Their Major Protein-Coding Sequences
    Feng, Hua
    Segales, Joaquim
    Wang, Fangyu
    Jin, Qianyue
    Wang, Aiping
    Zhang, Gaiping
    Franzo, Giovanni
    VIRUSES-BASEL, 2022, 14 (01):