Operons and the effect of genome redundancy in deciphering functional relationships using phylogenetic profiles

Cited by: 25
Authors
Moreno-Hagelsieb, Gabriel [1]
Janga, Sarath Chandra [2]
Affiliations
[1] Wilfrid Laurier Univ, Dept Biol, Waterloo, ON N2L 3C5, Canada
[2] Univ Nacl Autonoma Mexico, CCG, Program Computat Genom, Cuernavaca 62100, Morelos, Mexico
Keywords
phylogenetic profiles; phylogenomics; mutual information; operons; gold standards
DOI
10.1002/prot.21564
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Phylogenetic profiles (PPs) are one of the most promising methods for predicting functional relationships by genomic context. The idea behind PPs is that if the products of two genes have a functional interdependence, the genes should both be either present or absent across genomes. One of the main problems with PPs is that evolutionarily close organisms tend to share a higher number of genes, resulting in the overscoring of PP-relatedness. Properly measuring the overscoring effect of evolutionary redundancy requires examples of both functionally related genes (positive gold standards) and functionally unrelated genes (negative gold standards). Since experimentally verified functional interactions are only available for a few model organisms, there is a need for an alternative to gold standards. The presence of operons (polycistronic transcription units formed of functionally related genes) in prokaryotic genomes offers such an alternative. Genes in operons are located next to each other on the same DNA strand, and thus their presence should result in a higher proportion of predicted functional interactions among adjacent genes on the same strand than among adjacent genes on opposite strands. Under the preceding principle, we present a confidence value (CV) designed for evaluating predictions of functional interactions obtained using PPs. We first show that the CV corresponds to a positive predictive value calculated using experimentally known operons and further validate operon predictions based on this CV in other organisms using available microarray data. Then, we use a fixed CV of 0.90 as a reference to compare PP predictions obtained using different nonredundant genome datasets filtered at varying thresholds of genomic similarity. Our results demonstrate that nonredundant genome datasets increase the number of high-quality predictions by an average of 20%.
Confidence values such as those presented here should help compare other strategies and scoring systems for using phylogenetic profiles and other genomic context methods to predict functional interactions.
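
The abstract describes PPs as presence/absence patterns of genes across genomes, and the keywords name mutual information as a scoring measure. The sketch below illustrates how two such profiles could be scored with mutual information. It is an illustration under stated assumptions, not the authors' implementation: the function name `mutual_information`, the encoding of profiles as lists of 0/1 values (one entry per genome), and the use of base-2 logarithms are all choices made here for the example.

```python
import math
from collections import Counter


def mutual_information(profile_a, profile_b):
    """Mutual information (in bits) between two presence/absence profiles.

    Each profile is a sequence of 0/1 values, one per genome, where 1 means
    the gene is present in that genome. This encoding is an assumption made
    for illustration.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same set of genomes")
    n = len(profile_a)
    joint = Counter(zip(profile_a, profile_b))  # counts of (a, b) pairs
    count_a = Counter(profile_a)                # marginal counts for gene A
    count_b = Counter(profile_b)                # marginal counts for gene B
    mi = 0.0
    for (a, b), n_ab in joint.items():
        p_ab = n_ab / n
        # p_ab / (p_a * p_b) simplifies to n_ab * n / (count_a * count_b)
        mi += p_ab * math.log2(n_ab * n / (count_a[a] * count_b[b]))
    return mi


# Two genes that are always present or absent together across four genomes.
co_occurring = mutual_information([1, 1, 0, 0], [1, 1, 0, 0])
# Two genes whose presence patterns are statistically independent.
unrelated = mutual_information([1, 1, 0, 0], [1, 0, 1, 0])
```

Note that mutual information is symmetric with respect to presence and absence, so two perfectly complementary profiles score as high as two identical ones. Adding near-identical genomes to the dataset also duplicates profile columns without adding independent evidence, which is one way to picture the redundancy problem the abstract addresses.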
Pages: 344-352
Page count: 9