Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation

被引:10
|
作者
Rouault, Herve [1 ,2 ]
Santolini, Marc [3 ]
Schweisguth, Francois [1 ,2 ]
Hakim, Vincent [3 ]
机构
[1] Inst Pasteur, Dev & Stem Cell Biol Dept, F-75015 Paris, France
[2] CNRS, URA2578, F-75015 Paris, France
[3] Univ Paris Diderot, Univ Paris 06, Ecole Normale Super, Lab Phys Stat,CNRS, Paris, France
关键词
TRANSCRIPTIONAL ENHANCERS; DROSOPHILA GENOME; DNA-SEQUENCES; TARGET GENES; DISCOVERY; PREDICTION; DATABASE; BIOINFORMATICS; FEATURES; REVEALS;
D O I
10.1093/nar/gku209
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Cis-regulatory modules (CRMs) and motifs play a central role in tissue and condition-specific gene expression. Here we present Imogene, an ensemble of statistical tools that we have developed to facilitate their identification and implemented in a publicly available software. Starting from a small training set of mammalian or fly CRMs that drive similar gene expression profiles, Imogene determines de novo cis-regulatory motifs that underlie this co-expression. It can then predict on a genome-wide scale other CRMs with a regulatory potential similar to the training set. Imogene bypasses the need of large datasets for statistical analyses by making central use of the information provided by the sequenced genomes of multiple species, based on the developed statistical tools and explicit models for transcription factor binding site evolution. We test Imogene on characterized tissue-specific mouse developmental CRMs. Its ability to identify CRMs with the same specificity based on its de novo created motifs is comparable to that of previously evaluated 'motif-blind' methods. We further show, both in flies and in mammals, that Imogene de novo generated motifs are sufficient to discriminate CRMs related to different developmental programs. Notably, purely relying on sequence data, Imogene performs as well in this discrimination task as a previously reported learning algorithm based on Chromatin Immunoprecipitation (ChIP) data for multiple transcription factors at multiple developmental stages.
引用
收藏
页码:6128 / 6145
页数:18
相关论文
共 50 条
  • [1] Genome-wide identification of cis-regulatory motifs and modules underlying gene coregulation using statistics and phylogeny
    Rouault, Herve
    Mazouni, Khalil
    Couturier, Lydie
    Hakim, Vincent
    Schweisguth, Francois
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (33) : 14615 - 14620
  • [2] cis-regulatory modules
    Weitzman J.B.
    [J]. Genome Biology, 3 (1):
  • [3] Motifs and cis-regulatory modules mediating the expression of genes co-expressed in presynaptic neurons
    Liu, Rui
    Hannenhalli, Sridhar
    Bucan, Maja
    [J]. GENOME BIOLOGY, 2009, 10 (07):
  • [4] Motifs and cis-regulatory modules mediating the expression of genes co-expressed in presynaptic neurons
    Rui Liu
    Sridhar Hannenhalli
    Maja Bucan
    [J]. Genome Biology, 10
  • [5] Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs
    Wilczynski, Bartek
    Dojer, Norbert
    Patelak, Mateusz
    Tiuryn, Jerzy
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [6] Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs
    Bartek Wilczynski
    Norbert Dojer
    Mateusz Patelak
    Jerzy Tiuryn
    [J]. BMC Bioinformatics, 10
  • [7] Condition-specific coregulation with cis-regulatory motifs and modules in the mouse genome
    Choi, D
    Fang, YA
    Mathers, WD
    [J]. GENOMICS, 2006, 87 (04) : 500 - 508
  • [8] Computational discovery of cis-regulatory modules in Drosophila without prior knowledge of motifs
    Ivan, Andra
    Halfon, Marc S.
    Sinha, Saurabh
    [J]. GENOME BIOLOGY, 2008, 9 (01)
  • [9] Computational discovery of cis-regulatory modules in Drosophila without prior knowledge of motifs
    Andra Ivan
    Marc S Halfon
    Saurabh Sinha
    [J]. Genome Biology, 9
  • [10] Functional Evolution of cis-Regulatory Modules at a Homeotic Gene in Drosophila
    Ho, Margaret C. W.
    Johnsen, Holly
    Goetz, Sara E.
    Schiller, Benjamin J.
    Bae, Esther
    Tran, Diana A.
    Shur, Andrey S.
    Allen, John M.
    Rau, Christoph
    Bender, Welcome
    Fisher, William W.
    Celniker, Susan E.
    Drewell, Robert A.
    [J]. PLOS GENETICS, 2009, 5 (11):