A new framework for identifying cis-regulatory motifs in prokaryotes

被引:29
|
作者
Li, Guojun [1 ,2 ,3 ]
Liu, Bingqiang [1 ,2 ,3 ]
Ma, Qin [1 ,2 ,3 ]
Xu, Ying [1 ,2 ,4 ]
机构
[1] Univ Georgia, Dept Biochem & Mol Biol, Computat Syst Biol Lab, Athens, GA 30602 USA
[2] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[3] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[4] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Jilin, Peoples R China
基金
美国国家科学基金会;
关键词
FACTOR-BINDING SITES; GAMMA-PROTEOBACTERIAL GENOMES; ESCHERICHIA-COLI; TRACTOR-DB; DNA; TRANSCRIPTION; DISCOVERY; SEQUENCES; DATABASE; PROTEIN;
D O I
10.1093/nar/gkq948
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a new algorithm, BOBRO, for prediction of cis-regulatory motifs in a given set of promoter sequences. The algorithm substantially improves the prediction accuracy and extends the scope of applicability of the existing programs based on two key new ideas: (i) we developed a highly effective method for reliably assessing the possibility for each position in a given promoter to be the (approximate) start of a conserved sequence motif; and (ii) we developed a highly reliable way for recognition of actual motifs from the accidental ones based on the concept of 'motif closure'. These two key ideas are embedded in a classical framework for motif finding through finding cliques in a graph but have made this framework substantially more sensitive as well as more selective in motif finding in a very noisy background. A comparative analysis shows that the performance coefficient was improved from 29% to 41% by our program compared to the best among other six state-of-the-art prediction tools on a large-scale data sets of promoters from one genome, and also consistently improved by substantial margins on another kind of large-scale data sets of orthologous promoters across multiple genomes. The power of BOBRO in dealing with noisy data was further demonstrated through identification of the motifs of the global transcriptional regulators by running it over 2390 promoter sequences of Escherichia coli K12.
引用
收藏
页码:E42 / U54
页数:9
相关论文
共 50 条
  • [31] Analysis of Cis-Regulatory Motifs in Cassette Exons by Incorporating Exon Skipping Rates
    Zhao, Sihui
    Kim, Jihye
    Heber, Steffen
    BIOINFORMATICS RESEARCH AND APPLICATIONS: 5TH INTERNATIONAL SYMPOSIUM, ISBRA 2009, 2009, 5542 : 272 - 283
  • [32] An integrated toolkit for accurate prediction and analysis of cis-regulatory motifs at a genome scale
    Ma, Qin
    Liu, Bingqiang
    Zhou, Chuan
    Yin, Yanbin
    Li, Guojun
    Xu, Ying
    BIOINFORMATICS, 2013, 29 (18) : 2261 - 2268
  • [33] Mining of cis-Regulatory Motifs Associated with Tissue-Specific Alternative Splicing
    Kim, Jihye
    Zhao, Sihui
    Howard, Brian E.
    Heber, Steffen
    BIOINFORMATICS RESEARCH AND APPLICATIONS: 5TH INTERNATIONAL SYMPOSIUM, ISBRA 2009, 2009, 5542 : 260 - 271
  • [34] MotifCombinator: a web-based tool to search for combinations of cis-regulatory motifs
    Kato, Mamoru
    Tsunoda, Tatsuhiko
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [35] Computational discovery of cis-regulatory modules in Drosophila without prior knowledge of motifs
    Ivan, Andra
    Halfon, Marc S.
    Sinha, Saurabh
    GENOME BIOLOGY, 2008, 9 (01)
  • [36] Bioinformatics Approaches to Gain Insights into cis-Regulatory Motifs Involved in mRNA Localization
    Bouvrette, Louis Philip Benoit
    Blanchette, Mathieu
    Lecuyer, Eric
    BIOLOGY OF MRNA: STRUCTURE AND FUNCTION, 2019, 1203 : 165 - 194
  • [37] Computational discovery of cis-regulatory modules in Drosophila without prior knowledge of motifs
    Andra Ivan
    Marc S Halfon
    Saurabh Sinha
    Genome Biology, 9
  • [38] ModuleDigger: an itemset mining framework for the detection of cis-regulatory modules
    Hong Sun
    Tijl De Bie
    Valerie Storms
    Qiang Fu
    Thomas Dhollander
    Karen Lemmens
    Annemieke Verstuyf
    Bart De Moor
    Kathleen Marchal
    BMC Bioinformatics, 10
  • [39] ModuleDigger: an itemset mining framework for the detection of cis-regulatory modules
    Sun, Hong
    De Bie, Tijl
    Storms, Valerie
    Fu, Qiang
    Dhollander, Thomas
    Lemmens, Karen
    Verstuyf, Annemieke
    De Moor, Bart
    Marchal, Kathleen
    BMC BIOINFORMATICS, 2009, 10
  • [40] PAZAR: a framework for collection and dissemination of cis-regulatory sequence annotation
    Elodie Portales-Casamar
    Stefan Kirov
    Jonathan Lim
    Stuart Lithwick
    Magdalena I Swanson
    Amy Ticoll
    Jay Snoddy
    Wyeth W Wasserman
    Genome Biology, 8