A new framework for identifying cis-regulatory motifs in prokaryotes

被引:29
|
作者
Li, Guojun [1 ,2 ,3 ]
Liu, Bingqiang [1 ,2 ,3 ]
Ma, Qin [1 ,2 ,3 ]
Xu, Ying [1 ,2 ,4 ]
机构
[1] Univ Georgia, Dept Biochem & Mol Biol, Computat Syst Biol Lab, Athens, GA 30602 USA
[2] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[3] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[4] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Jilin, Peoples R China
基金
美国国家科学基金会;
关键词
FACTOR-BINDING SITES; GAMMA-PROTEOBACTERIAL GENOMES; ESCHERICHIA-COLI; TRACTOR-DB; DNA; TRANSCRIPTION; DISCOVERY; SEQUENCES; DATABASE; PROTEIN;
D O I
10.1093/nar/gkq948
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a new algorithm, BOBRO, for prediction of cis-regulatory motifs in a given set of promoter sequences. The algorithm substantially improves the prediction accuracy and extends the scope of applicability of the existing programs based on two key new ideas: (i) we developed a highly effective method for reliably assessing the possibility for each position in a given promoter to be the (approximate) start of a conserved sequence motif; and (ii) we developed a highly reliable way for recognition of actual motifs from the accidental ones based on the concept of 'motif closure'. These two key ideas are embedded in a classical framework for motif finding through finding cliques in a graph but have made this framework substantially more sensitive as well as more selective in motif finding in a very noisy background. A comparative analysis shows that the performance coefficient was improved from 29% to 41% by our program compared to the best among other six state-of-the-art prediction tools on a large-scale data sets of promoters from one genome, and also consistently improved by substantial margins on another kind of large-scale data sets of orthologous promoters across multiple genomes. The power of BOBRO in dealing with noisy data was further demonstrated through identification of the motifs of the global transcriptional regulators by running it over 2390 promoter sequences of Escherichia coli K12.
引用
收藏
页码:E42 / U54
页数:9
相关论文
共 50 条
  • [21] Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs
    Bartek Wilczynski
    Norbert Dojer
    Mateusz Patelak
    Jerzy Tiuryn
    BMC Bioinformatics, 10
  • [22] Systematic functional characterization of cis-regulatory motifs in human core promoters
    Sinha, Saurabh
    Adler, Adam S.
    Field, Yair
    Chang, Howard Y.
    Segal, Eran
    GENOME RESEARCH, 2008, 18 (03) : 477 - 488
  • [23] An Integrated Approach to Identifying Cis-Regulatory Modules in the Human Genome
    Won, Kyoung-Jae
    Agarwal, Saurabh
    Shen, Li
    Shoemaker, Robert
    Ren, Bing
    Wang, Wei
    PLOS ONE, 2009, 4 (05):
  • [24] Identifying the conserved network of cis-regulatory sites of a eukaryotic genome
    Wang, T
    Stormo, GD
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (48) : 17400 - 17405
  • [25] Identifying cis-regulatory enhancers associated with cichlid craniofacial evolution
    Powder, K. E.
    Albertson, R. C.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2017, 57 : E379 - E379
  • [26] A computational pipeline for high-throughput discovery of cis-regulatory noncoding RNA in prokaryotes
    Yao, Zizhen
    Barrick, Jeffrey
    Weinberg, Zasha
    Neph, Shane
    Breaker, Ronald
    Tompa, Martin
    Ruzzo, Walter L.
    PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (07) : 1212 - 1223
  • [27] Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes
    Zhang, Shaoqiang
    Xu, Minli
    Li, Shan
    Su, Zhengchang
    NUCLEIC ACIDS RESEARCH, 2009, 37 (10)
  • [28] MotifCombinator: a web-based tool to search for combinations of cis-regulatory motifs
    Mamoru Kato
    Tatsuhiko Tsunoda
    BMC Bioinformatics, 8
  • [29] Condition-specific coregulation with cis-regulatory motifs and modules in the mouse genome
    Choi, D
    Fang, YA
    Mathers, WD
    GENOMICS, 2006, 87 (04) : 500 - 508
  • [30] cis-regulatory modules
    Weitzman J.B.
    Genome Biology, 3 (1)