coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq Data

被引:6
|
作者
Xu, Mengyuan [1 ]
Weinberg, Clarice R. [1 ]
Umbach, David M. [1 ]
Li, Leping [1 ]
机构
[1] Natl Inst Environm Hlth Sci, Biostat Branch, NIH, Res Triangle Pk, NC 27709 USA
基金
美国国家卫生研究院;
关键词
EM ALGORITHM; EXPECTATION MAXIMIZATION; BAYESIAN MODELS; GIBBS; DISCOVERY; SEQUENCE; ELEMENTS; SITES; IDENTIFICATION; INFORMATION;
D O I
10.1093/bioinformatics/btr397
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: ChIP-seq data are enriched in binding sites for the protein immunoprecipitated. Some sequences may also contain binding sites for a coregulator. Biologists are interested in knowing which coregulatory factor motifs may be present in the sequences bound by the protein ChIP'ed. Results: We present a finite mixture framework with an expectation-maximization algorithm that considers two motifs jointly and simultaneously determines which sequences contain both motifs, either one or neither of them. Tested on 10 simulated ChIP-seq datasets, our method performed better than repeated application of MEME in predicting sequences containing both motifs. When applied to a mouse liver Foxa2 ChIP-seq dataset involving similar to 12 000 400-bp sequences, coMOTIF identified co-occurrence of Foxa2 with Hnf4a, Cebpa, E-box, Ap1/Maf or Sp1 motifs in similar to 6-33% of these sequences. These motifs are either known as liver-specific transcription factors or have an important role in liver function.
引用
收藏
页码:2625 / 2632
页数:8
相关论文
共 50 条
  • [1] DREME: motif discovery in transcription factor ChIP-seq data
    Bailey, Timothy L.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1653 - 1659
  • [2] Identifying differential transcription factor binding in ChIP-seq
    Wu, Dai-Ying
    Bittencourt, Danielle
    Stallcup, Michael R.
    Siegmund, Kimberly D.
    [J]. FRONTIERS IN GENETICS, 2015, 6
  • [3] WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
    Zhang, Hongbo
    Zhu, Lin
    Huang, De-Shuang
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [4] WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
    Hongbo Zhang
    Lin Zhu
    De-Shuang Huang
    [J]. Scientific Reports, 7
  • [5] Extracting transcription factor targets from ChIP-Seq data
    Tuteja, Geetu
    White, Peter
    Schug, Jonathan
    Kaestner, Klaus H.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (17) : e113 - e113
  • [6] Inferring transcription factor complexes from ChIP-seq data
    Whitington, Tom
    Frith, Martin C.
    Johnson, James
    Bailey, Timothy L.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (15) : e98
  • [7] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Hunt, Rebecca Worsley
    Mathelier, Anthony
    del Peso, Luis
    Wasserman, Wyeth W.
    [J]. BMC GENOMICS, 2014, 15
  • [8] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Rebecca Worsley Hunt
    Anthony Mathelier
    Luis del Peso
    Wyeth W Wasserman
    [J]. BMC Genomics, 15
  • [9] A Statistical Framework for the Analysis of ChIP-Seq Data
    Kuan, Pei Fen
    Chung, Dongjun
    Pan, Guangjin
    Thomson, James A.
    Stewart, Ron
    Keles, Suenduez
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) : 891 - 903
  • [10] Integrative analysis of histone ChIP-seq and transcription data using Bayesian mixture models
    Klein, Hans-Ulrich
    Schaefer, Martin
    Porse, Bo T.
    Hasemann, Marie S.
    Ickstadt, Katja
    Dugas, Martin
    [J]. BIOINFORMATICS, 2014, 30 (08) : 1154 - 1162