Identifying differential transcription factor binding in ChIP-seq

被引:23
|
作者
Wu, Dai-Ying [1 ]
Bittencourt, Danielle [1 ]
Stallcup, Michael R. [1 ]
Siegmund, Kimberly D. [2 ]
机构
[1] Univ So Calif, Dept Biochem & Mol Biol, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept Prevent Med, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
基金
美国国家卫生研究院;
关键词
EXPRESSION ANALYSIS; GENOME; BIOCONDUCTOR; ELEMENTS; PACKAGE; SITES; MODEL;
D O I
10.3389/fgene.2015.00169
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
ChIP seq is a widely used assay to measure genome-wide protein binding. The decrease in costs associated with sequencing has led to a rise in the number of studies that investigate protein binding across treatment conditions or cell lines. In addition to the identification of binding sites, new studies evaluate the variation in protein binding between conditions. A number of approaches to study differential transcription factor binding have recently been developed. Several of these methods build upon established methods from RNA-seq to quantify differences in read counts. We compare how these new approaches perform on different data sets from the ENCODE project to illustrate the impact of data processing pipelines under different study designs. The performance of normalization methods for differential ChIP-seq depends strongly on the variation in total amount of protein bound between conditions, with total read count outperforming effective library size, or variants thereof, when a large variation in binding was studied. Use of input subtraction to correct for non-specific binding showed a relatively modest impact on the number of differential peaks found and the fold change accuracy to biological validation, however a larger impact might be expected for samples with more extreme copy number variations between them. Still, it did identify a small subset of novel differential regions while excluding some differential peaks in regions with high background signal. These results highlight proper scaling for between-sample data normalization as critical for differential transcription factor binding analysis and suggest bioinformaticians need to know about the variation in level of total protein binding between conditions to select the best analysis method. At the same time, validation using fold-change estimates from qRT-PCR suggests there is still room for further method improvement.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Extracting transcription factor targets from ChIP-Seq data
    Tuteja, Geetu
    White, Peter
    Schug, Jonathan
    Kaestner, Klaus H.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (17) : e113 - e113
  • [22] DREME: motif discovery in transcription factor ChIP-seq data
    Bailey, Timothy L.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1653 - 1659
  • [23] Transcription factor binding predictions using TRAP for the analysis of ChIP-seq data and regulatory SNPs
    Thomas-Chollier, Morgane
    Hufton, Andrew
    Heinig, Matthias
    O'Keeffe, Sean
    El Masri, Nassim
    Roider, Helge G.
    Manke, Thomas
    Vingron, Martin
    [J]. NATURE PROTOCOLS, 2011, 6 (12) : 1860 - 1869
  • [24] Genome-Wide Profiling of Transcription Factor Binding and Epigenetic Marks in Adipocytes by ChIP-seq
    Nielsen, Ronni
    Mandrup, Susanne
    [J]. METHODS OF ADIPOSE TISSUE BIOLOGY, PT A, 2014, 537 : 261 - 279
  • [25] Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data
    Valouev, Anton
    Johnson, David S.
    Sundquist, Andreas
    Medina, Catherine
    Anton, Elizabeth
    Batzoglou, Serafim
    Myers, Richard M.
    Sidow, Arend
    [J]. NATURE METHODS, 2008, 5 (09) : 829 - 834
  • [26] Transcription factor binding predictions using TRAP for the analysis of ChIP-seq data and regulatory SNPs
    Morgane Thomas-Chollier
    Andrew Hufton
    Matthias Heinig
    Sean O'Keeffe
    Nassim El Masri
    Helge G Roider
    Thomas Manke
    Martin Vingron
    [J]. Nature Protocols, 2011, 6 : 1860 - 1869
  • [27] Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data
    Valouev A.
    Johnson D.S.
    Sundquist A.
    Medina C.
    Anton E.
    Batzoglou S.
    Myers R.M.
    Sidow A.
    [J]. Nature Methods, 2008, 5 (9) : 829 - 834
  • [28] TFEA.ChIP: a tool kit for transcription factor binding site enrichment analysis capitalizing on ChIP-seq datasets
    Puente-Santamaria, Laura
    Wasserman, Wyeth W.
    del Peso, Luis
    [J]. BIOINFORMATICS, 2019, 35 (24) : 5339 - 5340
  • [29] Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data
    Yu, Chun-Ping
    Kuo, Chen-Hao
    Nelson, Chase W.
    Chen, Chi-An
    Soh, Zhi Thong
    Lin, Jinn-Jy
    Hsiao, Ru-Xiu
    Chang, Chih-Yao
    Li, Wen-Hsiung
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (20)
  • [30] Studying the evolution of transcription factor binding events using multi-species ChIP-Seq data
    Zheng, Wei
    Zhao, Hongyu
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2013, 12 (01) : 1 - 15