Identifying differential transcription factor binding in ChIP-seq

被引:30
|
作者
Wu, Dai-Ying [1 ]
Bittencourt, Danielle [1 ]
Stallcup, Michael R. [1 ]
Siegmund, Kimberly D. [2 ]
机构
[1] Univ So Calif, Dept Biochem & Mol Biol, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept Prevent Med, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
基金
美国国家卫生研究院;
关键词
EXPRESSION ANALYSIS; GENOME; BIOCONDUCTOR; ELEMENTS; PACKAGE; SITES; MODEL;
D O I
10.3389/fgene.2015.00169
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
ChIP seq is a widely used assay to measure genome-wide protein binding. The decrease in costs associated with sequencing has led to a rise in the number of studies that investigate protein binding across treatment conditions or cell lines. In addition to the identification of binding sites, new studies evaluate the variation in protein binding between conditions. A number of approaches to study differential transcription factor binding have recently been developed. Several of these methods build upon established methods from RNA-seq to quantify differences in read counts. We compare how these new approaches perform on different data sets from the ENCODE project to illustrate the impact of data processing pipelines under different study designs. The performance of normalization methods for differential ChIP-seq depends strongly on the variation in total amount of protein bound between conditions, with total read count outperforming effective library size, or variants thereof, when a large variation in binding was studied. Use of input subtraction to correct for non-specific binding showed a relatively modest impact on the number of differential peaks found and the fold change accuracy to biological validation, however a larger impact might be expected for samples with more extreme copy number variations between them. Still, it did identify a small subset of novel differential regions while excluding some differential peaks in regions with high background signal. These results highlight proper scaling for between-sample data normalization as critical for differential transcription factor binding analysis and suggest bioinformaticians need to know about the variation in level of total protein binding between conditions to select the best analysis method. At the same time, validation using fold-change estimates from qRT-PCR suggests there is still room for further method improvement.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Optimized ChIP-seq method facilitates transcription factor profiling in human tumors
    Singh, Abhishek A.
    Schuurman, Karianne
    Nevedomskaya, Ekaterina
    Stelloo, Suzan
    Linder, Simon
    Droog, Marjolein
    Kim, Yongsoo
    Sanders, Joyce
    van der Poel, Henk
    Bergman, Andries M.
    Wessels, Lodewyk F. A.
    Zwart, Wilbert
    LIFE SCIENCE ALLIANCE, 2019, 2 (01)
  • [42] Learning Useful Representations of DNA Sequences From ChIP-Seq Datasets for Exploring Transcription Factor Binding Specificities
    Quan, Lijun
    Sun, Xiaoyu
    Wu, Jian
    Mei, Jie
    Huang, Liqun
    He, Ruji
    Nie, Liangpeng
    Chen, Yu
    Lyu, Qiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 998 - 1008
  • [43] dPeak: High Resolution Identification of Transcription Factor Binding Sites from PET and SET ChIP-Seq Data
    Chung, Dongjun
    Park, Dan
    Myers, Kevin
    Grass, Jeffrey
    Kiley, Patricia
    Landick, Robert
    Keles, Suenduez
    PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (10)
  • [44] Genome-wide Statistical Analysis of Multiple Transcription Factor Binding Sites Obtained by ChIP-seq Technologies
    Orlov, Yuriy L.
    Huss, Mikael
    Joseph, Roy
    Xu, Han
    Vega, Vinsensius B.
    Lee, Yew K.
    Goh, Wee S.
    Thomsen, Jane S.
    Cheung, Edwin
    Clarke, Neil D.
    Ng, Huck H.
    COMPBIO09: BREAKING FRONTIERS OF COMPUTATIONAL BIOLOGY, 2009, : 11 - 18
  • [45] Diverse transcription factor binding features revealed by genome-wide ChIP-seq in C. elegans
    Niu, Wei
    Lu, Zhi John
    Zhong, Mei
    Sarov, Mihail
    Murray, John I.
    Brdlik, Cathleen M.
    Janette, Judith
    Chen, Chao
    Alves, Pedro
    Preston, Elicia
    Slightham, Cindie
    Jiang, Lixia
    Hyman, Anthony A.
    Kim, Stuart K.
    Waterston, Robert H.
    Gerstein, Mark
    Snyder, Michael
    Reinke, Valerie
    GENOME RESEARCH, 2011, 21 (02) : 245 - 254
  • [46] NUCLEOSOME DISTRIBUTION AROUND TRANSCRIPTION FACTOR BINDING SITES ENRICHED FROM GENOME-WIDE CHIP-SEQ
    Wang Wei
    Lu Zuhong
    IFPT'6: PROGRESS ON POST-GENOME TECHNOLOGIES, PROCEEDINGS, 2009, : 392 - 395
  • [47] Computer and Statistical Analysis of Transcription Factor Binding and Chromatin Modifications by ChIP-seq data in Embryonic Stem Cell
    Orlov, Yuriy
    Xu, Han
    Afonnikov, Dmitri
    Lim, Bing
    Heng, Jian-Chien
    Yuan, Ping
    Chen, Ming
    Yan, Junli
    Clarke, Neil
    Orlova, Nina
    Huss, Mikael
    Gunbin, Konstantin
    Podkolodnyy, Nikolay
    Ng, Huck-Hui
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2012, 9 (02):
  • [48] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Rebecca Worsley Hunt
    Anthony Mathelier
    Luis del Peso
    Wyeth W Wasserman
    BMC Genomics, 15
  • [49] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Hunt, Rebecca Worsley
    Mathelier, Anthony
    del Peso, Luis
    Wasserman, Wyeth W.
    BMC GENOMICS, 2014, 15
  • [50] A ChIP-Seq Benchmark Shows That Sequence Conservation Mainly Improves Detection of Strong Transcription Factor Binding Sites
    Handstad, Tony
    Rye, Morten Beck
    Drablos, Finn
    Saetrom, Pal
    PLOS ONE, 2011, 6 (04):