Identifying differential transcription factor binding in ChIP-seq

被引:30
|
作者
Wu, Dai-Ying [1 ]
Bittencourt, Danielle [1 ]
Stallcup, Michael R. [1 ]
Siegmund, Kimberly D. [2 ]
机构
[1] Univ So Calif, Dept Biochem & Mol Biol, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept Prevent Med, Kenneth Norris Jr Comprehens Canc Ctr, Los Angeles, CA 90089 USA
基金
美国国家卫生研究院;
关键词
EXPRESSION ANALYSIS; GENOME; BIOCONDUCTOR; ELEMENTS; PACKAGE; SITES; MODEL;
D O I
10.3389/fgene.2015.00169
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
ChIP seq is a widely used assay to measure genome-wide protein binding. The decrease in costs associated with sequencing has led to a rise in the number of studies that investigate protein binding across treatment conditions or cell lines. In addition to the identification of binding sites, new studies evaluate the variation in protein binding between conditions. A number of approaches to study differential transcription factor binding have recently been developed. Several of these methods build upon established methods from RNA-seq to quantify differences in read counts. We compare how these new approaches perform on different data sets from the ENCODE project to illustrate the impact of data processing pipelines under different study designs. The performance of normalization methods for differential ChIP-seq depends strongly on the variation in total amount of protein bound between conditions, with total read count outperforming effective library size, or variants thereof, when a large variation in binding was studied. Use of input subtraction to correct for non-specific binding showed a relatively modest impact on the number of differential peaks found and the fold change accuracy to biological validation, however a larger impact might be expected for samples with more extreme copy number variations between them. Still, it did identify a small subset of novel differential regions while excluding some differential peaks in regions with high background signal. These results highlight proper scaling for between-sample data normalization as critical for differential transcription factor binding analysis and suggest bioinformaticians need to know about the variation in level of total protein binding between conditions to select the best analysis method. At the same time, validation using fold-change estimates from qRT-PCR suggests there is still room for further method improvement.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data
    Yu, Chun-Ping
    Kuo, Chen-Hao
    Nelson, Chase W.
    Chen, Chi-An
    Soh, Zhi Thong
    Lin, Jinn-Jy
    Hsiao, Ru-Xiu
    Chang, Chih-Yao
    Li, Wen-Hsiung
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (20)
  • [32] An Integrated Pipeline for the Genome-Wide Analysis of Transcription Factor Binding Sites from ChIP-Seq
    Mercier, Eloi
    Droit, Arnaud
    Li, Leping
    Robertson, Gordon
    Zhang, Xuekui
    Gottardo, Raphael
    PLOS ONE, 2011, 6 (02):
  • [33] Studying the evolution of transcription factor binding events using multi-species ChIP-Seq data
    Zheng, Wei
    Zhao, Hongyu
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2013, 12 (01) : 1 - 15
  • [34] MotifGenie: a Python']Python application for searching transcription factor binding sequences using ChIP-Seq datasets
    Oguztuzun, Cerag
    Yasar, Pelin
    Yavuz, Kerim
    Muyan, Mesut
    Can, Tolga
    BIOINFORMATICS, 2021, 37 (22) : 4238 - 4239
  • [35] Identifying ChIP-seq enrichment using MACS
    Jianxing Feng
    Tao Liu
    Bo Qin
    Yong Zhang
    Xiaole Shirley Liu
    Nature Protocols, 2012, 7 : 1728 - 1740
  • [36] Identifying ChIP-seq enrichment using MACS
    Feng, Jianxing
    Liu, Tao
    Qin, Bo
    Zhang, Yong
    Liu, Xiaole Shirley
    NATURE PROTOCOLS, 2012, 7 (09) : 1728 - 1740
  • [37] A novel UV laser ChIP-seq method for the study of aberrant transcription factor binding in cancer cells
    Stanko, C.
    Bohm, A. -L
    Schenk, T.
    ONCOLOGY RESEARCH AND TREATMENT, 2021, 44 : 295 - 295
  • [38] Application of experimentally verified transcription factor binding sites models for computational analysis of ChIP-Seq data
    Levitsky, Victor G.
    Kulakovskiy, Ivan V.
    Ershov, Nikita I.
    Oshchepkov, Dmitry Yu
    Makeev, Vsevolod J.
    Hodgman, T. C.
    Merkulova, Tatyana I.
    BMC GENOMICS, 2014, 15
  • [39] Parallel factor ChIP provides essential internal control for quantitative differential ChIP-seq
    Guertin, Michael J.
    Cullen, Amy E.
    Markowetz, Florian
    Holding, Andrew N.
    NUCLEIC ACIDS RESEARCH, 2018, 46 (12)
  • [40] CISMAPPER: predicting regulatory interactions from transcription factor ChIP-seq data
    O'Connor, Timothy
    Boden, Mikael
    Bailey, Timothy L.
    NUCLEIC ACIDS RESEARCH, 2017, 45 (04) : e19