Is this the right normalization? A diagnostic tool for ChIP-seq normalization

被引:8
|
作者
Angelini, Claudia [1 ]
Heller, Ruth [2 ]
Volkinshtein, Rita [2 ]
Yekutieli, Daniel [2 ]
机构
[1] Ist Applicaz Calcolo Mauro Picone, I-80131 Naples, Italy
[2] Tel Aviv Univ, Dept Stat & Operat Res, IL-69978 Tel Aviv, Israel
来源
BMC BIOINFORMATICS | 2015年 / 16卷
基金
以色列科学基金会;
关键词
Chip-Seq; Diagnostic plots; Normalization; TRANSCRIPTION FACTOR-BINDING; PROTEIN-DNA INTERACTIONS; HUMAN GENOME; CHROMATIN; IDENTIFICATION; DOMAINS; DESIGN;
D O I
10.1186/s12859-015-0579-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Chip-seq experiments are becoming a standard approach for genome-wide profiling protein-DNA interactions, such as detecting transcription factor binding sites, histone modification marks and RNA Polymerase II occupancy. However, when comparing a ChIP sample versus a control sample, such as Input DNA, normalization procedures have to be applied in order to remove experimental source of biases. Despite the substantial impact that the choice of the normalization method can have on the results of a ChIP-seq data analysis, their assessment is not fully explored in the literature. In particular, there are no diagnostic tools that show whether the applied normalization is indeed appropriate for the data being analyzed. Results: In this work we propose a novel diagnostic tool to examine the appropriateness of the estimated normalization procedure. By plotting the empirical densities of log relative risks in bins of equal read count, along with the estimated normalization constant, after logarithmic transformation, the researcher is able to assess the appropriateness of the estimated normalization constant. We use the diagnostic plot to evaluate the appropriateness of the estimates obtained by CisGenome, NCIS and CCAT on several real data examples. Moreover, we show the impact that the choice of the normalization constant can have on standard tools for peak calling such as MACS or SICER. Finally, we propose a novel procedure for controlling the FDR using sample swapping. This procedure makes use of the estimated normalization constant in order to gain power over the naive choice of constant (used in MACS and SICER), which is the ratio of the total number of reads in the ChIP and Input samples. Conclusions: Linear normalization approaches aim to estimate a scale factor, r, to adjust for different sequencing depths when comparing ChIP versus Input samples. The estimated scaling factor can easily be incorporated in many peak caller algorithms to improve the accuracy of the peak identification. The diagnostic plot proposed in this paper can be used to assess how adequate ChIP/Input normalization constants are, and thus it allows the user to choose the most adequate estimate for the analysis.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] PICS: Probabilistic Inference for ChIP-seq
    Zhang, Xuekui
    Robertson, Gordon
    Krzywinski, Martin
    Ning, Kaida
    Droit, Arnaud
    Jones, Steven
    Gottardo, Raphael
    [J]. BIOMETRICS, 2011, 67 (01) : 151 - 163
  • [32] De novo ChIP-seq analysis
    Xin He
    A. Ercument Cicek
    Yuhao Wang
    Marcel H. Schulz
    Hai-Son Le
    Ziv Bar-Joseph
    [J]. Genome Biology, 16
  • [33] Identification of pyrin targets by CHiP-Seq
    G Wood
    Y Kanno
    H Sun
    G Gutierrez-Cruz
    I Aksentijevich
    D Kastner
    [J]. Pediatric Rheumatology, 13 (Suppl 1)
  • [34] De novo ChIP-seq analysis
    He, Xin
    Cicek, A. Ercument
    Wang, Yuhao
    Schulz, Marcel H.
    Le, Hai-Son
    Bar-Joseph, Ziv
    [J]. GENOME BIOLOGY, 2015, 16
  • [35] ChIP-seq: welcome to the new frontier
    Mardis, Elaine R.
    [J]. NATURE METHODS, 2007, 4 (08) : 613 - 614
  • [36] Computational methodology for ChIP-seq analysis
    Hyunjin Shin
    Tao Liu
    Xikun Duan
    Yong Zhang
    XShirley Liu
    [J]. Quantitative Biology., 2013, 1 (01) - 70
  • [37] ChIPseqR: analysis of ChIP-seq experiments
    Humburg, Peter
    Helliwell, Chris A.
    Bulger, David
    Stone, Glenn
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [38] NVT: a fast and simple tool for the assessment of RNA-seq normalization strategies
    Eder, Thomas
    Grebien, Florian
    Rattei, Thomas
    [J]. BIOINFORMATICS, 2016, 32 (23) : 3682 - 3684
  • [39] ChIP-seq: welcome to the new frontier
    Elaine R Mardis
    [J]. Nature Methods, 2007, 4 : 613 - 614
  • [40] ChIP-seq: A Powerful Tool for Studying Protein-DNA Interactions in Plants
    Chen, Xifeng
    Bhadauria, Vijai
    Ma, Bojun
    [J]. CURRENT ISSUES IN MOLECULAR BIOLOGY, 2018, 27 : 171 - 179