Comparative study on ChIP-seq data: normalization and binding pattern characterization

被引:42
|
作者
Taslim, Cenny [1 ,2 ]
Wu, Jiejun [1 ]
Yan, Pearlly [1 ]
Singer, Greg [1 ]
Parvin, Jeffrey [3 ,4 ]
Huang, Tim [1 ]
Lin, Shili [2 ]
Huang, Kun [3 ,4 ]
机构
[1] Ohio State Univ, Dept Mol Virol Immunol & Med Genet, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[4] Ohio State Univ, OSUCCC Biomed Informat Shared Resources, Columbus, OH 43210 USA
关键词
GENOME-WIDE ANALYSIS; ESTROGEN-RECEPTOR; GENE-EXPRESSION; METHYLATION; DESIGN; CELLS;
D O I
10.1093/bioinformatics/btp384
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Antibody-based Chromatin Immunoprecipitation assay followed by high-throughput sequencing technology (ChIP-seq) is a relatively new method to study the binding patterns of specific protein molecules over the entire genome. ChIP-seq technology allows scientist to get more comprehensive results in shorter time. Here, we present a non-linear normalization algorithm and a mixture modeling method for comparing ChIP-seq data from multiple samples and characterizing genes based on their RNA polymerase II (Pol II) binding patterns. Results: We apply a two-step non-linear normalization method based on locally weighted regression (LOESS) approach to compare ChIP-seq data across multiple samples and model the difference using an Exponential-Normal(K) mixture model. Fitted model is used to identify genes associated with differential binding sites based on local false discovery rate (fdr). These genes are then standardized and hierarchically clustered to characterize their Pol II binding patterns. As a case study, we apply the analysis procedure comparing normal breast cancer (MCF7) to tamoxifen-resistant (OHT) cell line. We find enriched regions that are associated with cancer (P < 0.0001). Our findings also imply that there may be a dysregulation of cell cycle and gene expression control pathways in the tamoxifen-resistant cells. These results show that the non-linear normalization method can be used to analyze ChIP-seq data across multiple samples.
引用
收藏
页码:2334 / 2340
页数:7
相关论文
共 50 条
  • [1] Normalization of ChIP-seq data with control
    Liang, Kun
    Keles, Sunduz
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [2] Normalization of ChIP-seq data with control
    Kun Liang
    Sündüz Keleş
    [J]. BMC Bioinformatics, 13
  • [3] Is this the right normalization? A diagnostic tool for ChIP-seq normalization
    Claudia Angelini
    Ruth Heller
    Rita Volkinshtein
    Daniel Yekutieli
    [J]. BMC Bioinformatics, 16
  • [4] Is this the right normalization? A diagnostic tool for ChIP-seq normalization
    Angelini, Claudia
    Heller, Ruth
    Volkinshtein, Rita
    Yekutieli, Daniel
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [5] THE ANALYSIS OF CHIP-SEQ DATA
    Ma, Wenxiu
    Wong, Wing Hung
    [J]. METHODS IN ENZYMOLOGY, VOL 497: SYNTHETIC BIOLOGY, METHODS FOR PART/DEVICE CHARACTERIZATION AND CHASSIS ENGINEERING, PT A, 2011, 497 : 51 - 73
  • [7] ChIP-Seq Data Completion and Transcription Factors Binding Analyses
    Huang, De-Shuang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 7 - 7
  • [8] Deep and wide digging for binding motifs in ChIP-Seq data
    Kulakovskiy, I. V.
    Boeva, V. A.
    Favorov, A. V.
    Makeev, V. J.
    [J]. BIOINFORMATICS, 2010, 26 (20) : 2622 - 2623
  • [9] Normalization, bias correction, and peak calling for ChIP-seq
    Diaz, Aaron
    Park, Kiyoub
    Lim, Daniel A.
    Song, Jun S.
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (03)
  • [10] A comparative study of ChIP-seq sequencing library preparation methods
    Sundaram, Arvind Y. M.
    Hughes, Timothy
    Biondi, Shea
    Bolduc, Nathalie
    Bowman, Sarah K.
    Camilli, Andrew
    Chew, Yap C.
    Couture, Catherine
    Farmer, Andrew
    Jerome, John P.
    Lazinski, David W.
    McUsic, Andrew
    Peng, Xu
    Shazand, Kamran
    Xu, Feng
    Lyle, Robert
    Gilfillan, Gregor D.
    [J]. BMC GENOMICS, 2016, 17