A fully Bayesian hidden Ising model for ChIP-seq data analysis

被引:12
|
作者
Mo, Qianxing [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10065 USA
关键词
ChIP-seq; Ising model; Markov random fields; Massively parallel sequencing; Next generation sequencing; BINDING-SITES; DNA; ALGORITHM;
D O I
10.1093/biostatistics/kxr029
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq) is a powerful technique that is being used in a wide range of biological studies including genome-wide measurements of protein-DNA interactions, DNA methylation, and histone modifications. The vast amount of data and biases introduced by sequencing and/or genome mapping pose new challenges and call for effective methods and fast computer programs for statistical analysis. To systematically model ChIP-seq data, we build a dynamic signal profile for each chromosome and then model the profile using a fully Bayesian hidden Ising model. The proposed model naturally takes into account spatial dependency and global and local distributions of sequence tags. It can be used for one-sample and two-sample analyses. Through model diagnosis, the proposed method can detect falsely enriched regions caused by sequencing and/or mapping errors, which is usually not offered by the existing hypothesis-testing-based methods. The proposed method is illustrated using 3 transcription factor (TF) ChIP-seq data sets and 2 mixed ChIP-seq data sets and compared with 4 popular and/or well-documented methods: MACS, CisGenome, BayesPeak, and SISSRs. The results indicate that the proposed method achieves equivalent or higher sensitivity and spatial resolution in detecting TF binding sites with false discovery rate at a much lower level.
引用
收藏
页码:113 / 128
页数:16
相关论文
共 50 条
  • [1] BayesPeak: Bayesian analysis of ChIP-seq data
    Spyrou, Christiana
    Stark, Rory
    Lynch, Andy G.
    Tavare, Simon
    [J]. BMC BIOINFORMATICS, 2009, 10 : 299
  • [2] BayesPeak: Bayesian analysis of ChIP-seq data
    Christiana Spyrou
    Rory Stark
    Andy G Lynch
    Simon Tavaré
    [J]. BMC Bioinformatics, 10
  • [3] Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data
    Choi, Hyungwon
    Nesvizhskii, Alexey I.
    Ghosh, Debashis
    Qin, Zhaohui S.
    [J]. BIOINFORMATICS, 2009, 25 (14) : 1715 - 1721
  • [4] A Bayesian Graphical Model for ChIP-Seq Data on Histone Modifications
    Mitra, Riten
    Mueller, Peter
    Liang, Shoudan
    Yue, Lu
    Ji, Yuan
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 69 - 80
  • [5] A Bayesian changepoint analysis of ChIP-Seq data of Lamin B
    Herrmann, S.
    Schwender, H.
    Ickstadt, K.
    Mueller, P.
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2014, 1844 (01): : 138 - 144
  • [6] THE ANALYSIS OF CHIP-SEQ DATA
    Ma, Wenxiu
    Wong, Wing Hung
    [J]. METHODS IN ENZYMOLOGY, VOL 497: SYNTHETIC BIOLOGY, METHODS FOR PART/DEVICE CHARACTERIZATION AND CHASSIS ENGINEERING, PT A, 2011, 497 : 51 - 73
  • [7] Python in ChIP-Seq data analysis
    Zhang, Li
    Hu, Yuansen
    Wang, Jinshui
    Zhang, Guangle
    [J]. Journal of Chemical and Pharmaceutical Research, 2014, 6 (03) : 1002 - 1007
  • [8] A hidden Ising model for ChIP-chip data analysis
    Mo, Qianxing
    Liang, Faming
    [J]. BIOINFORMATICS, 2010, 26 (06) : 777 - 783
  • [9] A Statistical Framework for the Analysis of ChIP-Seq Data
    Kuan, Pei Fen
    Chung, Dongjun
    Pan, Guangjin
    Thomson, James A.
    Stewart, Ron
    Keles, Suenduez
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) : 891 - 903
  • [10] Integrative analysis of histone ChIP-seq and transcription data using Bayesian mixture models
    Klein, Hans-Ulrich
    Schaefer, Martin
    Porse, Bo T.
    Hasemann, Marie S.
    Ickstadt, Katja
    Dugas, Martin
    [J]. BIOINFORMATICS, 2014, 30 (08) : 1154 - 1162