PolyaPeak: Detecting Transcription Factor Binding Sites from ChIP-seq Using Peak Shape Information

Cited by: 10
Authors
Wu, Hao [1]
Ji, Hongkai [2]
Affiliations
[1] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
Source
PLOS ONE | 2014 / Vol. 9 / Issue 03
Keywords
GENOME; IDENTIFICATION; PROFILES;
DOI
10.1371/journal.pone.0089694
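Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];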
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
ChIP-seq is a powerful technology for detecting genomic regions where a protein of interest interacts with DNA. ChIP-seq data for mapping transcription factor binding sites (TFBSs) have a characteristic pattern: around each binding site, sequence reads aligned to the forward and reverse strands of the reference genome form two separate peaks shifted away from each other, and the true binding site is located between these two peaks. While it has been shown previously that modeling this pattern can improve the accuracy and resolution of binding site detection, efficient methods that fully utilize this information in the TFBS detection procedure are unavailable. We present PolyaPeak, a new method that improves TFBS detection by incorporating peak shape information. PolyaPeak describes peak shapes using a flexible Polya model. The shapes are automatically learnt from the data using a Minorization-Maximization (MM) algorithm, then integrated with the read count information via a hierarchical model to distinguish true binding sites from background noise. Extensive real data analyses show that PolyaPeak is capable of robustly improving TFBS detection compared with existing methods. An R package is freely available.
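As a rough illustration of how a Polya model can score a peak shape, the sketch below evaluates the log-probability of binned read counts under a Dirichlet-multinomial (multivariate Pólya) distribution. This is only a sketch under assumptions: the abstract does not give the parameterization, so the reading of "Polya model" as a Dirichlet-multinomial, the five-bin window, the function name `polya_log_pmf`, and the shape parameters are all hypothetical and are not taken from the PolyaPeak R package.

```python
from math import exp, lgamma, log


def polya_log_pmf(counts, alpha):
    """Log-probability of bin counts under a Dirichlet-multinomial.

    counts: read counts per bin inside one candidate peak window
            (the binning scheme is an assumption for illustration).
    alpha:  positive shape parameters, one per bin.
    """
    if len(counts) != len(alpha):
        raise ValueError("counts and alpha must have the same length")
    n = sum(counts)
    a = sum(alpha)
    # Multinomial coefficient: log n! - sum_k log x_k!
    log_coef = lgamma(n + 1) - sum(lgamma(x + 1) for x in counts)
    # Normalizing ratio: log Gamma(A) - log Gamma(n + A)
    log_norm = lgamma(a) - lgamma(n + a)
    # Per-bin terms: log Gamma(x_k + alpha_k) - log Gamma(alpha_k)
    log_bins = sum(lgamma(x + al) - lgamma(al) for x, al in zip(counts, alpha))
    return log_coef + log_norm + log_bins


# Hypothetical data: a peaked window and a flat window with the same depth.
peaked = [1, 4, 10, 4, 1]
flat = [4, 4, 4, 4, 4]
shape = [1.0, 4.0, 10.0, 4.0, 1.0]  # made-up "learnt" peak shape

print(polya_log_pmf(peaked, shape) > polya_log_pmf(flat, shape))
```

In this toy setting, a window whose read distribution matches the shape parameters scores higher than a flat window with the same total read count, which is the kind of signal a shape-aware detector could combine with read counts. Estimating the shape parameters (for example with an MM algorithm, as the abstract describes) is not shown here.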
Pages: 9
Related papers
  • [1] A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments
    Teemu D Laajala
    Sunil Raghav
    Soile Tuomela
    Riitta Lahesmaa
    Tero Aittokallio
    Laura L Elo
    BMC Genomics, 10
  • [2] A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments
    Laajala, Teemu D.
    Raghav, Sunil
    Tuomela, Soile
    Lahesmaa, Riitta
    Aittokallio, Tero
    Elo, Laura L.
    BMC GENOMICS, 2009, 10
  • [3] On the detection and refinement of transcription factor binding sites using ChIP-Seq data
    Hu, Ming
    Yu, Jindan
    Taylor, Jeremy M. G.
    Chinnaiyan, Arul M.
    Qin, Zhaohui S.
    NUCLEIC ACIDS RESEARCH, 2010, 38 (07) : 2154 - 2167
  • [4] Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
    Wang, Xi
    Zhang, Xuegong
    BMC SYSTEMS BIOLOGY, 2011, 5
  • [5] FROM BINDING MOTIFS IN CHIP-SEQ DATA TO IMPROVED MODELS OF TRANSCRIPTION FACTOR BINDING SITES
    Kulakovskiy, Ivan
    Levitsky, Victor
    Oshchepkov, Dmitry
    Bryzgalov, Leonid
    Vorontsov, Ilya
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (01)
  • [6] Detecting differential binding of transcription factors with ChIP-seq
    Liang, Kun
    Keles, Sunduz
    BIOINFORMATICS, 2012, 28 (01) : 121 - 122
  • [7] Identification of transcription factor binding sites from ChIP-seq data at high resolution
    Bardet, Anais F.
    Steinmann, Jonas
    Bafna, Sangeeta
    Knoblich, Juergen A.
    Zeitlinger, Julia
    Stark, Alexander
    BIOINFORMATICS, 2013, 29 (21) : 2705 - 2713
  • [8] Transcription Factor Binding Site Mapping Using ChIP-Seq
    Jaini, Suma
    Lyubetskaya, Anna
    Gomes, Antonio
    Peterson, Matthew
    Park, Sang Tae
    Raman, Sahadevan
    Schoolnik, Gary
    Galagan, James
    MICROBIOLOGY SPECTRUM, 2014, 2 (02):
  • [9] GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments
    Yevshin, Ivan
    Sharipov, Ruslan
    Valeev, Tagir
    Kel, Alexander
    Kolpakov, Fedor
    NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D61 - D67
  • [10] Optimized detection of transcription factor-binding sites in ChIP-seq experiments
    Elo, Laura L.
    Kallio, Aleksi
    Laajala, Teemu D.
    Hawkins, R. David
    Korpelainen, Eija
    Aittokallio, Tero
    NUCLEIC ACIDS RESEARCH, 2012, 40 (01)