BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data

被引:35
|
作者
Kahara, Juhani [1 ]
Lahdesmaki, Harri [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, FI-00076 Aalto, Finland
基金
芬兰科学院;
关键词
OPEN CHROMATIN;
D O I
10.1093/bioinformatics/btv294
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Transcription factors (TFs) are a class of DNA-binding proteins that have a central role in regulating gene expression. To reveal mechanisms of transcriptional regulation, a number of computational tools have been proposed for predicting TF-DNA interaction sites. Recent studies have shown that genome-wide sequencing data on open chromatin sites from a DNase I hypersensitivity experiments (DNase-seq) has a great potential to map putative binding sites of all transcription factors in a single experiment. Thus, computational methods for analysing DNase-seq to accurately map TF-DNA interaction sites are highly needed. Results: Here, we introduce a novel discriminative algorithm, BinDNase, for predicting TF-DNA interaction sites using DNase-seq data. BinDNase implements an efficient method for selecting and extracting informative features from DNase I signal for each TF, either at single nucleotide resolution or for larger regions. The method is applied to 57 transcription factors in cell line K562 and 31 transcription factors in cell line HepG2 using data from the ENCODE project. First, we show that BinDNase compares favourably to other supervised and unsupervised methods developed for TF-DNA interaction prediction using DNase-seq data. We demonstrate the importance to model each TF with a separate prediction model, reflecting TF-specific DNA accessibility around the TF-DNA interaction site. We also show that a highly standardised DNase-seq data (pre) processing is a requisite for accurate TF binding predictions and that sequencing depth has on average only a moderate effect on prediction accuracy. Finally, BinDNase's binding predictions generalise to other cell types, thus making BinDNase a versatile tool for accurate TF binding prediction.
引用
收藏
页码:2852 / 2859
页数:8
相关论文
共 50 条
  • [11] Prediction of transcription factor binding sites using genetic algorithm
    Chang, Xiaoyu
    Zhou, Wengang
    Zhou, Chunguang
    Liang, Yanchun
    2006 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, 2006, : 932 - +
  • [12] Prediction of transcription factor binding sites using genetic algorithm
    Chang, Xiaoyu
    Zhou, Wengang
    Zhou, Chunguang
    Liang, Yanchun
    ICIEA 2006: 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, PROCEEDINGS, 2006, : 430 - 433
  • [13] DNase-capture reveals differential transcription factor binding modalities
    Kang, Daniel
    Sherwood, Richard
    Barkal, Amira
    Hashimoto, Tatsunori
    Engstrom, Logan
    Gifford, David
    PLOS ONE, 2017, 12 (12):
  • [14] TRANSCRIPTION FACTOR BINDING SITE PREDICTION WITH MULTIVARIATE GENE EXPRESSION DATA
    Zhang, Nancy R.
    Wildermuth, Mary C.
    Speed, Terence P.
    ANNALS OF APPLIED STATISTICS, 2008, 2 (01): : 332 - 365
  • [15] DNase I hypersensitivity at the early histone H3 gene promoter from the sea urchin Tetrapygus niger is associated with specific transcription factor binding.
    Medina, R
    Paredes, R
    Puchi, M
    Imschenetzky, M
    Montecino, M
    MOLECULAR BIOLOGY OF THE CELL, 1999, 10 : 102A - 102A
  • [16] Differential DNase I hypersensitivity reveals factor-dependent chromatin dynamics
    He, Housheng Hansen
    Meyer, Clifford A.
    Chen, Mei Wei
    Jordan, V. Craig
    Brown, Myles
    Liu, X. Shirley
    GENOME RESEARCH, 2012, 22 (06) : 1015 - 1025
  • [17] Inferring functional transcription factor-gene binding pairs by integrating transcription factor binding data with transcription factor knockout data
    Yang, Tzu-Hsien
    Wu, Wei-Sheng
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [18] GAGA factor-dependent transcription and establishment of DNase hypersensitivity are independent and unrelated events in vivo
    Pile, LA
    Cartwright, IL
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (02) : 1398 - 1404
  • [19] Prediction of transcription factor binding to DNA using rule induction methods
    Huss, Mikael
    Nordstrom, Karin
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2006, 3 (02) : 247 - 263
  • [20] High-Resolution Mapping of In vivo Genomic Transcription Factor Binding Sites Using In situ DNase I Footprinting and ChIP-seq
    Chumsakul, Onuma
    Nakamura, Kensuke
    Kurata, Tetsuya
    Sakamoto, Tomoaki
    Hobman, Jon L.
    Ogasawara, Naotake
    Oshima, Taku
    Ishikawa, Shu
    DNA RESEARCH, 2013, 20 (04) : 325 - 337