BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data

被引:35
|
作者
Kahara, Juhani [1 ]
Lahdesmaki, Harri [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, FI-00076 Aalto, Finland
基金
芬兰科学院;
关键词
OPEN CHROMATIN;
D O I
10.1093/bioinformatics/btv294
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Transcription factors (TFs) are a class of DNA-binding proteins that have a central role in regulating gene expression. To reveal mechanisms of transcriptional regulation, a number of computational tools have been proposed for predicting TF-DNA interaction sites. Recent studies have shown that genome-wide sequencing data on open chromatin sites from a DNase I hypersensitivity experiments (DNase-seq) has a great potential to map putative binding sites of all transcription factors in a single experiment. Thus, computational methods for analysing DNase-seq to accurately map TF-DNA interaction sites are highly needed. Results: Here, we introduce a novel discriminative algorithm, BinDNase, for predicting TF-DNA interaction sites using DNase-seq data. BinDNase implements an efficient method for selecting and extracting informative features from DNase I signal for each TF, either at single nucleotide resolution or for larger regions. The method is applied to 57 transcription factors in cell line K562 and 31 transcription factors in cell line HepG2 using data from the ENCODE project. First, we show that BinDNase compares favourably to other supervised and unsupervised methods developed for TF-DNA interaction prediction using DNase-seq data. We demonstrate the importance to model each TF with a separate prediction model, reflecting TF-specific DNA accessibility around the TF-DNA interaction site. We also show that a highly standardised DNase-seq data (pre) processing is a requisite for accurate TF binding predictions and that sequencing depth has on average only a moderate effect on prediction accuracy. Finally, BinDNase's binding predictions generalise to other cell types, thus making BinDNase a versatile tool for accurate TF binding prediction.
引用
收藏
页码:2852 / 2859
页数:8
相关论文
共 50 条
  • [21] Computational framework for the prediction of transcription factor binding sites by multiple data integration
    Ambesi-Impiombato, Alberto
    Bansal, Mukesh
    Lio, Pietro
    di Bernardo, Diego
    BMC NEUROSCIENCE, 2006, 7 (Suppl 1)
  • [22] Computational framework for the prediction of transcription factor binding sites by multiple data integration
    Alberto Ambesi-Impiombato
    Mukesh Bansal
    Pietro Liò
    Diego di Bernardo
    BMC Neuroscience, 7
  • [23] Transcription Factor Binding Site Prediction: Finding the Point from Many Data
    Ishimori, Motoyuki
    PLANT AND CELL PHYSIOLOGY, 2022, 63 (10) : 1324 - 1325
  • [24] Drosophila DNase I footprint database:: a systematic genome annotation of transcription factor binding sites in the fruitfly, Drosophila melanogaster
    Bergman, CM
    Carlson, JW
    Celniker, SE
    BIOINFORMATICS, 2005, 21 (08) : 1747 - 1749
  • [25] Probabilistic framework for transcription factor binding prediction
    Laehdesmaeki, Harri
    Shmulevich, Ilya
    2007 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2007, : 95 - 98
  • [26] Integrated assessment and prediction of transcription factor binding
    Beyer, Andreas
    Workman, Christopher
    Hollunder, Jens
    Radke, Doerte
    Moeller, Ulrich
    Wilhelm, Thomas
    Ideker, Trey
    PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (06) : 615 - 626
  • [27] Structure-based prediction of transcription factor binding sites using a protein-DNA docking approach
    Liu, Zhijie
    Guo, Jun-Tao
    Li, Ting
    Xu, Ying
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 72 (04) : 1114 - 1124
  • [28] Romulus: robust multi-state identification of transcription factor binding sites from DNase-seq data
    Jankowski, Aleksander
    Tiuryn, Jerzy
    Prabhakar, Shyam
    BIOINFORMATICS, 2016, 32 (16) : 2419 - 2426
  • [29] DNase I hypersensitivity analysis of the human CCAAT enhancer binding protein ε (C/EBRε) gene
    Kubota, T
    Hirama, T
    Verbeek, W
    Kawano, S
    Chih, DY
    Chumakov, AM
    Taguchi, H
    Koeffler, HP
    LEUKEMIA RESEARCH, 2001, 25 (11) : 981 - 995
  • [30] Robust Transcription Factor Binding Site Prediction Using Deep Neural Networks
    Geete, Kanu
    Pandey, Manish
    CURRENT BIOINFORMATICS, 2020, 15 (10) : 1137 - 1152