Predicting conformational ensembles and genome-wide transcription factor binding sites from DNA sequences

被引:0
|
作者
Munazah Andrabi
Andrew Paul Hutchins
Diego Miranda-Saavedra
Hidetoshi Kono
Ruth Nussinov
Kenji Mizuguchi
Shandar Ahmad
机构
[1] National Institutes of Biomedical Innovation Health and Nutrition,Department of Biology
[2] Southern University of Science and Technology of China,World Premier International (WPI) Immunology Frontier Research Center (IFReC)
[3] Osaka University,Centro de Biología Molecular Severo Ochoa
[4] CSIC/Universidad Autónoma de Madrid,Department of Computer Science
[5] University of Oxford Wolfson Building,Molecular Modeling and Simulation (MMS) Group
[6] National Institutes for Quantum and Radiological Science and Technology,National Cancer Institute, Cancer and Inflammation Program
[7] Leidos Biomedical Research,Department of Biochemistry and Human Genetics
[8] Inc. Frederick,School of Computational and Integrative Sciences
[9] Sackler School of Medicine,Faculty of Biology,Medicine and Health, Michael Smith Building
[10] Tel Aviv University,undefined
[11] Jawaharlal Nehru University,undefined
[12] The University of Manchester,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
DNA shape is emerging as an important determinant of transcription factor binding beyond just the DNA sequence. The only tool for large scale DNA shape estimates, DNAshape was derived from Monte-Carlo simulations and predicts four broad and static DNA shape features, Propeller twist, Helical twist, Minor groove width and Roll. The contributions of other shape features e.g. Shift, Slide and Opening cannot be evaluated using DNAshape. Here, we report a novel method DynaSeq, which predicts molecular dynamics-derived ensembles of a more exhaustive set of DNA shape features. We compared the DNAshape and DynaSeq predictions for the common features and applied both to predict the genome-wide binding sites of 1312 TFs available from protein interaction quantification (PIQ) data. The results indicate a good agreement between the two methods for the common shape features and point to advantages in using DynaSeq. Predictive models employing ensembles from individual conformational parameters revealed that base-pair opening - known to be important in strand separation - was the best predictor of transcription factor-binding sites (TFBS) followed by features employed by DNAshape. Of note, TFBS could be predicted not only from the features at the target motif sites, but also from those as far as 200 nucleotides away from the motif.
引用
收藏
相关论文
共 50 条
  • [21] Mapping genome-wide transcription-factor binding sites using DAP-seq
    Bartlett, Anna
    O'Malley, Ronan C.
    Huang, Shao-shan Carol
    Galli, Mary
    Nery, Joseph R.
    Gallavotti, Andrea
    Ecker, Joseph R.
    NATURE PROTOCOLS, 2017, 12 (08) : 1659 - 1672
  • [22] Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites
    Ramsey, Stephen A.
    Knijnenburg, Theo A.
    Kennedy, Kathleen A.
    Zak, Daniel E.
    Gilchrist, Mark
    Gold, Elizabeth S.
    Johnson, Carrie D.
    Lampano, Aaron E.
    Litvak, Vladimir
    Navarro, Garnet
    Stolyar, Tetyana
    Aderem, Alan
    Shmulevich, Ilya
    BIOINFORMATICS, 2010, 26 (17) : 2071 - 2075
  • [23] Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data
    Valouev A.
    Johnson D.S.
    Sundquist A.
    Medina C.
    Anton E.
    Batzoglou S.
    Myers R.M.
    Sidow A.
    Nature Methods, 2008, 5 (9) : 829 - 834
  • [24] Genome-wide inference of bacterial transcription factor binding sites: new method and its applications
    Nikolaichik, Yevgeny
    Vychik, Pavel
    BMC BIOINFORMATICS, 2020, 21 (SUPPL 20):
  • [25] A systematic genome-wide account of binding sites for the model transcription factor Gcn4
    Coey, Christopher T.
    Clark, David J.
    GENOME RESEARCH, 2022, 32 (02) : 367 - 377
  • [26] Genome-wide analysis of DNA binding sites of protein Jun
    Xie, Jianming
    Li, Minli
    Sun, Xiao
    Lu, Zuhong
    PROGRESS ON POST-GENOME TECHNOLOGIES, 2006, : 365 - 369
  • [27] Nanobody®-based chromatin immunoprecipitation/micro-array analysis for genome-wide identification of transcription factor DNA binding sites
    Trong Nguyen-Duc
    Peeters, Eveline
    Muyldermans, Serge
    Charlier, Daniel
    Hassanzadeh-Ghassabeh, Gholamreza
    NUCLEIC ACIDS RESEARCH, 2013, 41 (05)
  • [28] CHIP-SEQ: USING HIGH-THROUGHPUT DNA SEQUENCING FOR GENOME-WIDE IDENTIFICATION OF TRANSCRIPTION FACTOR BINDING SITES
    Lefrancois, Philippe
    Zheng, Wei
    Snyder, Michael
    METHODS IN ENZYMOLOGY, VOL 470: GUIDE TO YEAST GENETICS:: FUNCTIONAL GENOMICS, PROTEOMICS, AND OTHER SYSTEMS ANALYSIS, 2ND EDITION, 2010, 470 : 77 - 104
  • [29] Genome-wide DNA methylation profiles in hematopoietic stem and progenitor cells reveal overrepresentation of ETS transcription factor binding sites
    Hogart, Amber
    Lichtenberg, Jens
    Ajay, Subramanian S.
    Anderson, Stacie
    Margulies, Elliott H.
    Bodine, David M.
    GENOME RESEARCH, 2012, 22 (08) : 1407 - 1418
  • [30] Genome-wide protein–DNA binding dynamics suggest a molecular clutch for transcription factor function
    Colin R. Lickwar
    Florian Mueller
    Sean E. Hanlon
    James G. McNally
    Jason D. Lieb
    Nature, 2012, 484 : 251 - 255