A universal framework for detecting cis-regulatory diversity in DNA regions

被引:1
|
作者
Biswas, Anushua [1 ,2 ,3 ]
Narlikar, Leelavati [1 ,2 ,3 ]
机构
[1] CSIR Natl Chem Lab, Dept Chem Engn, Pune 411008, Maharashtra, India
[2] Acad Sci & Innovat Res, Ghaziabad 201002, India
[3] Indian Inst Sci Educ & Res, Dept Data Sci, Pune 411008, Maharashtra, India
关键词
CHROMATIN ACCESSIBILITY; TRANSCRIPTION FACTORS; BINDING; ELEMENTS; ENHANCERS; PROTEINS; CTCF; MAP; IDENTIFICATION; TOOLS;
D O I
10.1101/gr.274563.120
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing-based assays measure different biochemical activities pertaining to gene regulation, genomewide. These activities include transcription factor (TF)-DNA binding, enhancer activity, open chromatin, and more. A major goal is to understand underlying sequence components, or motifs, that can explain the measured activity. It is usually not one motif but a combination of motifs bound by cooperatively acting proteins that confers activity to such regions. Furthermore, regions can be diverse, governed by different combinations of TFs/ motifs. Current approaches do not take into account this issue of combinatorial diversity. We present a new statistical framework, cisDIVERSITY, which models regions as diverse modules characterized by combinations of motifs while simultaneously learning the motifs themselves. Because cisDIVERSITY does not rely on knowledge of motifs, modules, cell type, or organism, it is general enough to be applied to regions reported by most high-throughput assays. For example, in enhancer predictions resulting from different assays-GRO-cap, STARR-seq, and those measuring chromatin structure-cisDIVERSITY discovers distinct modules and combinations of TF binding sites, some specific to the assay. From protein-DNA binding data, cisDIVERSITY identifies potential cofactors of the profiled TF, whereas from ATAC-seq data, it identifies tissue-specific regulatory modules. Finally, analysis of single-cell ATAC-seq data suggests that regions open in one cell-state encode information about future states, with certain modules staying open and others closing down in the next time point.
引用
收藏
页码:1646 / 1662
页数:17
相关论文
共 50 条
  • [1] Detecting natural selection on cis-regulatory DNA
    Hahn, Matthew W.
    [J]. GENETICA, 2007, 129 (01) : 7 - 18
  • [2] Detecting natural selection on cis-regulatory DNA
    Matthew W. Hahn
    [J]. Genetica, 2007, 129 : 7 - 18
  • [3] Evolution of cis-regulatory regions versus codifying regions
    Rodríguez-Trelles, F
    Tarrío, R
    Ayala, FJ
    [J]. INTERNATIONAL JOURNAL OF DEVELOPMENTAL BIOLOGY, 2003, 47 (7-8): : 665 - 673
  • [4] Simultaneous alignment and annotation of cis-regulatory regions
    Bais, Abha Singh
    Grossmann, Steffen
    Vingron, Martin
    [J]. BIOINFORMATICS, 2007, 23 (02) : E44 - E49
  • [5] Active DNA demethylation of developmental cis-regulatory regions predates vertebrate origins
    Skvortsova, Ksenia
    Bertrand, Stephanie
    Voronov, Danila
    Duckett, Paul E.
    Ross, Samuel E.
    Magri, Marta Silvia
    Maeso, Ignacio
    Weatheritt, Robert J.
    Skarmeta, Jose Luis Gomez
    Arnone, Maria Ina
    Escriva, Hector
    Bogdanovic, Ozren
    [J]. SCIENCE ADVANCES, 2022, 8 (48):
  • [6] The structure and evolution of cis-regulatory regions: the shavenbaby story
    Stern, David L.
    Frankel, Nicolas
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2013, 368 (1632)
  • [7] GREAT improves functional interpretation of cis-regulatory regions
    Cory Y McLean
    Dave Bristor
    Michael Hiller
    Shoa L Clarke
    Bruce T Schaar
    Craig B Lowe
    Aaron M Wenger
    Gill Bejerano
    [J]. Nature Biotechnology, 2010, 28 : 495 - 501
  • [8] BEARR:: Batch extraction and analysis of cis-regulatory regions
    Vega, VB
    Bangarusamy, DK
    Miller, LD
    Liu, ET
    Lin, CY
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : W257 - W260
  • [9] GREAT improves functional interpretation of cis-regulatory regions
    McLean, Cory Y.
    Bristor, Dave
    Hiller, Michael
    Clarke, Shoa L.
    Schaar, Bruce T.
    Lowe, Craig B.
    Wenger, Aaron M.
    Bejerano, Gill
    [J]. NATURE BIOTECHNOLOGY, 2010, 28 (05) : 495 - U155
  • [10] A new framework for identifying cis-regulatory motifs in prokaryotes
    Li, Guojun
    Liu, Bingqiang
    Ma, Qin
    Xu, Ying
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (07) : E42 - U54