Sequence determinants of human gene regulatory elements

被引:86
|
作者
Sahu, Biswajyoti [1 ,2 ]
Hartonen, Tuomo [1 ]
Pihlajamaa, Paivi [1 ]
Wei, Bei [3 ,4 ]
Dave, Kashyap [3 ]
Zhu, Fangjie [5 ]
Kaasinen, Eevi [1 ,3 ]
Lidschreiber, Katja [6 ,7 ]
Lidschreiber, Michael [6 ,7 ]
Daub, Carsten O. [7 ,8 ]
Cramer, Patrick [6 ,7 ]
Kivioja, Teemu [1 ]
Taipale, Jussi [1 ,3 ,5 ]
机构
[1] Univ Helsinki, Fac Med, Appl Tumor Genom Res Program, Helsinki, Finland
[2] Univ Helsinki, Fac Med, Med, Helsinki, Finland
[3] Karolinska Inst, Dept Med Biochem & Biophys, Stockholm, Sweden
[4] Stanford Univ, Dept Genet, Sch Med, Stanford, CA USA
[5] Univ Cambridge, Dept Biochem, Cambridge, England
[6] Max Planck Inst Biophys Chem, Dept Mol Biol, Gottingen, Germany
[7] Karolinska Inst, Dept Biosci & Nutr, Stockholm, Sweden
[8] Sci Life Lab, Stockholm, Sweden
基金
芬兰科学院; 英国医学研究理事会;
关键词
TRANSCRIPTION-FACTOR-BINDING; DEVELOPMENTAL ENHANCERS; ACTIVE ENHANCERS; DNA; CHROMATIN; PROTEINS; MAPS; SITE; SPECIFICITIES; EXPRESSION;
D O I
10.1038/s41588-021-01009-4
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Analysis of massively parallel reporter assays measuring the transcriptional activity of DNA sequences indicates that most transcription factor (TF) activity is additive and does not rely on specific TF-TF interactions. Individual TFs can have different gene regulatory activities. DNA can determine where and when genes are expressed, but the full set of sequence determinants that control gene expression is unknown. Here, we measured the transcriptional activity of DNA sequences that represent an similar to 100 times larger sequence space than the human genome using massively parallel reporter assays (MPRAs). Machine learning models revealed that transcription factors (TFs) generally act in an additive manner with weak grammar and that most enhancers increase expression from a promoter by a mechanism that does not appear to involve specific TF-TF interactions. The enhancers themselves can be classified into three types: classical, closed chromatin and chromatin dependent. We also show that few TFs are strongly active in a cell, with most activities being similar between cell types. Individual TFs can have multiple gene regulatory activities, including chromatin opening and enhancing, promoting and determining transcription start site (TSS) activity, consistent with the view that the TF binding motif is the key atomic unit of gene expression.
引用
收藏
页码:283 / +
页数:30
相关论文
共 50 条
  • [1] Sequence determinants of human gene regulatory elements
    Biswajyoti Sahu
    Tuomo Hartonen
    Päivi Pihlajamaa
    Bei Wei
    Kashyap Dave
    Fangjie Zhu
    Eevi Kaasinen
    Katja Lidschreiber
    Michael Lidschreiber
    Carsten O. Daub
    Patrick Cramer
    Teemu Kivioja
    Jussi Taipale
    Nature Genetics, 2022, 54 : 283 - 294
  • [2] Multiple regulatory elements in the 5′-flanking sequence of the human ε-globin gene
    Li, J
    Noguchi, CT
    Miller, W
    Hardison, R
    Schechter, AN
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (17) : 10202 - 10209
  • [3] IDENTIFICATION OF REGULATORY ELEMENTS IN THE HUMAN RENIN GENE
    SMITH, DL
    MORRIS, BJ
    DO, KJSY
    HSUEH, WA
    CIRCULATION, 1992, 86 (04) : 601 - 601
  • [4] 5'-REGULATORY ELEMENTS OF THE HUMAN HPRT GENE
    PATEL, PI
    TSAO, T
    CASKEY, CT
    CHINAULT, AC
    JOURNAL OF CELLULAR BIOCHEMISTRY, 1986, : 178 - 178
  • [5] REGULATORY ELEMENTS OF THE HUMAN PROOPIOMELANOCORTIN GENE PROMOTER
    KRAUS, J
    BUCHFELDER, M
    HOLLT, V
    DNA AND CELL BIOLOGY, 1993, 12 (06) : 527 - 536
  • [6] The 5′-flanking sequence and regulatory elements of the cystatin S gene
    Shaw, PA
    Chaparro, O
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1999, 261 (03) : 705 - 711
  • [7] Identifying regulatory elements and gene boundaries with comparative genomic sequence analysis
    Lund, Jim
    BMC BIOINFORMATICS, 2008, 9 (Suppl 7)
  • [9] Conserved regulatory elements of the promoter sequence of the gene rpoH of enteric bacteria
    Ramírez-Santos, J
    Collado-Vides, J
    García-Varela, M
    Gómez-Eichelmann, MC
    NUCLEIC ACIDS RESEARCH, 2001, 29 (02) : 380 - 386
  • [10] Identifying regulatory elements and gene boundaries with comparative genomic sequence analysis
    Jim Lund
    BMC Bioinformatics, 9