Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome

被引:22
|
作者
Trinklein, Nathan D.
Karaoz, Ulas
Wu, Jiaqian
Halees, Anason
Aldred, Shelley Force
Collins, Patrick J.
Zheng, Deyou
Zhang, Zhengdong D.
Gerstein, Mark B.
Snyder, Michael
Myers, Richard M. [1 ]
Weng, Zhiping
机构
[1] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[2] Boston Univ, Bioinformat Program, Boston, MA 02215 USA
[3] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[4] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[5] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
关键词
D O I
10.1101/gr.5716607
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are > 2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
引用
收藏
页码:720 / 731
页数:12
相关论文
共 50 条
  • [31] A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters
    Saxonov, S
    Berg, P
    Brutlag, DL
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (05) : 1412 - 1417
  • [32] Genome-Wide Analysis of Androgen Receptor Targets Reveals COUP-TF1 as a Novel Player in Human Prostate Cancer
    Perets, Ruth
    Kaplan, Tommy
    Stein, Ilan
    Hidas, Guy
    Tayeb, Shay
    Avraham, Eti
    Ben-Neriah, Yinon
    Simon, Itamar
    Pikarsky, Eli
    PLOS ONE, 2012, 7 (10):
  • [33] Integrated analysis of transcriptomic and proteomic data reveals novel regulators of soybean (Glycine max) hypocotyl development
    Zhang, Xueliang
    Shen, Zhikang
    Sun, Xiaohu
    Chen, Min
    Zhang, Naichao
    FUNCTIONAL PLANT BIOLOGY, 2023, 50 (12) : 1086 - 1098
  • [34] Analysis of the human HP1 interactome reveals novel binding partners
    Rosnoblet, Claire
    Vandamme, Julien
    Voelkel, Pamela
    Angrand, Pierre-Olivier
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2011, 413 (02) : 206 - 211
  • [35] Phylogenomic analysis reveals ancient segmental duplications in the human genome
    Hafeez, Madiha
    Shabbir, Madiha
    Altaf, Fouzia
    Abbasi, Arnir Ali
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2016, 94 : 95 - 100
  • [36] Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells
    Daesik Kim
    Jungeun Kim
    Junho K Hur
    Kyung Wook Been
    Sun-heui Yoon
    Jin-Soo Kim
    Nature Biotechnology, 2016, 34 : 863 - 868
  • [37] Integrated analysis of genomics and proteomics reveals that CKIP-1 is a novel macrophage migration regulator
    Zhang, Luo
    Xia, Xia
    Zhang, Minghua
    Wang, Yiwu
    Xing, Guichun
    Yin, Xiushan
    Song, Lei
    He, Fuchu
    Zhang, Lingqiang
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2013, 436 (03) : 382 - 387
  • [38] Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4
    Khamrin, Pattara
    Okitsu, Shoko
    Ushijima, Hiroshi
    Maneekarn, Niwat
    INFECTION GENETICS AND EVOLUTION, 2013, 17 : 132 - 136
  • [39] Systematic analysis of genome-wide fitness data in yeast reveals novel gene function and drug action
    Maureen E Hillenmeyer
    Elke Ericson
    Ronald W Davis
    Corey Nislow
    Daphne Koller
    Guri Giaever
    Genome Biology, 11
  • [40] A comparative analysis of HGSC and Celera human genome assemblies and gene sets
    Li, SY
    Cutler, G
    Liu, JJJ
    Hoey, T
    Chen, LB
    Schultz, PG
    Liao, JY
    Ling, XFB
    BIOINFORMATICS, 2003, 19 (13) : 1597 - 1605