Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome

Cited by: 22
Authors
Trinklein, Nathan D.
Karaoz, Ulas
Wu, Jiaqian
Halees, Anason
Aldred, Shelley Force
Collins, Patrick J.
Zheng, Deyou
Zhang, Zhengdong D.
Gerstein, Mark B.
Snyder, Michael
Myers, Richard M. [1]
Weng, Zhiping
Affiliations
[1] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[2] Boston Univ, Bioinformat Program, Boston, MA 02215 USA
[3] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[4] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[5] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
DOI
10.1101/gr.5716607
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are > 2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
Pages: 720-731
Page count: 12