Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome

Cited by: 22
Authors
Trinklein, Nathan D.
Karaoz, Ulas
Wu, Jiaqian
Halees, Anason
Aldred, Shelley Force
Collins, Patrick J.
Zheng, Deyou
Zhang, Zhengdong D.
Gerstein, Mark B.
Snyder, Michael
Myers, Richard M. [1]
Weng, Zhiping
Affiliations
[1] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[2] Boston Univ, Bioinformat Program, Boston, MA 02215 USA
[3] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[4] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[5] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
DOI
10.1101/gr.5716607
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are > 2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
Pages: 720 - 731
Page count: 12
Related papers
50 in total
  • [1] Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human
    Zhang, Zhaolei
    Pang, Andy Wing Chun
    Gerstein, Mark
    BMC EVOLUTIONARY BIOLOGY, 2007, 7 (Suppl 1)
  • [2] Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human
    Zhaolei Zhang
    Andy Wing Chun Pang
    Mark Gerstein
    BMC Evolutionary Biology, 7
  • [3] Coexpression analysis of human genes across many microarray data sets
    Lee, HK
    Hsu, AK
    Sajdak, J
    Qin, J
    Pavlidis, P
    GENOME RESEARCH, 2004, 14 (06) : 1085 - 1094
  • [4] Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
    Lam, Tommy Tsan-Yuk
    Chong, Yee Ling
    Shi, Mang
    Hon, Chung-Chau
    Li, Jun
    Martin, Darren P.
    Tang, Julian Wei-Tze
    Mok, Chee-Keng
    Shih, Shin-Ru
    Yip, Chi-Wai
    Jiang, Jingwei
    Hui, Raymond Kin-Hei
    Pybus, Oliver G.
    Holmes, Edward C.
    Leung, Frederick Chi-Ching
    INFECTION GENETICS AND EVOLUTION, 2013, 18 : 367 - 378
  • [5] Computational and experimental analysis identifies many novel human genes
    Miyajima, N
    Burge, CB
    Saito, T
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2000, 272 (03) : 801 - 807
  • [6] Genome Reconstruction from Metagenomic Data Sets Reveals Novel Microbes in the Brackish Waters of the Caspian Sea
    Mehrshad, Maliheh
    Amoozegar, Mohammad Ali
    Ghai, Rohit
    Fazeli, Seyed Abolhassan Shahzadeh
    Rodriguez-Valera, Francisco
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2016, 82 (05) : 1599 - 1612
  • [7] Analysis of The Cancer Genome Atlas sequencing data reveals novel properties of the human papillomavirus 16 genome in head and neck squamous cell carcinoma
    Nulton, Tara J.
    Olex, Amy L.
    Dozmorov, Mikhail
    Morgan, Iain M.
    Windle, Brad
    ONCOTARGET, 2017, 8 (11) : 17684 - 17699
  • [8] Characterization of promoters integrated in the genome of bovine herpesvirus-1 (BHV-1)
    Murata, T
    Xuan, XN
    Otsuka, H
    JOURNAL OF VETERINARY MEDICAL SCIENCE, 1999, 61 (05) : 453 - 457
  • [9] Bayesian analysis of genome-wide inflammatory bowel disease data sets reveals new risk loci
    Zhang, Yu
    Tian, Lifeng
    Sleiman, Patrick
    Ghosh, Soumitra
    Hakonarson, Hakon
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 (02) : 265 - 274
  • [10] Bayesian analysis of genome-wide inflammatory bowel disease data sets reveals new risk loci
    Yu Zhang
    Lifeng Tian
    Patrick Sleiman
    Soumitra Ghosh
    Hakon Hakonarson
    European Journal of Human Genetics, 2018, 26 : 265 - 274