Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

被引:13
|
作者
Wood, David L. A. [1 ]
Nones, Katia [1 ]
Steptoe, Anita [1 ]
Christ, Angelika [1 ]
Harliwong, Ivon [1 ]
Newell, Felicity [1 ]
Bruxner, Timothy J. C. [1 ]
Miller, David [1 ]
Cloonan, Nicole [2 ]
Grimmond, Sean M. [1 ,3 ]
机构
[1] Univ Queensland, Queensland Ctr Med Genom, Brisbane, Qld, Australia
[2] QIMR Berghofer Med Res Inst, Herston, Qld 4006, Australia
[3] Univ Glasgow, Translat Res Ctr, Glasgow, Lanark, Scotland
来源
PLOS ONE | 2015年 / 10卷 / 05期
基金
澳大利亚研究理事会;
关键词
HUMAN GENOME; TRANSCRIPTOME; HUMANS; METHYLATION; IMBALANCE; SEQUENCE; DISEASE; READS; RISK;
D O I
10.1371/journal.pone.0126911
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual's phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Fully Bayesian analysis of allele-specific RNA-seq data
    Alvarez-Castro, Ignacio
    Niemi, Jarad
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (06) : 7751 - 7770
  • [2] Estimates of allele-specific expression in Drosophila with a single genome sequence and RNA-seq data
    Quinn, Andrew
    Juneja, Punita
    Jiggins, Francis M.
    BIOINFORMATICS, 2014, 30 (18) : 2603 - 2610
  • [3] RNA-Seq Reveals Allele-Specific Expression of Somatic Mutations in Neuroblastoma
    Sun, Lan
    Liu, Qian
    Tu, Lingli
    Li, Xiaoqing
    Wang, Jingyi
    Wang, Kai
    Zhong, Jiang F.
    MOLECULAR THERAPY, 2021, 29 (04) : 194 - 195
  • [4] Allelic imbalance metre (Allim), a new tool for measuring allele-specific gene expression with RNA-seq data
    Pandey, Ram Vinay
    Franssen, Susanne U.
    Futschik, Andreas
    Schloetterer, Christian
    MOLECULAR ECOLOGY RESOURCES, 2013, 13 (04) : 740 - 745
  • [5] A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data
    Skelly, Daniel A.
    Johansson, Marnie
    Madeoy, Jennifer
    Wakefield, Jon
    Akey, Joshua M.
    GENOME RESEARCH, 2011, 21 (10) : 1728 - 1737
  • [6] A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes
    Naoki Nariai
    Kaname Kojima
    Takahiro Mimori
    Yosuke Kawai
    Masao Nagasaki
    BMC Genomics, 17
  • [7] Assessing allele-specific expression across multiple tissues from RNA-seq read data
    Pirinen, Matti
    Lappalainen, Tuuli
    Zaitlen, Noah A.
    Dermitzakis, Emmanouil T.
    Donnelly, Peter
    McCarthy, Mark I.
    Rivas, Manuel A.
    BIOINFORMATICS, 2015, 31 (15) : 2497 - 2504
  • [8] A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes
    Nariai, Naoki
    Kojima, Kaname
    Mimori, Takahiro
    Kawai, Yosuke
    Nagasaki, Masao
    BMC GENOMICS, 2016, 17
  • [9] Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression
    Raghupathy, Narayanan
    Choi, Kwangbom
    Vincent, Matthew J.
    Beane, Glen L.
    Sheppard, Keith S.
    Munger, Steven C.
    Korstanje, Ron
    Pardo-Manual de Villena, Fernando
    Churchill, Gary A.
    BIOINFORMATICS, 2018, 34 (13) : 2177 - 2184
  • [10] An Alignment-Free Regression Approach for Estimating Allele-Specific Expression Using RNA-Seq Data
    Fu, Chen-Ping
    Jojic, Vladimir
    McMillan, Leonard
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB2014, 2014, 8394 : 69 - 84