Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

被引:13
|
作者
Wood, David L. A. [1 ]
Nones, Katia [1 ]
Steptoe, Anita [1 ]
Christ, Angelika [1 ]
Harliwong, Ivon [1 ]
Newell, Felicity [1 ]
Bruxner, Timothy J. C. [1 ]
Miller, David [1 ]
Cloonan, Nicole [2 ]
Grimmond, Sean M. [1 ,3 ]
机构
[1] Univ Queensland, Queensland Ctr Med Genom, Brisbane, Qld, Australia
[2] QIMR Berghofer Med Res Inst, Herston, Qld 4006, Australia
[3] Univ Glasgow, Translat Res Ctr, Glasgow, Lanark, Scotland
来源
PLOS ONE | 2015年 / 10卷 / 05期
基金
澳大利亚研究理事会;
关键词
HUMAN GENOME; TRANSCRIPTOME; HUMANS; METHYLATION; IMBALANCE; SEQUENCE; DISEASE; READS; RISK;
D O I
10.1371/journal.pone.0126911
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual's phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Deep Learning to Analyze RNA-Seq Gene Expression Data
    Urda, D.
    Montes-Torres, J.
    Moreno, F.
    Franco, L.
    Jerez, J. M.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT II, 2017, 10306 : 50 - 59
  • [42] Comparison between RNA-Seq and Affymetrix gene expression data
    Fumagalli, D.
    Haibe-Kains, B.
    Michiels, S.
    Brown, D. N.
    Gacquer, D.
    Majjaj, S.
    Salgado, R.
    Larsimont, D.
    Detour, V.
    Piccart, M.
    Sotiriou, C.
    Desmedt, C.
    CANCER RESEARCH, 2012, 72
  • [43] Bulk segregant RNA-seq reveals expression and positional candidate genes and allele-specific expression for disease resistance against enteric septicemia of catfish
    Ruijia Wang
    Luyang Sun
    Lisui Bao
    Jiaren Zhang
    Yanliang Jiang
    Jun Yao
    Lin Song
    Jianbin Feng
    Shikai Liu
    Zhanjiang Liu
    BMC Genomics, 14
  • [44] Bulk segregant RNA-seq reveals expression and positional candidate genes and allele-specific expression for disease resistance against enteric septicemia of catfish
    Wang, Ruijia
    Sun, Luyang
    Bao, Lisui
    Zhang, Jiaren
    Jiang, Yanliang
    Yao, Jun
    Song, Lin
    Feng, Jianbin
    Liu, Shikai
    Liu, Zhanjiang
    BMC GENOMICS, 2013, 14
  • [45] Application of bulk segregant RNA-Seq (BSR-Seq) and allele-specific primers to study soybean powdery mildew resistance
    Huang, Cheng-Chun
    Lin, Chen-Hsiang
    Lin, Yu-Cheng
    Chang, Hao-Xun
    BMC PLANT BIOLOGY, 2024, 24 (01)
  • [46] Application of bulk segregant RNA-Seq (BSR-Seq) and allele-specific primers to study soybean powdery mildew resistance
    Cheng-Chun Huang
    Chen-Hsiang Lin
    Yu-Cheng Lin
    Hao-Xun Chang
    BMC Plant Biology, 24
  • [47] BYASE: a Python']Python library for estimating gene and isoform level allele-specific expression
    Dong, Lili
    Wang, Jianan
    Wang, Guohua
    BIOINFORMATICS, 2020, 36 (19) : 4955 - 4956
  • [48] Fast and accurate approximate inference of transcript expression from RNA-seq data
    Hensman, James
    Papastamoulis, Panagiotis
    Glaus, Peter
    Honkela, Antti
    Rattray, Magnus
    BIOINFORMATICS, 2015, 31 (24) : 3881 - 3889
  • [49] Towards Reliable Isoform Quantification Using RNA-Seq Data
    Howard, Brian E.
    Heber, Steffen
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 130 - 135
  • [50] Towards reliable isoform quantification using RNA-SEQ data
    Brian E Howard
    Steffen Heber
    BMC Bioinformatics, 11