Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

被引:13
|
作者
Wood, David L. A. [1 ]
Nones, Katia [1 ]
Steptoe, Anita [1 ]
Christ, Angelika [1 ]
Harliwong, Ivon [1 ]
Newell, Felicity [1 ]
Bruxner, Timothy J. C. [1 ]
Miller, David [1 ]
Cloonan, Nicole [2 ]
Grimmond, Sean M. [1 ,3 ]
机构
[1] Univ Queensland, Queensland Ctr Med Genom, Brisbane, Qld, Australia
[2] QIMR Berghofer Med Res Inst, Herston, Qld 4006, Australia
[3] Univ Glasgow, Translat Res Ctr, Glasgow, Lanark, Scotland
来源
PLOS ONE | 2015年 / 10卷 / 05期
基金
澳大利亚研究理事会;
关键词
HUMAN GENOME; TRANSCRIPTOME; HUMANS; METHYLATION; IMBALANCE; SEQUENCE; DISEASE; READS; RISK;
D O I
10.1371/journal.pone.0126911
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual's phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] WemIQ: an accurate and robust isoform quantification method for RNA-seq data
    Zhang, Jing
    Kuo, C. -C. Jay
    Chen, Liang
    BIOINFORMATICS, 2015, 31 (06) : 878 - 885
  • [22] ISOFORM ABUNDANCE INFERENCE PROVIDES A MORE ACCURATE ESTIMATION OF GENE EXPRESSION LEVELS IN RNA-SEQ
    Wang, Xi
    Wu, Zhengpeng
    Zhang, Xuegong
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2010, 8 : 177 - 192
  • [23] Analysis of allele-specific expression using RNA-seq of the Korean native pig and Landrace reciprocal cross
    Ahn, Byeongyong
    Choi, Min-Kyeung
    Yum, Joori
    Cho, In-Cheol
    Kim, Jin-Hoi
    Park, Chankyu
    ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES, 2019, 32 (12): : 1816 - 1825
  • [24] GeneiASE: Detection of condition-dependent and static allele-specific expression from RNA-seq data without haplotype information
    Edsgard, Daniel
    Iglesias, Maria Jesus
    Reilly, Sarah-Jayne
    Hamsten, Anders
    Tornvall, Per
    Odeberg, Jacob
    Emanuelsson, Olof
    SCIENTIFIC REPORTS, 2016, 6
  • [25] GeneiASE: Detection of condition-dependent and static allele-specific expression from RNA-seq data without haplotype information
    Daniel Edsgärd
    Maria Jesus Iglesias
    Sarah-Jayne Reilly
    Anders Hamsten
    Per Tornvall
    Jacob Odeberg
    Olof Emanuelsson
    Scientific Reports, 6
  • [26] Variant calling from RNA-seq data of the brain transcriptome of pigs and its application for allele-specific expression and imprinting analysis
    Oczkowicz, Maria
    Szmatola, Tomasz
    Piorkowska, Katarzyna
    Ropka-Molik, Katarzyna
    GENE, 2018, 641 : 367 - 375
  • [27] PennSeq: accurate isoform-specific gene expression quantification in RNA-Seq by modeling non-uniform read distribution
    Hu, Yu
    Liu, Yichuan
    Mao, Xianyun
    Jia, Cheng
    Ferguson, Jane F.
    Xue, Chenyi
    Reilly, Muredach P.
    Li, Hongzhe
    Li, Mingyao
    NUCLEIC ACIDS RESEARCH, 2014, 42 (03)
  • [28] Joint estimation of isoform expression and isoform-specific read distribution using multisample RNA-Seq data
    Suo, Chen
    Calza, Stefano
    Salim, Agus
    Pawitan, Yudi
    BIOINFORMATICS, 2014, 30 (04) : 506 - 513
  • [29] Discovering Single Nucleotide Polymorphisms Regulating Human Gene Expression Using Allele Specific Expression from RNA-seq Data
    Kang, Eun Yong
    Martin, Lisa J.
    Mangul, Serghei
    Isvilanonda, Warin
    Zou, Jennifer
    Ben-David, Eyal
    Han, Buhm
    Lusis, Aldons J.
    Shifman, Sagiv
    Eskin, Eleazar
    GENETICS, 2016, 204 (03) : 1057 - +
  • [30] Statistical inferences for isoform expression in RNA-Seq
    Jiang, Hui
    Wong, Wing Hung
    BIOINFORMATICS, 2009, 25 (08) : 1026 - 1032