Clustering of mRNA-Seq data based on alternative splicing patterns

Cited by: 4
Authors
Johnson, Marla [1 ]
Purdom, Elizabeth [2 ]
Affiliations
[1] Univ Calif Berkeley, Div Biostat, 367 Evans Hall Berkeley, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, 367 Evans Hall Berkeley, Berkeley, CA 94720 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
Alternative splicing; Clustering; mRNA-Seq; Isoform expression; SF3B1; Mutations; Machinery; Pathway
DOI
10.1093/biostatistics/kxw044
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Sequencing of messenger RNA (mRNA) can provide estimates of the levels of individual isoforms within the cell. It remains to adapt many standard statistical methods commonly used for analyzing gene expression levels to take advantage of this additional information. One novel question is whether we can find clusters of samples that are distinguished not by their gene expression but by their isoform usage. We propose a novel approach for clustering mRNA-Seq data that identifies such clusters. We show via simulation that our method is more sensitive to clusters based on isoform usage than standard clustering techniques. We demonstrate its performance by finding a technical artifact that resulted in different batches having different isoform usage patterns, and illustrate its usage on several datasets from The Cancer Genome Atlas.
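To make the distinction between gene expression and isoform usage concrete, the following is a minimal toy sketch, not the method proposed in the paper. It assumes invented counts for a single hypothetical gene with two isoforms; the sample names, the `gene_totals`, `isoform_usage`, and `nearest_neighbor` helpers, and the use of Euclidean distance are all illustrative assumptions. It shows samples with identical gene-level totals that still separate once counts are converted to within-gene isoform proportions.

```python
from math import sqrt

# Hypothetical toy data (invented values): isoform-level counts per sample.
# Each sample maps a gene name to the counts of its isoforms.
samples = {
    "s1": {"geneA": [90, 10]},
    "s2": {"geneA": [85, 15]},
    "s3": {"geneA": [10, 90]},
    "s4": {"geneA": [20, 80]},
}

def gene_totals(sample):
    """Gene-level expression: sum of isoform counts for each gene."""
    return [sum(iso) for _, iso in sorted(sample.items())]

def isoform_usage(sample):
    """Isoform usage: each isoform's share of its gene's total count."""
    usage = []
    for _, iso in sorted(sample.items()):
        total = sum(iso)
        usage.extend(c / total if total else 0.0 for c in iso)
    return usage

def euclidean(a, b):
    """Plain Euclidean distance between two equal-length feature vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_neighbor(name, features):
    """Name of the other sample closest to `name` in the given feature space."""
    others = [n for n in features if n != name]
    return min(others, key=lambda n: euclidean(features[name], features[n]))

expr = {n: gene_totals(s) for n, s in samples.items()}
usage = {n: isoform_usage(s) for n, s in samples.items()}

# Gene-level totals are identical, so expression alone cannot separate samples.
print({n: euclidean(expr["s1"], expr[n]) for n in samples})
# Usage proportions pair s1 with s2 and s3 with s4.
print({n: nearest_neighbor(n, usage) for n in samples})
```

In this toy setting every sample has the same gene-level total, so any clustering on expression sees no structure, while the usage proportions split the samples into two groups. Real mRNA-Seq data would need estimated isoform abundances and a proper clustering procedure rather than a nearest-neighbor lookup.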
Pages: 295-307
Page count: 13