Modeling and analysis of RNA-seq data: a review from a statistical perspective

Cited by: 39
Authors
Li, Wei Vivian [1]
Li, Jingyi Jessica [1, 2]
Affiliations
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA);
Keywords
RNA-seq; statistical modeling; differentially expressed genes; alternatively spliced exons; isoform reconstruction and quantification;
DOI
10.1007/s40484-018-0144-7
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: Since their invention, next-generation RNA sequencing (RNA-seq) technologies have become a powerful tool for studying the presence and quantity of RNA molecules in biological samples, and they have revolutionized transcriptomic studies. The analysis of RNA-seq data at four different levels (samples, genes, transcripts, and exons) involves multiple statistical and computational questions, some of which remain challenging to date.
Results: We review RNA-seq analysis tools at the sample, gene, transcript, and exon levels from a statistical perspective. We also highlight the biological and statistical questions of greatest practical importance.
Conclusions: The development of statistical and computational methods for analyzing RNA-seq data has advanced significantly in the past decade. However, methods developed to answer the same biological question often rely on diverse statistical models and perform differently under different scenarios. This review discusses and compares multiple commonly used statistical models with respect to their assumptions, in the hope of helping users select appropriate methods and assisting developers in future method development.
Pages: 195 - 209
Page count: 15
Related papers
50 in total
  • [41] De novo assembly and analysis of RNA-seq data
    Gordon Robertson
    Jacqueline Schein
    Readman Chiu
    Richard Corbett
    Matthew Field
    Shaun D Jackman
    Karen Mungall
    Sam Lee
    Hisanaga Mark Okada
    Jenny Q Qian
    Malachi Griffith
    Anthony Raymond
    Nina Thiessen
    Timothee Cezard
    Yaron S Butterfield
    Richard Newsome
    Simon K Chan
    Rong She
    Richard Varhol
    Baljit Kamoh
    Anna-Liisa Prabhu
    Angela Tam
    YongJun Zhao
    Richard A Moore
    Martin Hirst
    Marco A Marra
    Steven J M Jones
    Pamela A Hoodless
    Inanc Birol
    [J]. Nature Methods, 2010, 7 : 909 - 912
  • [42] sRNAflow: A Tool for the Analysis of Small RNA-Seq Data
    Zayakin, Pawel
    [J]. NON-CODING RNA, 2024, 10 (01)
  • [43] Differential expression analysis for paired RNA-seq data
    Lisa M Chung
    John P Ferguson
    Wei Zheng
    Feng Qian
    Vincent Bruno
    Ruth R Montgomery
    Hongyu Zhao
    [J]. BMC Bioinformatics, 14
  • [44] A survey of best practices for RNA-seq data analysis
    Ana Conesa
    Pedro Madrigal
    Sonia Tarazona
    David Gomez-Cabrero
    Alejandra Cervera
    Andrew McPherson
    Michał Wojciech Szcześniak
    Daniel J. Gaffney
    Laura L. Elo
    Xuegong Zhang
    Ali Mortazavi
    [J]. Genome Biology, 17
  • [45] A comprehensive workflow for optimizing RNA-seq data analysis
    Jiang, Gao
    Zheng, Juan-Yu
    Ren, Shu-Ning
    Yin, Weilun
    Xia, Xinli
    Li, Yun
    Wang, Hou-Ling
    [J]. BMC GENOMICS, 2024, 25 (01):
  • [46] Oqtans: a multifunctional workbench for RNA-seq data analysis
    Vipin T Sreedharan
    Sebastian J Schultheiss
    Géraldine Jean
    André Kahles
    Regina Bohnert
    Philipp Drewe
    Pramod Mudrakarta
    Nico Görnitz
    Georg Zeller
    Gunnar Rätsch
    [J]. BMC Bioinformatics, 15
  • [47] Modeling Exon-Specific Bias Distribution Improves the Analysis of RNA-Seq Data
    Liu, Xuejun
    Zhang, Li
    Chen, Songcan
    [J]. PLOS ONE, 2015, 10 (10):
  • [48] A statistical normalization method and differential expression analysis for RNA-seq data between different species
    Zhou, Yan
    Zhu, Jiadi
    Tong, Tiejun
    Wang, Junhui
    Lin, Bingqing
    Zhang, Jun
    [J]. BMC BIOINFORMATICS, 2019, 20 (1)
  • [49] A statistical normalization method and differential expression analysis for RNA-seq data between different species
    Yan Zhou
    Jiadi Zhu
    Tiejun Tong
    Junhui Wang
    Bingqing Lin
    Jun Zhang
    [J]. BMC Bioinformatics, 20
  • [50] Disease Biomarker Query from RNA-Seq Data
    Han, Henry
    Jiang, Xiaoqian
    [J]. CANCER INFORMATICS, 2014, 13 : 81 - 94