Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model

被引:4
|
作者
Li, Xiaohong [1 ,2 ]
Wu, Dongfeng [1 ]
Cooper, Nigel G. F. [2 ]
Rai, Shesh N. [1 ]
机构
[1] Univ Louisville, Sch Publ Hlth & Informat Sci, Dept Bioinformat & Biostat, Louisville, KY 40202 USA
[2] Univ Louisville, Sch Med, Dept Anat Sci & Neurobiol, Louisville, KY 40292 USA
基金
美国国家卫生研究院;
关键词
a Wald test; differentially expressed genes (DEGs); power analysis; RNA-seq; sample size; SIGNATURE;
D O I
10.1515/sagmb-2018-0021
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High throughput RNA sequencing (RNA-seq) technology is increasingly used in disease-related biomarker studies. A negative binomial distribution has become the popular choice for modeling read counts of genes in RNA-seq data due to over-dispersed read counts. In this study, we propose two explicit sample size calculation methods for RNA-seq data using a negative binomial regression model. To derive these new sample size formulas, the common dispersion parameter and the size factor as an offset via a natural logarithm link function are incorporated. A two-sided Wald test statistic derived from the coefficient parameter is used for testing a single gene at a nominal significance level 0.05 and multiple genes at a false discovery rate 0.05. The variance for the Wald test is computed from the variance-covariance matrix with the parameters estimated from the maximum likelihood estimates under the unrestricted and constrained scenarios. The performance and a sideby-side comparison of our new formulas with three existing methods with a Wald test, a likelihood ratio test or an exact test are evaluated via simulation studies. Since other methods are much computationally extensive, we recommend our M1 method for quick and direct estimation of sample sizes in an experimental design. Finally, we illustrate sample sizes estimation using an existing breast cancer RNA-seq data.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Statistical inference for time course RNA-Seq data using a negative binomial mixed-effect model
    Xiaoxiao Sun
    David Dalpiaz
    Di Wu
    Jun S. Liu
    Wenxuan Zhong
    Ping Ma
    [J]. BMC Bioinformatics, 17
  • [32] Statistical inference for time course RNA-Seq data using a negative binomial mixed-effect model
    Sun, Xiaoxiao
    Dalpiaz, David
    Wu, Di
    Liu, Jun S.
    Zhong, Wenxuan
    Ma, Ping
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [33] ZIAQ: a quantile regression method for differential expression analysis of single-cell RNA-seq data
    Zhang, Wenfei
    Wei, Ying
    Zhang, Donghui
    Xu, Ethan Y.
    [J]. BIOINFORMATICS, 2020, 36 (10) : 3124 - 3130
  • [34] Data Driven Feature Selection for RNA-Seq Differential Expression Analysis
    Han, Henry
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 114 - 115
  • [35] A scaling normalization method for differential expression analysis of RNA-seq data
    Robinson, Mark D.
    Oshlack, Alicia
    [J]. GENOME BIOLOGY, 2010, 11 (03):
  • [36] A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data
    Zhang, Zong Hong
    Jhaveri, Dhanisha J.
    Marshall, Vikki M.
    Bauer, Denis C.
    Edson, Janette
    Narayanan, Ramesh K.
    Robinson, Gregory J.
    Lundberg, Andreas E.
    Bartlett, Perry F.
    Wray, Naomi R.
    Zhao, Qiong-Yi
    [J]. PLOS ONE, 2014, 9 (08):
  • [37] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Bingqing Lin
    Li-Feng Zhang
    Xin Chen
    [J]. BMC Genomics, 15
  • [38] A scaling normalization method for differential expression analysis of RNA-seq data
    Mark D Robinson
    Alicia Oshlack
    [J]. Genome Biology, 11
  • [39] iDEP: integrated differential expression and pathway analysis of RNA-Seq data
    Ge, Steven X.
    [J]. CANCER RESEARCH, 2018, 78 (13)
  • [40] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Lin, Bingqing
    Zhang, Li-Feng
    Chen, Xin
    [J]. BMC GENOMICS, 2014, 15