Negative binomial additive model for RNA-Seq data analysis

被引:14
|
作者
Ren Xu [1 ]
Kuan Pei-Fen [1 ]
机构
[1] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
关键词
Bayesian shrinkage; Differential expression analysis; Generalized additive model; RNA-Seq; Spline model; DIFFERENTIAL EXPRESSION ANALYSIS; SMOOTHING PARAMETER; GENE; POWERFUL; PACKAGE; TESTS;
D O I
10.1186/s12859-020-3506-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: High-throughput sequencing experiments followed by differential expression analysis is a widely used approach for detecting genomic biomarkers. A fundamental step in differential expression analysis is to model the association between gene counts and covariates of interest. Existing models assume linear effect of covariates, which is restrictive and may not be sufficient for certain phenotypes. Results: We introduce NBAMSeq, a flexible statistical model based on the generalized additive model and allows for information sharing across genes in variance estimation. Specifically, we model the logarithm of mean gene counts as sums of smooth functions with the smoothing parameters and coefficients estimated simultaneously within a nested iterative method. The variance is estimated by the Bayesian shrinkage approach to fully exploit the information across all genes. Conclusions: Based on extensive simulations and case studies of RNA-Seq data, we show that NBAMSeq offers improved performance in detecting nonlinear effect and maintains equivalent performance in detecting linear effect compared to existing methods. The vignette and source code of NBAMSeq are available at http:// bioconductor.org/packages/release/bioc/html/NBAMSeq.html.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Negative binomial additive model for RNA-Seq data analysis
    Xu Ren
    Pei-Fen Kuan
    [J]. BMC Bioinformatics, 21
  • [2] NBLDA: negative binomial linear discriminant analysis for RNA-Seq data
    Kai Dong
    Hongyu Zhao
    Tiejun Tong
    Xiang Wan
    [J]. BMC Bioinformatics, 17
  • [3] NBLDA: negative binomial linear discriminant analysis for RNA-Seq data
    Dong, Kai
    Zhao, Hongyu
    Tong, Tiejun
    Wan, Xiang
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [4] A sparse negative binomial mixture model for clustering RNA-seq count data
    Li, Yujia
    Rahman, Tanbin
    Ma, Tianzhou
    Tang, Lu
    Tseng, George C.
    [J]. BIOSTATISTICS, 2022, 24 (01) : 68 - 84
  • [5] Bayesian Analysis of RNA-Seq Data Using a Family of Negative Binomial Models
    Zhao, Lili
    Wu, Weisheng
    Feng, Dai
    Jiang, Hui
    Nguyen, XuanLong
    [J]. BAYESIAN ANALYSIS, 2018, 13 (02): : 411 - 436
  • [6] A SPARSE NEGATIVE BINOMIAL CLASSIFIER WITH COVARIATE ADJUSTMENT FOR RNA-SEQ DATA
    Rahman, Tanbin
    Huang, Hsin-En
    Li, Yujia
    Tai, An-Shun
    Hseih, Wen-Ping
    McClung, Colleen A.
    Tseng, George
    [J]. ANNALS OF APPLIED STATISTICS, 2022, 16 (02): : 1071 - 1089
  • [7] Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model
    Li, Xiaohong
    Wu, Dongfeng
    Cooper, Nigel G. F.
    Rai, Shesh N.
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (01)
  • [8] Marginal likelihood estimation of negative binomial parameters with applications to RNA-seq data
    Leon-Novelo, Luis
    Fuentes, Claudio
    Emerson, Sarah
    [J]. BIOSTATISTICS, 2017, 18 (04) : 637 - 650
  • [9] Dynamic Model for RNA-seq Data Analysis
    Li, Lerong
    Xiong, Momiao
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [10] Statistical inference for time course RNA-Seq data using a negative binomial mixed-effect model
    Xiaoxiao Sun
    David Dalpiaz
    Di Wu
    Jun S. Liu
    Wenxuan Zhong
    Ping Ma
    [J]. BMC Bioinformatics, 17