Negative binomial additive model for RNA-Seq data analysis

被引:14
|
作者
Ren Xu [1 ]
Kuan Pei-Fen [1 ]
机构
[1] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
关键词
Bayesian shrinkage; Differential expression analysis; Generalized additive model; RNA-Seq; Spline model; DIFFERENTIAL EXPRESSION ANALYSIS; SMOOTHING PARAMETER; GENE; POWERFUL; PACKAGE; TESTS;
D O I
10.1186/s12859-020-3506-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: High-throughput sequencing experiments followed by differential expression analysis is a widely used approach for detecting genomic biomarkers. A fundamental step in differential expression analysis is to model the association between gene counts and covariates of interest. Existing models assume linear effect of covariates, which is restrictive and may not be sufficient for certain phenotypes. Results: We introduce NBAMSeq, a flexible statistical model based on the generalized additive model and allows for information sharing across genes in variance estimation. Specifically, we model the logarithm of mean gene counts as sums of smooth functions with the smoothing parameters and coefficients estimated simultaneously within a nested iterative method. The variance is estimated by the Bayesian shrinkage approach to fully exploit the information across all genes. Conclusions: Based on extensive simulations and case studies of RNA-Seq data, we show that NBAMSeq offers improved performance in detecting nonlinear effect and maintains equivalent performance in detecting linear effect compared to existing methods. The vignette and source code of NBAMSeq are available at http:// bioconductor.org/packages/release/bioc/html/NBAMSeq.html.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] RseqFlow: workflows for RNA-Seq data analysis
    Wang, Ying
    Mehta, Gaurang
    Mayani, Rajiv
    Lu, Jingxi
    Souaiaia, Tade
    Chen, Yangho
    Clark, Andrew
    Yoon, Hee Jae
    Wan, Lin
    Evgrafov, Oleg V.
    Knowles, James A.
    Deelman, Ewa
    Chen, Ting
    [J]. BIOINFORMATICS, 2011, 27 (18) : 2598 - 2600
  • [22] A Unified Model for Robust Differential Expression Analysis of RNA-Seq Data
    Liu, Kefei
    Shen, Li
    Jiang, Hui
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 437 - 442
  • [23] A Comprehensive Review on RNA-seq Data Analysis
    Zhang Li
    Liu Xuejun
    [J]. Transactions of Nanjing University of Aeronautics and Astronautics, 2016, 33 (03) : 339 - 361
  • [24] Parametric analysis of RNA-seq expression data
    Konishi, Tomokazu
    [J]. GENES TO CELLS, 2016, 21 (06) : 639 - 647
  • [25] RNA-Seq UD: A bioinformatics plattform for RNA-Seq analysis
    Ramirez, Miguel
    Alejandro Rojas-Quintero, Cristian
    Enrique Vera-Parra, Nelson
    [J]. 2015 10TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2015,
  • [26] Single-gene negative binomial regression models for RNA-Seq data with higher-order asymptotic inference
    Di, Yanming
    [J]. STATISTICS AND ITS INTERFACE, 2015, 8 (04) : 405 - 418
  • [27] Model-based clustering for RNA-seq data
    Si, Yaqing
    Liu, Peng
    Li, Pinghua
    Brutnell, Thomas P.
    [J]. BIOINFORMATICS, 2014, 30 (02) : 197 - 205
  • [28] Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
    Yu, Danni
    Huber, Wolfgang
    Vitek, Olga
    [J]. BIOINFORMATICS, 2013, 29 (10) : 1275 - 1282
  • [29] Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data
    Ghosh, Debashis
    Qin, Zhaohui S.
    [J]. GENES, 2010, 1 (02) : 317 - 334
  • [30] RNA-Seq Data Analysis: A Practical Guide for Model and Non-Model Organisms
    Pola-Sanchez, Enrique
    Hernandez-Martinez, Karen Magdalena
    Perez-Estrada, Rafael
    Selem-Mojica, Nelly
    Simpson, June
    Abraham-Juarez, Maria Jazmin
    Herrera-Estrella, Alfredo
    Villalobos-Escobedo, Jose Manuel
    [J]. CURRENT PROTOCOLS, 2024, 4 (05):