Statistical analysis of variability in TnSeq data across conditions using zero-inflated negative binomial regression

被引:12
|
作者
Subramaniyam, Siddharth [1 ]
DeJesus, Michael A. [2 ]
Zaveri, Anisha [3 ]
Smith, Clare M. [4 ]
Baker, Richard E. [4 ]
Ehrt, Sabine [3 ]
Schnappinger, Dirk [3 ]
Sassetti, Christopher M. [4 ]
Ioerger, Thomas R. [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[2] Rockefeller Univ, 1230 York Ave, New York, NY 10021 USA
[3] Weill Cornell Med Coll, Dept Microbiol & Immunol, New York, NY USA
[4] Univ Massachusetts, Dept Microbiol & Physiol Syst, Med Sch, Worcester, MA USA
关键词
TnSeq; Transposon insertion library; Essentiality; Zero-inflated negative binomial distribution; Mycobacterium tuberculosis; TRANSPOSITION; REQUIRES; GROWTH;
D O I
10.1186/s12859-019-3156-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Deep sequencing of transposon mutant libraries (or TnSeq) is a powerful method for probing essentiality of genomic loci under different environmental conditions. Various analytical methods have been described for identifying conditionally essential genes whose tolerance for insertions varies between two conditions. However, for large-scale experiments involving many conditions, a method is needed for identifying genes that exhibit significant variability in insertions across multiple conditions. Results In this paper, we introduce a novel statistical method for identifying genes with significant variability of insertion counts across multiple conditions based on Zero-Inflated Negative Binomial (ZINB) regression. Using likelihood ratio tests, we show that the ZINB distribution fits TnSeq data better than either ANOVA or a Negative Binomial (in a generalized linear model). We use ZINB regression to identify genes required for infection of M. tuberculosis H37Rv in C57BL/6 mice. We also use ZINB to perform a analysis of genes conditionally essential in H37Rv cultures exposed to multiple antibiotics. Conclusions Our results show that, not only does ZINB generally identify most of the genes found by pairwise resampling (and vastly out-performs ANOVA), but it also identifies additional genes where variability is detectable only when the magnitudes of insertion counts are treated separately from local differences in saturation, as in the ZINB model.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Modeling of Parking Violations Using Zero-Inflated Negative Binomial Regression: A Case Study for Berlin
    Hagen, Tobias
    Reinfeld, Nicole
    Saki, Siavash
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (06) : 498 - 512
  • [32] Modeling shark bycatch: The zero-inflated negative binomial regression model with smoothing
    Minami, M.
    Lennert-Cody, C. E.
    Gao, W.
    Roman-Verdesoto, M.
    FISHERIES RESEARCH, 2007, 84 (02) : 210 - 221
  • [33] Parameter estimations of zero-inflated negative binomial model with incomplete data
    Pho, Kim-Hung
    Lukusa, T. Martin
    APPLIED MATHEMATICAL MODELLING, 2024, 129 : 207 - 231
  • [34] Estimation in zero-inflated binomial regression with missing covariates
    Diallo, Alpha Oumar
    Diop, Aliou
    Dupuy, Jean-Francois
    STATISTICS, 2019, 53 (04) : 839 - 865
  • [35] Assessing recurrence of depression using a zero-inflated negative binomial model: A secondary analysis of lifelog data
    Kumagai, Narimasa
    Tajika, Aran
    Hasegawa, Akio
    Kawanishi, Nao
    Fujita, Hirokazu
    Tsujino, Naohisa
    Jinnin, Ran
    Uchida, Megumi
    Okamoto, Yasumasa
    Akechi, Tatsuo
    Furukawa, Toshi A.
    PSYCHIATRY RESEARCH, 2021, 300
  • [36] A constrained marginal zero-inflated binomial regression model
    Ali, Essoham
    Diop, Aliou
    Dupuy, Jean-Francois
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (18) : 6396 - 6422
  • [37] Normalizing Metagenomic Hi-C Data and Detecting Spurious Contacts Using Zero-Inflated Negative Binomial Regression
    Du, Yuxuan
    Laperriere, Sarah M.
    Fuhrman, Jed
    Sun, Fengzhu
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (02) : 106 - 120
  • [38] A Zero-Inflated Negative Binomial Regression Model to Evaluate Ship Sinking Accident Mortalities
    Chai, Tian
    Xiong, De-qi
    Weng, Jinxian
    TRANSPORTATION RESEARCH RECORD, 2018, 2672 (11) : 65 - 72
  • [39] Bayesian estimation and case influence diagnostics for the zero-inflated negative binomial regression model
    Garay, Aldo M.
    Lachos, Victor H.
    Bolfarine, Heleno
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (06) : 1148 - 1165
  • [40] PTSD Symptom Severity, Cannabis, and Gender: A Zero-Inflated Negative Binomial Regression Model
    Rehder, Kristoffer
    Bowen, Sarah
    SUBSTANCE USE & MISUSE, 2019, 54 (08) : 1309 - 1318