Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial

Cited by: 1
Authors
Lee, Kyung Hyun [1]
Pedroza, Claudia [1]
Avritscher, Elenir B. C. [1]
Mosquera, Ricardo A. [1]
Tyson, Jon E. [1]
Affiliations
[1] Univ Texas Hlth Sci Ctr Houston, Inst Clin Res & Learning Hlth Care, Houston, TX 77030 USA
Keywords
Zero-inflated regression model; Count data; Negative binomial; Telemedicine; ELECTRONIC HEALTH RECORDS; POISSON REGRESSION; HURDLE MODELS; IMPACT; TESTS;
DOI
10.1186/s13063-023-07648-8
Chinese Library Classification (CLC) number
R-3 [Medical research methods]; R3 [Basic medicine]
Discipline classification code
1001
Abstract
Background: Two characteristics of commonly used outcomes in medical research are zero inflation and non-negative integers; examples include the number of hospital admissions or emergency department visits, where the majority of patients will have zero counts. Zero-inflated regression models were devised to analyze this type of data. However, the performance of zero-inflated regression models or the properties of data best suited for these analyses have not been thoroughly investigated.
Methods: We conducted a simulation study to evaluate the performance of two generalized linear models, negative binomial and zero-inflated negative binomial, for analyzing zero-inflated count data. Simulation scenarios assumed a randomized controlled trial design and varied the true underlying distribution, sample size, and rate of zero inflation. We compared the models in terms of bias, mean squared error, and coverage. Additionally, we used logistic regression to determine which data properties are most important for predicting the best-fitting model.
Results: We first found that, regardless of the rate of zero inflation, there was little difference between the conventional negative binomial and its zero-inflated counterpart in terms of bias of the marginal treatment group coefficient. Second, even when the outcome was simulated from a zero-inflated distribution, a negative binomial model was favored above its ZI counterpart in terms of the Akaike Information Criterion. Third, the mean and skewness of the non-zero part of the data were stronger predictors of model preference than the percentage of zero counts. These results were not affected by the sample size, which ranged from 60 to 800.
Conclusions: We recommend that the rate of zero inflation and overdispersion in the outcome should not be the sole and main justification for choosing zero-inflated regression models. Investigators should also consider other data characteristics when choosing a model for count data. In addition, if the performance of the NB and ZINB regression models is reasonably comparable even with ZI outcomes, we advocate the use of the NB regression model due to its clear and straightforward interpretation of the results.
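As a rough illustration of the two distributions the abstract compares, the sketch below writes out the negative binomial (NB) and zero-inflated negative binomial (ZINB) probability mass functions. It is not the authors' simulation code. The mean/dispersion (NB2) parameterization, the function names `nb_pmf` and `zinb_pmf`, and the parameter values are assumptions made here for illustration only; the abstract does not state them.

```python
import math


def nb_pmf(y, mu, alpha):
    """NB pmf with mean mu and dispersion alpha (variance mu + alpha * mu**2).

    This NB2 parameterization is an assumption of this sketch.
    """
    r = 1.0 / alpha          # size parameter
    p = r / (r + mu)         # success probability
    log_pmf = (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(p) + y * math.log1p(-p)
    )
    return math.exp(log_pmf)


def zinb_pmf(y, mu, alpha, pi):
    """ZINB pmf: a structural zero with probability pi, otherwise an NB draw."""
    base = (1.0 - pi) * nb_pmf(y, mu, alpha)
    return pi + base if y == 0 else base


# Hypothetical parameter values, chosen only for illustration.
mu, alpha, pi = 2.0, 1.5, 0.3

p_zero_nb = nb_pmf(0, mu, alpha)
p_zero_zinb = zinb_pmf(0, mu, alpha, pi)

# Marginal mean of the ZINB outcome, summed over a long truncated support.
marginal_mean = sum(y * zinb_pmf(y, mu, alpha, pi) for y in range(2000))

print(f"P(Y = 0) under NB:   {p_zero_nb:.4f}")
print(f"P(Y = 0) under ZINB: {p_zero_zinb:.4f}")
print(f"ZINB marginal mean:  {marginal_mean:.4f}")
```

Under these assumptions the ZINB marginal mean equals (1 - pi) * mu. An overdispersed NB can already place substantial mass at zero without a separate inflation component, which is one reason the percentage of zeros alone may be a weak guide to model choice.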
Pages: 11
Related papers
50 in total
  • [31] Modeling citrus huanglongbing data using a zero-inflated negative binomial distribution
    de Almeida, Eudmar Paiva
    Janeiro, Vanderly
    Guedes, Terezinha Aparecida
    Mulati, Fabio
    Pedroza Carneiro, Jose Walter
    de Carvalho Nunes, William Mario
    [J]. ACTA SCIENTIARUM-AGRONOMY, 2016, 38 (03) : 299 - 306
  • [32] Parameter Estimation on Zero-Inflated Negative Binomial Regression with Right Truncated Data
    Saffari, Seyed Ehsan
    Adnan, Robiah
    [J]. SAINS MALAYSIANA, 2012, 41 (11) : 1483 - 1487
  • [33] Sampling plans for the zero-inflated negative binomial distribution in the food industry
    Wang, Fu-Kwun
    Hailemariam, Shalemu Sharew
    [J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2018, 34 (06) : 1174 - 1184
  • [34] A New Zero-Inflated Negative Binomial Methodology for Latent Category Identification
    Simon J. Blanchard
    Wayne S. DeSarbo
    [J]. Psychometrika, 2013, 78 : 322 - 340
  • [35] Some Theoretical Comparisons of Negative Binomial and Zero-Inflated Poisson Distributions
    Feng, Changyong
    Wang, Hongyue
    Han, Yu
    Xia, Yinglin
    Lu, Naiji
    Tu, Xin M.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (15) : 3266 - 3277
  • [36] Improved shrinkage estimators in zero-inflated negative binomial regression model
    Zandi, Zahra
    Bevrani, Hossein
    Belaghi, Reza Arabi
    [J]. HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2021, 50 (06) : 1855 - 1876
  • [37] A new Stein estimator for the zero-inflated negative binomial regression model
    Akram, Muhammad Nauman
    Abonazel, Mohamed R.
    Amin, Muhammad
    Kibria, B. M. Golam
    Afzal, Nimra
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (19):
  • [38] A New Zero-Inflated Negative Binomial Methodology for Latent Category Identification
    Blanchard, Simon J.
    DeSarbo, Wayne S.
    [J]. PSYCHOMETRIKA, 2013, 78 (02) : 322 - 340
  • [39] A zero-inflated negative binomial regression model with hidden Markov chain
    Wang, Peiming
    Alba, Joseph D.
    [J]. ECONOMICS LETTERS, 2006, 92 (02) : 209 - 213
  • [40] The Zero-Inflated Negative Binomial Semiparametric Regression Model: Application to Number of Failing Grades Data
    Aráujo E.G.
    Vasconcelos J.C.S.
    dos Santos D.P.
    Ortega E.M.M.
    de Souza D.
    Zanetoni J.P.F.
    [J]. Annals of Data Science, 2023, 10 (04) : 991 - 1006