Bayesian Generalized Additive Models for Location, Scale, and Shape for Zero-Inflated and Overdispersed Count Data

被引:44
|
作者
Klein, Nadja [1 ]
Kneib, Thomas [1 ]
Lang, Stefan [2 ]
机构
[1] Univ Gottingen, Dept Stat & Econometr, D-37073 Gottingen, Germany
[2] Univ Innsbruck, Dept Stat, A-6020 Innsbruck, Austria
关键词
Iteratively weighted least squares; Markov chain Monte Carlo; Penalized splines; Zero-inflated negative binomial; Zero-inflated Poisson; REGRESSION-MODELS; PARAMETERS; POSTERIORS; PROPRIETY; SELECTION; PRIORS;
D O I
10.1080/01621459.2014.912955
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Frequent problems in applied research preventing the application of the classical Poisson log-linear model for analyzing count data include overdispersion, an excess of zeros compared to the Poisson distribution, correlated responses, as well as complex predictor structures comprising nonlinear effects of continuous covariates, interactions or spatial effects. We propose a general class of Bayesian generalized additive models for zero-inflated and overdispersed count data within the framework of generalized additive models for location, scale, and shape where semiparametric predictors can be specified for several parameters of a count data distribution. As standard options for applied work we consider the zero-inflated Poisson, the negative binomial and the zero-inflated negative binomial distribution. The additive predictor specifications rely on basis function approximations for the different types of effects in combination with Gaussian smoothness priors. We develop Bayesian inference based on Markov chain Monte Carlo simulation techniques where suitable proposal densities are constructed based on iteratively weighted least squares approximations to the full conditionals. To ensure practicability of the inference, we consider theoretical properties like the involved question whether the joint posterior is proper. The proposed approach is evaluated in simulation studies and applied to count data arising from patent citations and claim frequencies in car insurances. For the comparison of models with respect to the distribution, we consider quantile residuals as an effective graphical device and scoring rules that allow us to quantify the predictive ability of the models. The deviance information criterion is used to select appropriate predictor specifications once a response distribution has been chosen. Supplementary materials for this article are available online.
引用
收藏
页码:405 / 419
页数:15
相关论文
共 50 条
  • [1] Exponential dispersion models for overdispersed zero-inflated count data
    Bar-Lev, Shaul K.
    Ridder, Ad
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3286 - 3304
  • [2] Models for Zero-Inflated and Overdispersed Correlated Count Data: An Application to Cigarette Use
    Pittman, Brian
    Buta, Eugenia
    Garrison, Kathleen
    Gueorguieva, Ralitza
    [J]. NICOTINE & TOBACCO RESEARCH, 2023, 25 (05) : 996 - 1003
  • [3] A marginalized model for zero-inflated, overdispersed and correlated count data
    Iddia, Samuel
    Molenberghs, Geert
    [J]. ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2013, 6 (02) : 149 - 165
  • [4] Examples of Computing Power for Zero-Inflated and Overdispersed Count Data
    Doyle, Suzanne R.
    [J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2009, 8 (02) : 360 - 376
  • [5] Models for Analyzing Zero-Inflated and Overdispersed Count Data: An Application to Cigarette and Marijuana Use
    Pittman, Brian
    Buta, Eugenia
    Krishnan-Sarin, Suchitra
    O'Malley, Stephanie S.
    Liss, Thomas
    Gueorguieva, Ralitza
    [J]. NICOTINE & TOBACCO RESEARCH, 2020, 22 (08) : 1390 - 1398
  • [6] Generalized Additive Models for Zero-Inflated Data with Partial Constraints
    Liu, Hai
    Chan, Kung-Sik
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2011, 38 (04) : 650 - 665
  • [7] A joint model for hierarchical continuous and zero-inflated overdispersed count data
    Kassahun, Wondwosen
    Neyens, Thomas
    Molenberghs, Geert
    Faes, Christel
    Verbeke, Geert
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (03) : 552 - 571
  • [8] Modeling Overdispersion, Autocorrelation, and Zero-Inflated Count Data Via Generalized Additive Models and Bayesian Statistics in an Aphid Population Study
    Carvalho, F. J.
    de Santana, D. G.
    Sampaio, M., V
    [J]. NEOTROPICAL ENTOMOLOGY, 2020, 49 (01) : 40 - 51
  • [9] Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros
    Kassahun, Wondwosen
    Neyens, Thomas
    Molenberghs, Geert
    Faes, Christel
    Verbeke, Geert
    [J]. STATISTICS IN MEDICINE, 2014, 33 (25) : 4402 - 4419
  • [10] Modeling Overdispersion, Autocorrelation, and Zero-Inflated Count Data Via Generalized Additive Models and Bayesian Statistics in an Aphid Population Study
    F. J. Carvalho
    D. G. de Santana
    M. V. Sampaio
    [J]. Neotropical Entomology, 2020, 49 : 40 - 51