Modeling Zero-Inflated and Overdispersed Count Data: An Empirical Study of School Suspensions

被引:18
|
作者
Desjardins, Christopher David [1 ]
机构
[1] Univ Iceland, Reykjavik, Iceland
来源
JOURNAL OF EXPERIMENTAL EDUCATION | 2016年 / 84卷 / 03期
关键词
overdispersed; school suspensions; zero-inflated; count data; hurdle; POISSON REGRESSION; BAYESIAN-ANALYSIS; HURDLE MODELS; SELECTION; TESTS; ABUNDANCE; TUTORIAL;
D O I
10.1080/00220973.2015.1054334
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The purpose of this article is to develop a statistical model that best explains variability in the number of school days suspended. Number of school days suspended is a count variable that may be zero-inflated and overdispersed relative to a Poisson model. Four models were examined: Poisson, negative binomial, Poisson hurdle, and negative binomial hurdle. Additionally, the probability of a student being suspended for at least 1day was modeled using a binomial logistic regression model. Of the count models considered, the negative binomial hurdle model had the best fit. Modeling the probability of a student being suspended for at least 1day using a binomial logistic regression model with interactions fit both the training and test data and had adequate fit. Findings here suggest that both the negative binomial hurdle and the binomial logistic regression models should be considered when modeling school suspensions.
引用
收藏
页码:449 / 472
页数:24
相关论文
共 50 条
  • [21] Modelling correlated zero-inflated count data
    Dobbie, MJ
    Welsh, AH
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2001, 43 (04) : 431 - 444
  • [22] A joint modeling of longitudinal zero-inflated count data and time to event data
    Kim, Donguk
    Chun, Jihun
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (07) : 1459 - 1473
  • [23] On modeling zero-inflated insurance data
    Perez Sanchez, J. M.
    Gomez-Deniz, E.
    JOURNAL OF RISK MODEL VALIDATION, 2016, 10 (04): : 23 - 37
  • [24] Zero-inflated modeling part II: Zero-inflated models for complex data structures
    Young, Derek S.
    Roemmele, Eric S.
    Shi, Xuan
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (02)
  • [25] BAYESIAN SPATIAL-TEMPORAL MODELING OF ECOLOGICAL ZERO-INFLATED COUNT DATA
    Wang, Xia
    Chen, Ming-Hui
    Kuo, Rita C.
    Dey, Dipak K.
    STATISTICA SINICA, 2015, 25 (01) : 189 - 204
  • [26] Multiple imputation of incomplete zero-inflated count data
    Kleinke, Kristian
    Reinecke, Jost
    STATISTICA NEERLANDICA, 2013, 67 (03) : 311 - 336
  • [27] Decision tree approaches for zero-inflated count data
    Lee, Seong-Keon
    Jin, Seohoon
    JOURNAL OF APPLIED STATISTICS, 2006, 33 (08) : 853 - 865
  • [28] Response to comments on "Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros"
    Molenberghs, Geert
    Poveda, Alvaro Florez
    Kassahun, Wondwosen
    Neyens, Thomas
    Faes, Christel
    Verbeke, Geert
    STATISTICS IN MEDICINE, 2018, 37 (11) : 1942 - 1946
  • [29] Semiparametric analysis of longitudinal zero-inflated count data
    Feng, Jiarui
    Zhu, Zhongyi
    JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (01) : 61 - 72
  • [30] Zero-inflated models with application to spatial count data
    Agarwal, DK
    Gelfand, AE
    Citron-Pousty, S
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2002, 9 (04) : 341 - 355