Modeling Zero-Inflated and Overdispersed Count Data: An Empirical Study of School Suspensions

被引:17
|
作者
Desjardins, Christopher David [1 ]
机构
[1] Univ Iceland, Reykjavik, Iceland
来源
JOURNAL OF EXPERIMENTAL EDUCATION | 2016年 / 84卷 / 03期
关键词
overdispersed; school suspensions; zero-inflated; count data; hurdle; POISSON REGRESSION; BAYESIAN-ANALYSIS; HURDLE MODELS; SELECTION; TESTS; ABUNDANCE; TUTORIAL;
D O I
10.1080/00220973.2015.1054334
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The purpose of this article is to develop a statistical model that best explains variability in the number of school days suspended. Number of school days suspended is a count variable that may be zero-inflated and overdispersed relative to a Poisson model. Four models were examined: Poisson, negative binomial, Poisson hurdle, and negative binomial hurdle. Additionally, the probability of a student being suspended for at least 1day was modeled using a binomial logistic regression model. Of the count models considered, the negative binomial hurdle model had the best fit. Modeling the probability of a student being suspended for at least 1day using a binomial logistic regression model with interactions fit both the training and test data and had adequate fit. Findings here suggest that both the negative binomial hurdle and the binomial logistic regression models should be considered when modeling school suspensions.
引用
收藏
页码:449 / 472
页数:24
相关论文
共 50 条
  • [1] A marginalized model for zero-inflated, overdispersed and correlated count data
    Iddia, Samuel
    Molenberghs, Geert
    [J]. ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2013, 6 (02) : 149 - 165
  • [2] Examples of Computing Power for Zero-Inflated and Overdispersed Count Data
    Doyle, Suzanne R.
    [J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2009, 8 (02) : 360 - 376
  • [3] Exponential dispersion models for overdispersed zero-inflated count data
    Bar-Lev, Shaul K.
    Ridder, Ad
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3286 - 3304
  • [4] Multilevel modeling in single-case studies with zero-inflated and overdispersed count data
    Li, Haoran
    Luo, Wen
    Baek, Eunkyeng
    [J]. BEHAVIOR RESEARCH METHODS, 2024, 56 (4) : 2765 - 2781
  • [5] A joint model for hierarchical continuous and zero-inflated overdispersed count data
    Kassahun, Wondwosen
    Neyens, Thomas
    Molenberghs, Geert
    Faes, Christel
    Verbeke, Geert
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (03) : 552 - 571
  • [6] Models for Zero-Inflated and Overdispersed Correlated Count Data: An Application to Cigarette Use
    Pittman, Brian
    Buta, Eugenia
    Garrison, Kathleen
    Gueorguieva, Ralitza
    [J]. NICOTINE & TOBACCO RESEARCH, 2023, 25 (05) : 996 - 1003
  • [7] Count Regression and Machine Learning Techniques for Zero-Inflated Overdispersed Count Data: Application to Ecological Data
    Sidumo B.
    Sonono E.
    Takaidza I.
    [J]. Annals of Data Science, 2024, 11 (03) : 803 - 817
  • [8] Modeling count data with marginalized zero-inflated distributions
    Cummings, Tammy H.
    Hardin, James W.
    [J]. STATA JOURNAL, 2019, 19 (03): : 499 - 509
  • [9] Models for Analyzing Zero-Inflated and Overdispersed Count Data: An Application to Cigarette and Marijuana Use
    Pittman, Brian
    Buta, Eugenia
    Krishnan-Sarin, Suchitra
    O'Malley, Stephanie S.
    Liss, Thomas
    Gueorguieva, Ralitza
    [J]. NICOTINE & TOBACCO RESEARCH, 2020, 22 (08) : 1390 - 1398
  • [10] Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros
    Kassahun, Wondwosen
    Neyens, Thomas
    Molenberghs, Geert
    Faes, Christel
    Verbeke, Geert
    [J]. STATISTICS IN MEDICINE, 2014, 33 (25) : 4402 - 4419