Developing a Random Parameters Negative Binomial-Lindley Model to analyze highly over-dispersed crash count data

Cited by: 49
Authors
Shaon, Mohammad Razaur Rahman [1]
Qin, Xiao [1]
Shirazi, Mohammadali [2]
Lord, Dominique [2]
Geedipally, Srinivas Reddy [3]
Affiliations
[1] Univ Wisconsin Milwaukee, Dept Civil & Environm Engn, POB 784, Milwaukee, WI 53201 USA
[2] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX 77843 USA
[3] Texas A&M Transportat Inst, 110 N Davis Dr, Arlington, TX 76013 USA
Keywords
Excess zero observations; Over-dispersion; Unobserved heterogeneity; Mixed model; Random parameters model; Negative binomial-Lindley; GENERALIZED LINEAR-MODEL; MOTOR-VEHICLE CRASHES; INJURY SEVERITIES; UNOBSERVED HETEROGENEITY; STATISTICAL-ANALYSIS; MIXED MODELS; POISSON; FREQUENCY; VARIANCES; SAFETY
DOI
10.1016/j.amar.2018.04.002
Chinese Library Classification (CLC) number
R1 [Preventive medicine, hygiene]
Discipline classification code
1004; 120402
Abstract
The existence of preponderant zero crash sites and/or sites with large crash counts can present challenges during the statistical analysis of crash count data. Additionally, unobserved heterogeneity in crash data due to the absence of important variables could negatively impact the estimated model parameters. The traditional negative binomial (NB) model with fixed parameters might not adequately handle highly over-dispersed data or unobserved heterogeneity. Many research efforts that have involved the negative binomial-Lindley (NB-L) model or the random parameters negative binomial (RPNB) model, for example, have attempted to improve the inference of estimated coefficients by explicitly accounting for extra variation in crash data. The NB-L is a mixed modeling approach which provides flexibility to account for additional dispersion in data. The RP modeling approach accommodates the effect of unobserved variables by allowing the model parameters to vary from one observation to another. The following study proposes a combination of these models - the random parameters NB-L (RPNB-L) generalized linear model (GLM) - to account for underlying heterogeneity and address excess over-dispersion. The results show that the RPNB-L model not only provides a superior goodness-of-fit (GOF) with the sample data, but also offers a better understanding about the effects of potential contributing factors. The paper uses the Bayesian framework to provide a strategy for eliminating the potential for poor mixing in the Markov Chain Monte Carlo (MCMC) chains during the estimation of the RPNB-L model. © 2018 Elsevier Ltd. All rights reserved.
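The model structure described in the abstract can be illustrated by simulating data from it. The sketch below is not the authors' code or estimation procedure; it only draws counts from one plausible RPNB-L data-generating process. It assumes a log link, independent normal random parameters, an NB parameterized by an inverse-dispersion value `phi`, and the standard representation of the Lindley distribution as a two-component gamma mixture. The function name `simulate_rpnbl` and all numeric values are hypothetical.

```python
import numpy as np

def simulate_rpnbl(x, beta_mean, beta_sd, phi, theta, rng):
    """Draw counts from a hypothetical RPNB-L data-generating process."""
    n, k = x.shape
    # Random parameters: each observation gets its own coefficient vector
    # (assumed independent normal; other distributions are possible).
    beta_i = rng.normal(beta_mean, beta_sd, size=(n, k))
    mu = np.exp(np.sum(x * beta_i, axis=1))  # log link (assumption)
    # Lindley(theta) as a gamma mixture: shape 1 with probability
    # theta / (1 + theta), shape 2 with probability 1 / (1 + theta).
    z = rng.binomial(1, 1.0 / (1.0 + theta), size=n)
    eps = rng.gamma(shape=1 + z, scale=1.0 / theta)
    # NB with mean mu * eps and inverse dispersion phi, drawn as a
    # Poisson-gamma mixture.
    lam = rng.gamma(shape=phi, scale=(mu * eps) / phi)
    return rng.poisson(lam)

rng = np.random.default_rng(0)
# Intercept plus one illustrative covariate (made-up data).
x = np.column_stack([np.ones(5000), rng.normal(size=5000)])
y = simulate_rpnbl(x, np.array([0.0, 0.3]), np.array([0.1, 0.2]),
                   phi=2.0, theta=1.5, rng=rng)
print("mean:", y.mean(), "variance:", y.var(), "share of zeros:", (y == 0).mean())
```

The Lindley term multiplies the NB mean by an extra site-level random effect, which is what adds dispersion beyond the plain NB; the per-observation coefficients are what the abstract calls random parameters.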
Pages: 33-44
Number of pages: 12