Crash frequency modeling using negative binomial models: An application of generalized estimating equation to longitudinal data

被引:43
|
作者
Mohammadi, Mojtaba A. [1 ]
Samaranayake, V. A. [2 ]
Bham, Ghulam H. [3 ]
机构
[1] Missouri Univ Sci & Technol, Civil Architecture & Environm Engn, 219 Butler Carlton Hall,1401 N.Pine St, Rolla, MO 65409 USA
[2] Missouri Univ Sci & Technol, Math & Stat, Rolla, MO 65409 USA
[3] Univ Alaska Anchorage, Civil Engn, Anchorage, AK 99508 USA
关键词
Generalized estimation equation; Longitudinal analysis; Temporal correlation; Crash frequency model; Autocorrelation; Autoregressive;
D O I
10.1016/j.amar.2014.07.001
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The prediction of crash frequency models can be improved when several years of crash data are utilized, instead of three to five years of data most commonly used in research. Crash data, however, generates multiple observations over the years that can be correlated. This temporal correlation affects the estimated coefficients and their variances in commonly used crash frequency models (such as negative binomial (NB), Poisson models, and their generalized forms). Despite the obvious temporal correlation of crashes, research analyses of such correlation have been limited and the consequences of this omission are not completely known. The objective of this study is to explore the effects of temporal correlation in crash frequency models at the highway segment level. In this paper, a negative binomial model has been developed using a generalized estimating equation (GEE) procedure that incorporates the temporal correlations amongst yearly crash counts. The longitudinal model employs an autoregressive correlation structure to compare to the more traditional NB model, which uses a Maximum Likelihood Estimation (MLE) method that cannot accommodate temporal correlations. The GEE model with temporal correlation was Found to be superior compared to the MLE model, as it does not underestimate the variance in the coefficient estimates, and it provides more accurate and less biased estimates. Furthermore, an autoregressive correlation structure was found to be an appropriate structure for longitudinal type of data used in this study. Ten years (2002-2011) of Missouri Interstate highway crash data have been utilized in this paper. The crash data comprises of traffic characteristics and geometric properties of highway segments. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:52 / 69
页数:18
相关论文
共 50 条
  • [21] Modeling Factors Associated with Dialysis Adequacy Using Longitudinal Data Analysis: Generalized Estimating Equation Versus Quadratic Inference Function
    Gholian, Khadije
    Hajian-Tilaki, Karimollah
    Akbari, Roghayeh
    JOURNAL OF RESEARCH IN HEALTH SCIENCES, 2023, 23 (02)
  • [22] Generalized estimating equation models for correlated data: A review with applications
    Zorn, CJW
    AMERICAN JOURNAL OF POLITICAL SCIENCE, 2001, 45 (02) : 470 - 490
  • [23] Analysis of frequency count data using the negative binomial distribution
    White, GC
    Bennetts, RE
    ECOLOGY, 1996, 77 (08) : 2549 - 2557
  • [24] Using generalized estimating equations for longitudinal data analysis
    Ballinger, GA
    ORGANIZATIONAL RESEARCH METHODS, 2004, 7 (02) : 127 - 150
  • [25] Crash Frequency Modeling Using Realistic Artificial Data
    Li, Can
    Edara, Praveen
    Shang, Yi
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 28 - 29
  • [26] A negative binomial Lindley approach considering spatiotemporal effects for modeling traffic crash frequency with excess zeros
    Wang, Wencheng
    Yang, Yang
    Yang, Xiaobao
    Gayah, Vikash V.
    Wang, Yunpeng
    Tang, Jinjun
    Yuan, Zhenzhou
    ACCIDENT ANALYSIS AND PREVENTION, 2024, 207
  • [27] Comments on "Longitudinal beta-binomial modeling using GEE for overdispersed binomial data"
    Inan, Gul
    STATISTICS IN MEDICINE, 2018, 37 (03) : 503 - 505
  • [28] Model crash frequency at highway-railroad grade crossings using negative binomial regression
    Hu, Shou-Ren
    Li, Chin-Shang
    Lee, Chi-Kang
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2012, 35 (07) : 841 - 852
  • [29] Factors associated with mortality from tuberculosis in Iran: an application of a generalized estimating equation-based zero-inflated negative binomial model to national registry data
    Sarvi, Fatemeh
    Moghimbeigi, Abbas
    Mahjub, Hossein
    Nasehi, Mahshid
    Khodadost, Mahmoud
    EPIDEMIOLOGY AND HEALTH, 2019, 41
  • [30] Bias properties of Bayesian statistics in finite mixture of negative binomial regression models in crash data analysis
    Park, Byung-Jung
    Lord, Dominique
    Hart, Jeffrey D.
    ACCIDENT ANALYSIS AND PREVENTION, 2010, 42 (02): : 741 - 749