Multiple Imputation for Incomplete Traffic Accident Data Using Chained Equations

被引:0
|
作者
Li, Linchao [1 ]
Zhang, Jian [1 ]
Wang, Yonggang [2 ]
Ran, Bin [1 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing, Jiangsu, Peoples R China
[2] Changan Univ, Sch Highway, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
imputation model; missing values; recovery; traffic safety; METHODOLOGICAL ALTERNATIVES; STATISTICAL-ANALYSIS; MISSING VALUES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing value in traffic accident data prevents the discovery of the significant factors to reduce accident severity and even lead to an invalid conclusion. In previous studies, to handle this problem, researchers mainly tried to improve the methodologies to fit the incomplete data. In this paper, we propose a missing value imputation method. It can impute missing values in the traffic accident data set. The method is called multiple imputation by chained equations (MICE) which is flexible and practical. It can not only cope with univariate missing values but also multivariate missing values. The proposed algorithm is compared with two traditional imputation methods using two publicly available traffic accident datasets from New York. Furthermore, we test the performance of the model with different missing ratios. The imputations for continuous variables and discrete variables are analyzed separately. The results indicate that our proposed model outperforms the other two models under almost all situations.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Multiple Imputation and Genetic Programming for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 521 - 528
  • [32] mice: Multivariate Imputation by Chained Equations in R
    van Buuren, Stef
    Groothuis-Oudshoorn, Karin
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 45 (03): : 1 - 67
  • [33] Multiple Imputation for Incomplete Data in Environmental Epidemiology Research
    Prince Addo Allotey
    Ofer Harel
    Current Environmental Health Reports, 2019, 6 : 62 - 71
  • [34] Multiple Imputation and Ensemble Learning for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    Lam Thu Bui
    INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2016, 2017, 8 : 401 - 415
  • [35] Multiple Imputation for Incomplete Data in Environmental Epidemiology Research
    Allotey, Prince Addo
    Harel, Ofer
    CURRENT ENVIRONMENTAL HEALTH REPORTS, 2019, 6 (02) : 62 - 71
  • [36] A functional multiple imputation approach to incomplete longitudinal data
    He, Yulei
    Yucel, Recai
    Raghunathan, Trivellore E.
    STATISTICS IN MEDICINE, 2011, 30 (10) : 1137 - 1156
  • [37] Multiple imputation with multivariate imputation by chained equation (MICE) package
    Zhang, Zhongheng
    ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (02)
  • [38] Infilling of high-dimensional rainfall networks through multiple imputation by chained equations
    O'Sullivan, Brian
    Kelly, Gabrielle
    INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2024, 44 (09) : 3075 - 3091
  • [39] Multiple imputation for analysis of incomplete data in distributed health data networks
    Changgee Chang
    Yi Deng
    Xiaoqian Jiang
    Qi Long
    Nature Communications, 11
  • [40] A fair comparison of tree-based and parametric methods in multiple imputation by chained equations
    Slade, Emily
    Naylor, Melissa G.
    STATISTICS IN MEDICINE, 2020, 39 (08) : 1156 - 1166