Multiple Imputation for Incomplete Traffic Accident Data Using Chained Equations

被引:0
|
作者
Li, Linchao [1 ]
Zhang, Jian [1 ]
Wang, Yonggang [2 ]
Ran, Bin [1 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing, Jiangsu, Peoples R China
[2] Changan Univ, Sch Highway, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
imputation model; missing values; recovery; traffic safety; METHODOLOGICAL ALTERNATIVES; STATISTICAL-ANALYSIS; MISSING VALUES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing value in traffic accident data prevents the discovery of the significant factors to reduce accident severity and even lead to an invalid conclusion. In previous studies, to handle this problem, researchers mainly tried to improve the methodologies to fit the incomplete data. In this paper, we propose a missing value imputation method. It can impute missing values in the traffic accident data set. The method is called multiple imputation by chained equations (MICE) which is flexible and practical. It can not only cope with univariate missing values but also multivariate missing values. The proposed algorithm is compared with two traditional imputation methods using two publicly available traffic accident datasets from New York. Furthermore, we test the performance of the model with different missing ratios. The imputations for continuous variables and discrete variables are analyzed separately. The results indicate that our proposed model outperforms the other two models under almost all situations.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A Correlation Based Imputation Method for Incomplete Traffic Accident Data
    Deb, Rupam
    Liew, Alan Wee-Chung
    Oh, Erwin
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 905 - 912
  • [3] Missing value imputation for the analysis of incomplete traffic accident data
    Deb, Rupam
    Liew, Alan Wee -Chung
    INFORMATION SCIENCES, 2016, 339 : 274 - 289
  • [4] Multiple imputation using chained equations for missing data in TIMSS: a case study
    Bouhlila D.S.
    Sellaouti F.
    Large-scale Assessments in Education, 1 (1)
  • [5] Multiple imputation of unordered categorical missing data: A comparison of the multivariate normal imputation and multiple imputation by chained equations
    Karangwa, Innocent
    Kotze, Danelle
    Blignaut, Renette
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2016, 30 (04) : 521 - 539
  • [6] Multiple imputation using chained equations: Issues and guidance for practice
    White, Ian R.
    Royston, Patrick
    Wood, Angela M.
    STATISTICS IN MEDICINE, 2011, 30 (04) : 377 - 399
  • [7] Method for Incomplete and Imbalanced Data Based on Multivariate Imputation by Chained Equations and Ensemble Learning
    Li, Jiaxi
    Wang, Zhelong
    Wu, Lina
    Qiu, Sen
    Zhao, Hongyu
    Lin, Fang
    Zhang, Ke
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 3102 - 3113
  • [8] Multiple imputation by chained equations for systematically and sporadically missing multilevel data
    Resche-Rigon, Matthieu
    White, Ian R.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (06) : 1634 - 1649
  • [9] Multiple Imputation by Chained Equations (MICE): Implementation in Stata
    Royston, Patrick
    White, Ian R.
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 45 (04): : 1 - 20
  • [10] Multilevel Multiple Imputation: A Review and Evaluation of Joint Modeling and Chained Equations Imputation
    Enders, Craig K.
    Mistler, Stephen A.
    Keller, Brian T.
    PSYCHOLOGICAL METHODS, 2016, 21 (02) : 222 - 240