Multiple Imputation for Incomplete Traffic Accident Data Using Chained Equations

被引:0
|
作者
Li, Linchao [1 ]
Zhang, Jian [1 ]
Wang, Yonggang [2 ]
Ran, Bin [1 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing, Jiangsu, Peoples R China
[2] Changan Univ, Sch Highway, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
imputation model; missing values; recovery; traffic safety; METHODOLOGICAL ALTERNATIVES; STATISTICAL-ANALYSIS; MISSING VALUES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing value in traffic accident data prevents the discovery of the significant factors to reduce accident severity and even lead to an invalid conclusion. In previous studies, to handle this problem, researchers mainly tried to improve the methodologies to fit the incomplete data. In this paper, we propose a missing value imputation method. It can impute missing values in the traffic accident data set. The method is called multiple imputation by chained equations (MICE) which is flexible and practical. It can not only cope with univariate missing values but also multivariate missing values. The proposed algorithm is compared with two traditional imputation methods using two publicly available traffic accident datasets from New York. Furthermore, we test the performance of the model with different missing ratios. The imputations for continuous variables and discrete variables are analyzed separately. The results indicate that our proposed model outperforms the other two models under almost all situations.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Multiple imputation for analysis of incomplete data in distributed health data networks
    Chang, Changgee
    Deng, Yi
    Jiang, Xiaoqian
    Long, Qi
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [42] Using Multivariate Imputation by Chained Equations to Predict Redshifts of Active Galactic Nuclei
    Gibson, Spencer James
    Narendra, Aditya
    Dainotti, Maria Giovanna
    Bogdan, Malgorzata
    Pollo, Agnieszka
    Poliszczuk, Artem
    Rinaldi, Enrico
    Liodakis, Ioannis
    FRONTIERS IN ASTRONOMY AND SPACE SCIENCES, 2022, 9
  • [43] Multiple Imputation by Generative Adversarial Networks for Classification with Incomplete Data
    Bao Ngoc Vi
    Dinh Tan Nguyen
    Cao Truong Tran
    Huu Phuc Ngo
    Chi Cong Nguyen
    Hai-Hong Phan
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 162 - 167
  • [44] A comparison of multiple imputation methods for incomplete longitudinal binary data
    Yamaguchi, Yusuke
    Misumi, Toshihiro
    Maruo, Kazushi
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2018, 28 (04) : 645 - 667
  • [45] Multiple imputation of incomplete zero-inflated count data
    Kleinke, Kristian
    Reinecke, Jost
    STATISTICA NEERLANDICA, 2013, 67 (03) : 311 - 336
  • [46] Multivariate imputation via chained equations for elastic well log imputation and prediction
    Hallam, Antony
    Mukherjee, Debajoy
    Chassagne, Romain
    APPLIED COMPUTING AND GEOSCIENCES, 2022, 14
  • [47] Handling Incomplete Data Using Evolution of Imputation Methods
    Zawistowski, Pawel
    Grzenda, Maciej
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 22 - +
  • [48] Imputation Methods for Incomplete Data
    Umathe, Vaishali H.
    Chaudhary, Gauri
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [49] Cost-effectiveness in clinical trials: using multiple imputation to deal with incomplete cost data
    Burton, Andrea
    Billingham, Lucinda Jane
    Bryan, Stirling
    CLINICAL TRIALS, 2007, 4 (02) : 154 - 161
  • [50] Difference Between Binomial Proportions Using Newcombe's Method With Multiple Imputation for Incomplete Data
    Sidi, Yulia
    Harel, Ofer
    AMERICAN STATISTICIAN, 2022, 76 (01): : 29 - 36