PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach

被引:301
|
作者
Qu, Li [1 ]
Li, Li [1 ]
Zhang, Yi [1 ]
Hu, Jianming [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Missing data; probabilistic principal component analysis (PPCA); traffic flow volume; PRINCIPAL COMPONENT ANALYSIS; MAXIMUM-LIKELIHOOD;
D O I
10.1109/TITS.2009.2026312
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The missing data problem greatly affects traffic analysis. In this paper, we put forward a new reliable method called probabilistic principal component analysis (PPCA) to impute the missing flow volume data based on historical data mining. First, we review the current missing data-imputation method and why it may fail to yield acceptable results in many traffic flow applications. Second, we examine the statistical properties of traffic flow volume time series. We show that the fluctuations of traffic flow are Gaussian type and that principal component analysis (PCA) can be used to retrieve the features of traffic flow. Third, we discuss how to use a robust PCA to filter out the abnormal traffic flow data that disturb the imputation process. Finally, we recall the theories of PPCA/Bayesian PCA-based imputation algorithms and compare their performance with some conventional methods, including the nearest/mean historical imputation methods and the local interpolation/regression methods. The experiments prove that the PPCA method provides significantly better performance than the conventional methods, reducing the root-mean-square imputation error by at least 25%.
引用
收藏
页码:512 / 522
页数:11
相关论文
共 50 条
  • [1] Automatic Imputation of Missing Highway Traffic Volume Data
    Elshenawy, Mohamed
    El-darieby, Mohamed
    Abdulhai, Baher
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2018,
  • [2] A functional data approach to missing value imputation and outlier detection for traffic flow data
    Chiou, Jeng-Min
    Zhang, Yi-Chen
    Chen, Wan-Hui
    Chang, Chiung-Wen
    [J]. TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2014, 2 (02) : 106 - 129
  • [3] A Kriging based spatiotemporal approach for traffic volume data imputation
    Yang, Hongtai
    Yang, Jianjiang
    Han, Lee D.
    Liu, Xiaohan
    Pu, Li
    Chin, Shih-miao
    Hwang, Ho-ling
    [J]. PLOS ONE, 2018, 13 (04):
  • [5] Imputation of Missing Traffic Flow Data Using Denoising Autoencoders
    Jiang, Boyuan
    Siddiqi, Muhammad Danial
    Asadi, Reza
    Regan, Amelia
    [J]. 12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 84 - 91
  • [6] A spatiotemporal approach for traffic data imputation with complicated missing patterns
    Li, Huiping
    Li, Meng
    Lin, Xi
    He, Fang
    Wang, Yinhai
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 119
  • [7] Missing value imputation method for heterogeneous traffic flow data based on feature fusion
    基于特征级融合的高速公路异质交通流数据修复方法
    [J]. Zhang, Jian (jianzhang@seu.edu.cn), 2018, Southeast University (48):
  • [8] A Bayesian robust CP decomposition approach for missing traffic data imputation
    Yun Zhu
    Weiye Wang
    Gaohang Yu
    Jun Wang
    Lei Tang
    [J]. Multimedia Tools and Applications, 2022, 81 : 33171 - 33184
  • [9] A Bayesian robust CP decomposition approach for missing traffic data imputation
    Zhu, Yun
    Wang, Weiye
    Yu, Gaohang
    Wang, Jun
    Tang, Lei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33171 - 33184
  • [10] Data imputation approach for laneHscale traffic flow based on tensor decomposition theory
    Lu, Wen-Qi
    Zhou, Tian
    Gu, Yuan-Li
    Rui, Yi-Kang
    Ran, Bin
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2021, 51 (05): : 1708 - 1715