Missing Traffic Data Imputation with a Linear Generative Model Based on Probabilistic Principal Component Analysis

被引:2
|
作者
Huang, Liping [1 ]
Li, Zhenghuan [1 ]
Luo, Ruikang [1 ]
Su, Rong [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
missing data; urban traffic sensing; probabilistic; principal component analysis; PREDICTION;
D O I
10.3390/s23010204
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Even with the ubiquitous sensing data in intelligent transportation systems, such as the mobile sensing of vehicle trajectories, traffic estimation is still faced with the data missing problem due to the detector faults or limited number of probe vehicles as mobile sensors. Such data missing issue poses an obstacle for many further explorations, e.g., the link-based traffic status modeling. Although many studies have focused on tackling this kind of problem, existing studies mainly focus on the situation in which data are missing at random and ignore the distinction between links of missing data. In the practical scenario, traffic speed data are always missing not at random (MNAR). The distinction for recovering missing data on different links has not been studied yet. In this paper, we propose a general linear model based on probabilistic principal component analysis (PPCA) for solving MNAR traffic speed data imputation. Furthermore, we propose a metric, i.e., Pearson score (p-score), for distinguishing links and investigate how the model performs on links with different p-score values. Experimental results show that the new model outperforms the typically used PPCA model, and missing data on links with higher p-score values can be better recovered.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Symbolic Missing Data Imputation in Principal Component Analysis
    Zuccolotto, Paola
    [J]. Statistical Analysis and Data Mining, 2011, 4 (02): : 171 - 183
  • [2] Enhanced Application of Principal Component Analysis in Machine Learning for Imputation of Missing Traffic Data
    Choi, Yoon-Young
    Shon, Heeseung
    Byon, Young-Ji
    Kim, Dong-Kyu
    Kang, Seungmo
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [3] Missing Data Imputation Algorithm for Transmission Systems Based on Multivariate Imputation With Principal Component Analysis
    Sim, Yeon-Sub
    Hwang, Jae-Sang
    Mun, Sung-Duk
    Kim, Tae-Joon
    Chang, Seung Jin
    [J]. IEEE ACCESS, 2022, 10 : 83195 - 83203
  • [4] Probabilistic principal component analysis-based anomaly detection for structures with missing data
    Ma, Zhi
    Yun, Chung-Bang
    Wan, Hua-Ping
    Shen, Yanbin
    Yu, Feng
    Luo, Yaozhi
    [J]. STRUCTURAL CONTROL & HEALTH MONITORING, 2021, 28 (05):
  • [5] Deep Generative Imputation Model for Missing Not At Random Data
    Chen, Jialei
    Xu, Yuanbo
    Wang, Pengyang
    Yang, Yongjian
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 316 - 325
  • [6] Solving the Missing Data Problem in Urban Traffic Estimation with Principal Component Analysis
    Yang, Qiangrong
    Hu, Jianyao
    Peng, Qi
    [J]. BDIOT 2018: PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS, 2018, : 23 - 28
  • [7] Modified GAN Model for Traffic Missing Data Imputation
    Li, Huiping
    Wang, Yinhai
    Li, Meng
    [J]. CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 3013 - 3023
  • [8] IMPUTATION OF MISSING DATA USING BAYESIAN PRINCIPAL COMPONENT ANALYSIS ON TEC IONOSPHERIC SATELLITE DATASET
    Subashini, P.
    Krishnaveni, M.
    [J]. 2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 1540 - 1543
  • [9] Incremental expectation maximization principal component analysis for missing value imputation for coevolving EEG data
    Sun Hee KIM
    Hyung Jeong YANG
    Kam Swee NG
    [J]. Journal of Zhejiang University-Science C(Computers & Electronics)., 2011, 12 (08) - 697
  • [10] Incremental expectation maximization principal component analysis for missing value imputation for coevolving EEG data
    Sun Hee KIM
    Hyung Jeong YANG
    Kam Swee NG
    [J]. Frontiers of Information Technology & Electronic Engineering, 2011, 12 (08) : 687 - 697