Missing data imputation for traffic flow speed using spatio-temporal cokriging

被引:67
|
作者
Bae, Bumjoon [1 ]
Kim, Hyun [2 ]
Lim, Hyeonsup [3 ]
Liu, Yuandong [1 ]
Han, Lee D. [1 ]
Freeze, Phillip B. [4 ]
机构
[1] Univ Tennessee, Dept Civil & Environm Engn, 851 Neyland Dr, Knoxville, TN 37996 USA
[2] Univ Tennessee, Dept Geog, 1000 Philip Fulmer Way, Knoxville, TN 37916 USA
[3] Oak Ridge Natl Lab, Ctr Transportat Anal, 2360 Cherahala Blvd, Knoxville, TN 37932 USA
[4] Tennessee Dept Transportat, James K Polk Bldg,505 Deaderick St,Suite 30, Nashville, TN 37243 USA
关键词
Missing data; Imputation; Spatio-temporal kriging; Cokriging; Missing pattems; PREDICTION; TIME;
D O I
10.1016/j.trc.2018.01.015
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Modern transportation systems rely increasingly on the availability and accuracy of traffic detector data to monitor traffic operational conditions and assess system performance. Missing data, which occurs almost inevitably for a number of reasons, can lead to suboptimal operations and ineffective decisions if not remedied in a timely and systematic fashion through data imputation. A review of literature suggests that most traffic data imputation studies considered the temporal continuity of the data but often overlooked the spatial correlations that exist. Few of the studies explored the randomness of the patterns of the missing data. Therefore, this paper proposes two cokriging methods that exploit the existence of spatio-temporal dependency in traffic data and employ multiple data sources, each with independently missing data, to impute high resolution traffic speed data under different data missing pattern scenarios. The two proposed cokriging methods, both using multiple independent data sources, were benchmarked against classic simple and ordinary kriging methods, which use only the primary data source. An array of testing scenarios were designed to test these methods under different missing rates (10-40% data loss) and different missing patterns (random in time and location, random only in location, and non-random blocks of missing data). The results suggest that using multiple data sources with the spatio-temporal simple cokriging method effectively improves the imputation accuracy if the missing data were clustered, or in blocks. On the other hand, if the missing data were randomly scattered in time and location, the classic ordinary or simple kriging method using only the primary data source can be more effective. Our study, which employs empirical traffic speed data from radar detectors and vehicle probes, demonstrates that the overall predictions of the kriging-based imputation approach are accurate and reliable for all combinations of missing patterns and missing rates investigated.
引用
收藏
页码:124 / 139
页数:16
相关论文
共 50 条
  • [1] Sequential Imputation of Missing Spatio-Temporal Precipitation Data Using Random Forests
    Mital, Utkarsh
    Dwivedi, Dipankar
    Brown, James B.
    Faybishenko, Boris
    Painter, Scott L.
    Steefel, Carl I.
    [J]. FRONTIERS IN WATER, 2020, 2
  • [2] Spatio-Temporal Graph Convolutional Network for Stochastic Traffic Speed Imputation
    Cuza, Carlos Enrique Muniz
    Ho, Nguyen
    Zacharatou, Eleni Tzirita
    Pedersen, Torben Bach
    Yang, Bin
    [J]. 30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 105 - 116
  • [4] Learning Traffic as Videos: A Spatio-Temporal VAE Approach for Traffic Data Imputation
    Chen, Jiayuan
    Zhang, Shuo
    Chen, Xiaofei
    Jiang, Qiao
    Huang, Hejiao
    Gu, Chonglin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 615 - 627
  • [5] Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Traininge
    Xu, Qianxiong
    Ruan, Sijie
    Long, Cheng
    Yu, Liang
    Zhang, Chen
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2280 - 2289
  • [6] STGAN: Spatio-Temporal Generative Adversarial Network for Traffic Data Imputation
    Yuan, Ye
    Zhang, Yong
    Wang, Boyue
    Peng, Yuan
    Hu, Yongli
    Yin, Baocai
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (01) : 200 - 211
  • [7] Enhancing Traffic Flow Prediction In The Presence Of Missing Data Through Spatio-Temporal Causal Graphs
    Cao, Ruihao
    Ma, Zhirou
    Liu, Jie
    [J]. Journal of Applied Science and Engineering, 2025, 28 (02): : 237 - 246
  • [8] Hierarchical spatio-temporal graph convolutional neural networks for traffic data imputation
    Xu, Dongwei
    Peng, Hang
    Tang, Yufu
    Guo, Haifeng
    [J]. INFORMATION FUSION, 2024, 106
  • [9] Spatio-Temporal Tensor Completion for Imputing Missing Internet Traffic Data
    Zhou, Huibin
    Zhang, Dafang
    Xie, Kun
    Chen, Yuxiang
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [10] Imputation of missing data from offshore wind farms using spatio-temporal correlation and feature correlation
    Sun, Chuan
    Chen, Yueyi
    Cheng, Cheng
    [J]. ENERGY, 2021, 229