A functional data approach to missing value imputation and outlier detection for traffic flow data

Cited by: 63
Authors
Chiou, Jeng-Min [1]
Zhang, Yi-Chen [1]
Chen, Wan-Hui [2]
Chang, Chiung-Wen [3]
Affiliations
[1] Acad Sinica, Inst Stat Sci, Taipei 11529, Taiwan
[2] Tamkang Univ, Dept Transportat Management, New Taipei City, Taiwan
[3] Minist Transportat & Commun, Inst Transportat, Taipei, Taiwan
Keywords
functional data; functional principal component analysis; intelligent transportation system; traffic flow rate; vehicle detector; RATES; CLASSIFICATION; MORTALITY;
DOI
10.1080/21680566.2014.892847
Chinese Library Classification number
U [Transportation];
Discipline classification code
08 ; 0823 ;
Abstract
Missing values and outliers are frequently encountered in traffic monitoring data. We approach these problems by sampling the daily traffic flow rate trajectories from random functions and taking advantage of the data features using functional data analysis. We propose to impute missing values by using the conditional expectation approach to functional principal component analysis (FPCA). Our simulation study shows that the FPCA approach performs better than two commonly discussed methods in the literature, the probabilistic principal component analysis (PCA) and the Bayesian PCA, which have been shown to perform better than many conventional approaches. Based on the FPCA approach, the functional principal component scores can be applied to the functional bagplot and functional highest density region boxplot, which makes outlier detection possible for incomplete functional data. Our numerical results indicate that these two outlier detection approaches coupled with the proposed missing value imputation method can perform reasonably well. Although motivated by traffic flow data application, the proposed functional data methods for missing value imputation and outlier detection can be used in many applications with longitudinally recorded functional data.
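The conditional expectation idea named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes all curves are recorded on one common time grid with gaps marked as NaN, it skips the smoothing steps a full FPCA would normally include, and the function name `fpca_impute` and its parameters are hypothetical. Each curve's principal component scores are estimated from its observed points only, and the gaps are then filled from the fitted mean-plus-components curve.

```python
import numpy as np

def fpca_impute(X, n_components=2, sigma2=None):
    """Hypothetical helper: fill NaN gaps in curves observed on a common grid.

    X is an (n_curves, n_grid) array with NaN marking missing values.
    Assumes every grid point is observed in at least one curve.
    """
    X = np.asarray(X, dtype=float)
    obs = ~np.isnan(X)
    mu = np.nanmean(X, axis=0)                      # pointwise mean curve
    R = np.where(obs, X - mu, 0.0)                  # centered data, zeros at gaps

    # Covariance from pairwise-complete observations.
    counts = obs.T.astype(float) @ obs.astype(float)
    cov = (R.T @ R) / np.maximum(counts, 1.0)

    # Leading eigenpairs play the role of the functional principal components.
    evals, evecs = np.linalg.eigh(cov)
    order = np.argsort(evals)[::-1][:n_components]
    lam = np.clip(evals[order], 1e-12, None)
    phi = evecs[:, order]

    # Crude noise variance (an assumption of this sketch): the average
    # variance left unexplained by the retained components.
    if sigma2 is None:
        resid = np.diag(cov) - (phi ** 2) @ lam
        sigma2 = max(float(np.mean(resid)), 1e-6)

    X_hat = X.copy()
    scores = np.zeros((X.shape[0], lam.size))
    for i in range(X.shape[0]):
        o = obs[i]
        if not o.any():
            X_hat[i] = mu                           # nothing observed: use the mean
            continue
        phi_o = phi[o]
        # Covariance of the observed part of curve i, including noise.
        Sigma = (phi_o * lam) @ phi_o.T + sigma2 * np.eye(int(o.sum()))
        # Conditional expectation of the scores given the observed points.
        scores[i] = lam * (phi_o.T @ np.linalg.solve(Sigma, R[i, o]))
        fit = mu + phi @ scores[i]
        X_hat[i, ~o] = fit[~o]                      # observed values stay untouched
    return X_hat, scores
```

Calling `fpca_impute(X, n_components=2)` returns the completed array together with the estimated scores. The scores are the kind of low-dimensional summary that, per the abstract, feeds the functional bagplot and highest density region boxplot for outlier detection.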
Pages: 106-129
Number of pages: 24
Related papers
50 in total
  • [1] Missing value imputation and outlier detection for functional data: an application for PM10 data
    Melendez, Rafael
    Bolivar, Stevenson
    Rojano, Roberto
    [J]. UIS INGENIERIAS, 2020, 19 (02) : 1 - 10
  • [2] A FUNCTIONAL DATA APPROACH TO OUTLIER DETECTION AND IMPUTATION FOR TRAFFIC DENSITY DATA ON URBAN ARTERIAL ROADS
    Tang, Bin
    Hu, Yao
    Chen, Huan
    [J]. PROMET-TRAFFIC & TRANSPORTATION, 2022, 34 (05) : 755 - 765
  • [3] A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
    Kumar, Nishith
    Hoque, Md Aminul
    Shahjaman, Md
    Islam, S. M. Shahinul
    Mollah, Md Nurul Haque
    [J]. CURRENT BIOINFORMATICS, 2019, 14 (01) : 43 - 52
  • [4] Functional clustering and missing value imputation of traffic flow trajectories
    Li, Pai-Ling
    Chiou, Jeng-Min
    [J]. TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2021, 9 (01) : 1 - 21
  • [5] Missing value imputation method for heterogeneous traffic flow data based on feature fusion
    Original Chinese title: 基于特征级融合的高速公路异质交通流数据修复方法
    [J]. Zhang, Jian (jianzhang@seu.edu.cn), 2018, Southeast University (48):
  • [6] Missing value imputation for the analysis of incomplete traffic accident data
    Deb, Rupam
    Liew, Alan Wee-Chung
    [J]. INFORMATION SCIENCES, 2016, 339 : 274 - 289
  • [7] Imputation of Missing Traffic Flow Data Using Denoising Autoencoders
    Jiang, Boyuan
    Siddiqi, Muhammad Danial
    Asadi, Reza
    Regan, Amelia
    [J]. 12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 84 - 91
  • [8] PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach
    Qu, Li
    Li, Li
    Zhang, Yi
    Hu, Jianming
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2009, 10 (03) : 512 - 522
  • [9] A spatiotemporal approach for traffic data imputation with complicated missing patterns
    Li, Huiping
    Li, Meng
    Lin, Xi
    He, Fang
    Wang, Yinhai
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 119
  • [10] RadarTSR: A new algorithm for cellwise and rowwise outlier detection and missing data imputation
    Gonzalez-Cebrian, Alba
    Folch-Fortuny, Abel
    Arteaga, Francisco
    Ferrer, Alberto
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 247