Imputing Missing Data in Hourly Traffic Counts

被引:5
|
作者
Shafique, Muhammad Awais [1 ,2 ,3 ]
机构
[1] Univ Politecn Catalunya BarcelonaTech UPC, Ctr Int Metodes Numer Engn CIMNE, Barcelona 08034, Spain
[2] Univ Politecn Catalunya BarcelonaTech UPC, Ctr Innovat Transport CENIT, Barcelona 08034, Spain
[3] Univ Cent Punjab, Dept Civil Engn, Lahore 54590, Pakistan
关键词
AADT; ATR; daily volumes; imputation; missForest; missing data; IMPUTATION; MODELS;
D O I
10.3390/s22249876
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Hourly traffic volumes, collected by automatic traffic recorders (ATRs), are of paramount importance since they are used to calculate average annual daily traffic (AADT) and design hourly volume (DHV). Hence, it is necessary to ensure the quality of the collected data. Unfortunately, ATRs malfunction occasionally, resulting in missing data, as well as unreliable counts. This naturally has an impact on the accuracy of the key parameters derived from the hourly counts. This study aims to solve this problem. ATR data from New South Wales, Australia was screened for irregularities and invalid entries. A total of 25% of the reliable data was randomly selected to test thirteen different imputation methods. Two scenarios for data omission, i.e., 25% and 100%, were analyzed. Results indicated that missForest outperformed other imputation methods; hence, it was used to impute the actual missing data to complete the dataset. AADT values were calculated from both original counts before imputation and completed counts after imputation. AADT values from imputed data were slightly higher. The average daily volumes when plotted validated the quality of imputed data, as the annual trends demonstrated a relatively better fit.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Imputing missing data
    Croy, CD
    Novins, DK
    [J]. JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2004, 43 (04): : 380 - 380
  • [2] Spatio-Temporal Tensor Completion for Imputing Missing Internet Traffic Data
    Zhou, Huibin
    Zhang, Dafang
    Xie, Kun
    Chen, Yuxiang
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [3] A BPCA Based Missing Value Imputing Method for Traffic Flow Volume Data
    Qu, Li
    Zhang, Yi
    Hu, Jianming
    Jia, Liyan
    Li, Li
    [J]. 2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, : 171 - 176
  • [4] Efficient missing data imputing for traffic flow by considering temporal and spatial dependence
    Li, Li
    Li, Yuebiao
    Li, Zhiheng
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2013, 34 : 108 - 120
  • [5] IMPUTING MISSING YIELD TRIAL DATA
    GAUCH, HG
    ZOBEL, RW
    [J]. THEORETICAL AND APPLIED GENETICS, 1990, 79 (06) : 753 - 761
  • [6] Matching hourly, daily, and monthly traffic patterns to estimate missing volume data
    Zhong, Ming
    Sharma, Satish
    [J]. NATIONAL, STATE, AND FREIGHT DATA ISSUES AND ASSET MANAGEMENT, 2006, (1957): : 32 - 42
  • [7] A method for imputing missing data in longitudinal studies
    Youk, AO
    Stone, RA
    Marsh, GM
    [J]. ANNALS OF EPIDEMIOLOGY, 2004, 14 (05) : 354 - 361
  • [8] Imputing missing genotypes: effects of methods and patterns of missing data
    Funda Ogut
    Fikret Isik
    Steven McKeand
    Ross Whetten
    [J]. BMC Proceedings, 5 (Suppl 7)
  • [9] Imputing missing values for genetic interaction data
    Wang, Yishu
    Wang, Lin
    Yang, Dejie
    Deng, Minghua
    [J]. METHODS, 2014, 67 (03) : 269 - 277
  • [10] THE PROBLEM OF IMPUTATION OF THE MISSING DATA FROM THE CONTINUOUS COUNTS OF ROAD TRAFFIC
    Splawinska, M.
    [J]. ARCHIVES OF CIVIL ENGINEERING, 2015, 61 (01) : 131 - 145