Imputing Missing Data in Hourly Traffic Counts

被引:5
|
作者
Shafique, Muhammad Awais [1 ,2 ,3 ]
机构
[1] Univ Politecn Catalunya BarcelonaTech UPC, Ctr Int Metodes Numer Engn CIMNE, Barcelona 08034, Spain
[2] Univ Politecn Catalunya BarcelonaTech UPC, Ctr Innovat Transport CENIT, Barcelona 08034, Spain
[3] Univ Cent Punjab, Dept Civil Engn, Lahore 54590, Pakistan
关键词
AADT; ATR; daily volumes; imputation; missForest; missing data; IMPUTATION; MODELS;
D O I
10.3390/s22249876
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Hourly traffic volumes, collected by automatic traffic recorders (ATRs), are of paramount importance since they are used to calculate average annual daily traffic (AADT) and design hourly volume (DHV). Hence, it is necessary to ensure the quality of the collected data. Unfortunately, ATRs malfunction occasionally, resulting in missing data, as well as unreliable counts. This naturally has an impact on the accuracy of the key parameters derived from the hourly counts. This study aims to solve this problem. ATR data from New South Wales, Australia was screened for irregularities and invalid entries. A total of 25% of the reliable data was randomly selected to test thirteen different imputation methods. Two scenarios for data omission, i.e., 25% and 100%, were analyzed. Results indicated that missForest outperformed other imputation methods; hence, it was used to impute the actual missing data to complete the dataset. AADT values were calculated from both original counts before imputation and completed counts after imputation. AADT values from imputed data were slightly higher. The average daily volumes when plotted validated the quality of imputed data, as the annual trends demonstrated a relatively better fit.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Temporal Belief Memory: Imputing Missing Data during RNN Training
    Kim, Yeo Jin
    Chi, Min
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2326 - 2332
  • [42] Improving Disparity Research by Imputing Missing Data in Health Care Records
    Rhodes, William
    [J]. HEALTH SERVICES RESEARCH, 2015, 50 (04) : 939 - 945
  • [43] Imputing cross-sectional missing data: comparison of common techniques
    Hawthorne, G
    Elliott, P
    [J]. AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2005, 39 (07): : 583 - 590
  • [44] THE ART OF IMPUTING MISSING DATA OF DISEASE AND FUNCTION ACTIVITY IN RHEUMATOID ARTHRITISREGISTRIES
    Mongin, Denis
    Lauper, Kim
    Turesson, Carl
    Hetland, Merete L.
    Kristianslund, Eirik
    Kvien, Tore K.
    Santos, Maria Jose
    Pavelka, Karel
    Iannone, Florenzo
    Finckh, Axel
    Courvoisier, Delphine
    [J]. ANNALS OF THE RHEUMATIC DISEASES, 2019, 78 : 1386 - 1386
  • [45] An alternative procedure for imputing missing data based on principal components analysis
    Di Franco, Giovanni
    [J]. QUALITY & QUANTITY, 2014, 48 (03) : 1149 - 1163
  • [46] Non-linear regression models for imputing longitudinal missing data
    Samad, Manar D.
    Yin, Linlin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 559 - 561
  • [47] An alternative procedure for imputing missing data based on principal components analysis
    Giovanni Di Franco
    [J]. Quality & Quantity, 2014, 48 : 1149 - 1163
  • [48] A Multiple Imputation Model for Imputing Missing Physiologic Data in the National Trauma Data Bank
    Moore, Lynne
    Hanley, James A.
    Turgeon, Alexis F.
    Lavoie, Andre
    Emond, Marcel
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2009, 209 (05) : 572 - 579
  • [49] Genetically designed models for accurate imputation of missing traffic counts
    Zhong, M
    Sharma, S
    Lingras, P
    [J]. INFORMATION SYSTEMS AND TECHNOLOGY, 2004, (1879): : 71 - 79
  • [50] A Method for Imputing Response Options for Missing Data on Multiple-Choice Assessments
    Wolkowitz, Amanda A.
    Skorupski, William P.
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2013, 73 (06) : 1036 - 1053