A bagging algorithm for the imputation of missing values in time series

被引:29
|
作者
Andiojaya, Agung [1 ,2 ]
Demirhan, Haydar [2 ]
机构
[1] Badan Pusat Stat, Jl Dr Sutomo 6-8, Jakarta 10710, Indonesia
[2] RMIT Univ, Sch Sci, Math Sci Discipline, 364 Swanston St, Melbourne, Vic 3000, Australia
关键词
Block bootstrap; Gap filling; Interpolation; Kalman filter; Stineman; Weighted moving average; QUANTILE PREDICTORS; CLASSIFICATION; BOOTSTRAP; BINARY;
D O I
10.1016/j.eswa.2019.03.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classical time series analysis methods are not readily applicable to the series with missing observations. To deal with the missingness in time series, the common approach is to use imputation techniques to fill in the gaps and get a regularly spaced series. However, this approach has several drawbacks such as information and time bias, relationship causality, and not being suitable for the series with a high missingness rate. Instead of directly imputing the missing values, we propose a bagging algorithm to improve on the accuracy of imputation methods utilizing block bootstrap methods and marked point processes. We consider non-overlapping, moving, and circular block bootstrap methods along with amplitude modulated series and integer valued sequences. Imputation methods considered for bagging are Stineman and linear interpolations, Kalman filters, and weighted moving average. Imputation accuracy of the proposed algorithm is investigated by nearly 3000 yearly, quarterly, and monthly time series from different sectors under the "missing completely random" and "missing at random" missingness mechanisms. The results of the numerical study show that the proposed algorithm improved the accuracy of the considered imputation methods at most of the instances for different missingness rates and frequencies under both missingness mechanisms. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:10 / 26
页数:17
相关论文
共 50 条
  • [1] Imputation of Missing Values in Time Series with Lagged Correlations
    Rahman, Shah Atiqur
    Huang, Yuxiao
    Claassen, Jan
    Kleinberg, Samantha
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 753 - 762
  • [2] Recurrent Imputation for Multivariate Time Series with Missing Values
    Suo, Qiuling
    Yao, Liuyi
    Xun, Guangxu
    Sun, Jianhui
    Zhang, Aidong
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 562 - 564
  • [3] Selective Imputation for Multivariate Time Series Datasets With Missing Values
    Blazquez-Garcia, Anehd
    Wickstrom, Kristoffer
    Yu, Shujian
    Mikalsen, Karl Oyvind
    Boubekki, Ahcene
    Conde, Angel
    Mori, Usue
    Jenssen, Robert
    Lozano, Jose A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 9490 - 9501
  • [4] On the Imputation of Missing Values in Univariate PM10 Time Series
    Albano, G.
    La Rocca, M.
    Perna, C.
    [J]. COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2017, PT II, 2018, 10672 : 12 - 19
  • [5] Trend Tests in Time Series with Missing Values: a Case Study with Imputation
    Rosario Ramos, M.
    Cordeiro, Clara
    [J]. 11TH INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2013, PTS 1 AND 2 (ICNAAM 2013), 2013, 1558 : 1909 - 1912
  • [6] Missing values imputation for a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 245 - 254
  • [7] SEASONAL TIME-SERIES IMPUTATION OF GAP MISSING ALGORITHM (STIGMA)
    Rangel-heras, Eduardo
    Zuniga, Pavel
    Alanis, Alma y.
    Hernadez-vargas, Esteban a.
    Sanchez, Oscar d.
    [J]. KYBERNETIKA, 2023, 59 (06) : 861 - 879
  • [8] Visually and Statistically Guided Imputation of Missing Values in Univariate Seasonal Time Series
    Boegl, M.
    Filzmoser, P.
    Gschwandtner, T.
    Miksch, S.
    Aigner, W.
    Rind, A.
    Lammarsch, T.
    [J]. 2015 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY, 2015, : 189 - 190
  • [9] Deep imputation of missing values in time series health data: A review with benchmarking
    Kazijevs, Maksims
    Samad, Manar D.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 144
  • [10] A novel imputation method for missing values in air pollutant time series data
    Pena, Mario
    Ortega, Patricia
    Orellana, Marcos
    [J]. 2019 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2019, : 99 - 104