Long-term missing value imputation for time series data using deep neural networks

被引:9
|
作者
Park, Jangho [1 ]
Muller, Juliane [2 ]
Arora, Bhavna [3 ]
Faybishenko, Boris [3 ]
Pastorello, Gilberto [2 ]
Varadharajan, Charuleka [3 ]
Sahu, Reetik [4 ]
Agarwal, Deborah [2 ]
机构
[1] Teikametrics, Artificial Intelligence Team, 280 Summer St, Boston, MA 02210 USA
[2] Natl Renewable Energy Lab, Computat Sci Ctr, 15013 Denver W Pkwy, Golden, CO 80401 USA
[3] Lawrence Berkeley Natl Lab, Earth & Environm Sci Area, Calvin Rd, Berkeley, CA 94705 USA
[4] Int Inst Appl Syst Anal, Water Secur Res Grp, Schlosspl 1, A-2361 Laxenburg, Austria
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 12期
关键词
Missing value imputation; Environmental data; Machine learning; Hyperparameter optimization; Derivative-free optimization; Surrogate models; FLUXES; ALGORITHMS; REGRESSION;
D O I
10.1007/s00521-022-08165-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach that uses a deep learning model, in particular, a MultiLayer Perceptron, for estimating the missing values of a variable in multivariate time series data. We focus on filling a long continuous gap (e.g., multiple months of missing daily observations) rather than on individual randomly missing observations. Our proposed gap filling algorithm uses an automated method for determining the optimal MLP model architecture, thus allowing for optimal prediction performance for the given time series. We tested our approach by filling gaps of various lengths (three months to three years) in three environmental datasets with different time series characteristics, namely daily groundwater levels, daily soil moisture, and hourly Net Ecosystem Exchange. We compared the accuracy of the gap-filled values obtained with our approach to the widely used R-based time series gap filling methods ImputeTS and mtsdi. The results indicate that using an MLP for filling a large gap leads to better results, especially when the data behave nonlinearly. Thus, our approach enables the use of datasets that have a large gap in one variable, which is common in many long-term environmental monitoring observations.
引用
收藏
页码:9071 / 9091
页数:21
相关论文
共 50 条
  • [1] Long-term missing value imputation for time series data using deep neural networks
    Jangho Park
    Juliane Müller
    Bhavna Arora
    Boris Faybishenko
    Gilberto Pastorello
    Charuleka Varadharajan
    Reetik Sahu
    Deborah Agarwal
    [J]. Neural Computing and Applications, 2023, 35 : 9071 - 9091
  • [2] Missing Value Imputation of Time-Series Air-Quality Data via Deep Neural Networks
    Kim, Taesung
    Kim, Jinhee
    Yang, Wonho
    Lee, Hunjoo
    Choo, Jaegul
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (22)
  • [3] Imputation of Missing Data in Time Series for Air Pollutants Using Long Short-Term Memory Recurrent Neural Networks
    Yuan, Hongwu
    Xu, Guoming
    Yao, Zijian
    Jia, Ji
    Zhang, Yiwen
    [J]. PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 1293 - 1300
  • [4] Multivariate time series imputation for energy data using neural networks
    Bulte, Christopher
    Kleinebrahm, Max
    Yilmaz, Hasan Umitcan
    Gomez-Romero, Juan
    [J]. ENERGY AND AI, 2023, 13
  • [5] Imputation of Missing Value Using Dynamic Bayesian Network for Multivariate Time Series Data
    Susanti, Steffi Pauli
    Azizah, Fazat Nur
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2017,
  • [6] Optimization of missing value imputation for neural networks
    Han, Jongmin
    Kang, Seokho
    [J]. INFORMATION SCIENCES, 2023, 649
  • [7] Missing Value Imputation on Multidimensional Time Series
    Bansal, Parikshit
    Deshpande, Prathamesh
    Sarawagi, Sunita
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2533 - 2545
  • [8] Imputation of missing data with neural networks for classification
    Choudhury, Suyra Jyoti
    Pal, Nikhil R.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [9] Missing value imputation in time series using Singular Spectrum Analysis
    Mahmoudvand, Rahim
    Rodrigues, Paulo Canas
    [J]. INTERNATIONAL JOURNAL OF ENERGY AND STATISTICS, 2016, 4 (01)
  • [10] Missing Precipitation Data Estimation Using Long Short-Term Memory Deep Neural Networks
    Djerbouai, Salim
    [J]. JOURNAL OF ECOLOGICAL ENGINEERING, 2022, 23 (05): : 216 - 225