Estimating Missing Data in Temporal Data Streams Using Multi-Directional Recurrent Neural Networks

Cited by: 154
Authors
Yoon, Jinsung [1]
Zame, William R. [2]
van der Schaar, Mihaela [3, 4]
Affiliations
[1] Univ Calif Los Angeles, Dept Elect & Comp Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Econ & Math, Los Angeles, CA USA
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Alan Turing Inst, London, England
Funding
National Science Foundation (USA)
Keywords
Missing data; temporal data streams; imputation; recurrent neural nets; multiple imputation
DOI
10.1109/TBME.2018.2874712
Chinese Library Classification (CLC) number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Missing data is a ubiquitous problem. It is especially challenging in medical settings because many streams of measurements are collected at different, and often irregular, times. Accurate estimation of the missing measurements is critical for many reasons, including diagnosis, prognosis, and treatment. Existing methods address this estimation problem by interpolating within data streams or imputing across data streams (both of which ignore important information) or ignoring the temporal aspect of the data and imposing strong assumptions about the nature of the data-generating process and/or the pattern of missing data (both of which are especially problematic for medical data). We propose a new approach, based on a novel deep learning architecture that we call a Multi-directional Recurrent Neural Network, which interpolates within data streams and imputes across data streams. We demonstrate the power of our approach by applying it to five real-world medical datasets. We show that it provides dramatically improved estimation of missing measurements in comparison to 11 state-of-the-art benchmarks (including Spline and Cubic Interpolations, MICE, MissForest, matrix completion, and several RNN methods); typical improvements in Root Mean Squared Error are between 35% and 50%. Additional experiments based on the same five datasets demonstrate that the improvements provided by our method are extremely robust.
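The abstract compares methods by Root Mean Squared Error on missing measurements and names within-stream interpolation as one baseline family. The sketch below illustrates those two ideas only; it is not the authors' Multi-directional Recurrent Neural Network. It assumes a single stream stored as a Python list with `None` for missing entries, and every function name and the toy data are hypothetical.

```python
import math

def interpolate_within_stream(values):
    """Fill None entries by linear interpolation between the nearest
    observed neighbours in time (nearest value is copied at the edges)."""
    observed = [i for i, v in enumerate(values) if v is not None]
    filled = list(values)
    if not observed:
        return filled  # nothing to interpolate from
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((j for j in observed if j < i), default=None)
        right = min((j for j in observed if j > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            w = (i - left) / (right - left)
            filled[i] = (1 - w) * values[left] + w * values[right]
    return filled

def mean_fill(values):
    """Naive baseline: replace every None with the mean of observed values."""
    seen = [v for v in values if v is not None]
    mean = sum(seen) / len(seen)
    return [mean if v is None else v for v in values]

def masked_rmse(truth, estimate, was_missing):
    """RMSE computed only over the entries that were hidden."""
    sq = [(t - e) ** 2 for t, e, m in zip(truth, estimate, was_missing) if m]
    return math.sqrt(sum(sq) / len(sq))

def relative_improvement(baseline_rmse, new_rmse):
    """Fractional RMSE reduction relative to a baseline (0.4 means 40%)."""
    return (baseline_rmse - new_rmse) / baseline_rmse

# Toy stream (assumed data): hide two known values, then score each estimate.
truth = [1.0, 2.5, 3.0, 4.5, 5.0]
stream = [1.0, None, 3.0, None, 5.0]
mask = [v is None for v in stream]

rmse_interp = masked_rmse(truth, interpolate_within_stream(stream), mask)
rmse_mean = masked_rmse(truth, mean_fill(stream), mask)
print(rmse_interp, rmse_mean, relative_improvement(rmse_mean, rmse_interp))
```

Scoring only the hidden entries is one common way to evaluate imputation; whether the paper uses exactly this protocol is not stated in the abstract.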
Pages: 1477 - 1490
Number of pages: 14
Related papers
50 in total
  • [21] Inference of Missing PV Monitoring Data using Neural Networks
    Koubli, Eleni
    Palmer, Diane
    Betts, Tom
    Rowley, Paul
    Gottschalg, Ralph
    2016 IEEE 43RD PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2016, : 3436 - 3440
  • [22] Continual learning with attentive recurrent neural networks for temporal data classification
    Yin, Shao-Yu
    Huang, Yu
    Chang, Tien-Yu
    Chang, Shih-Fang
    Tseng, Vincent S.
    NEURAL NETWORKS, 2023, 158 : 171 - 187
  • [23] Infilling Missing Daily Evapotranspiration Data Using Neural Networks
    Abudu, Shalamu
    Bawazir, A. Salim
    King, J. Phillip
    JOURNAL OF IRRIGATION AND DRAINAGE ENGINEERING, 2010, 136 (05) : 317 - 325
  • [24] Treatment of missing data using neural networks and genetic algorithms
    Abdella, M
    Marwala, T
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 598 - 603
  • [25] Estimation of Missing Data of Showcase Using Artificial Neural Networks
    Sakurai, Daiji
    Fukuyama, Yoshikazu
    Santana, Adamo
    Kawamura, Yu
    Murakami, Kenya
    Iizaka, Tatsuya
    Matsui, Tetsuro
    2017 IEEE 10TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (IWCIA), 2017, : 15 - 18
  • [26] Using artificial neural networks to estimate missing rainfall data
    Kuligowski, RJ
    Barros, AP
    JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 1998, 34 (06) : 1437 - 1447
  • [27] Configurable Multi-directional Systolic Array Architecture for Convolutional Neural Networks
    Xu, Rui
    Ma, Sheng
    Wang, Yaohua
    Chen, Xinhai
    Guo, Yang
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (04)
  • [28] Imputing Missing Data In Large-Scale Multivariate Biomedical Wearable Recordings Using Bidirectional Recurrent Neural Networks With Temporal Activation Regularization
    Feng, Tiantian
    Narayanan, Shrikanth
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2529 - 2534
  • [29] Multi-directional temporal convolutional artificial neural network for PM2.5 forecasting with missing values: A deep learning approach
    Samal, K. Krishna Rani
    Babu, Korra Sathya
    Das, Santos Kumar
    URBAN CLIMATE, 2021, 36
  • [30] Spatio-Temporal Dynamics of Intrinsic Networks in Functional Magnetic Imaging Data Using Recurrent Neural Networks
    Hjelm, R. Devon
    Damaraju, Eswar
    Cho, Kyunghyun
    Laufs, Helmut
    Plis, Sergey M.
    Calhoun, Vince D.
    FRONTIERS IN NEUROSCIENCE, 2018, 12