Estimating Missing Data in Temporal Data Streams Using Multi-Directional Recurrent Neural Networks

被引:154
|
作者
Yoon, Jinsung [1 ]
Zame, William R. [2 ]
van der Schaar, Mihaela [3 ,4 ]
机构
[1] Univ Calif Los Angeles, Dept Elect & Comp Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Econ & Math, Los Angeles, CA USA
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Alan Turing Inst, London, England
基金
美国国家科学基金会;
关键词
Missing data; temporal data streams; imputation; recurrent neural nets; MULTIPLE-IMPUTATION;
D O I
10.1109/TBME.2018.2874712
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Missing data is a ubiquitous problem. It is especially challenging in medical settings because many streams of measurements are collected at different-and often irregular-times. Accurate estimation of the missing measurements is critical for many reasons, including diagnosis, prognosis, and treatment. Existing methods address this estimation problem by interpolating within data streams or imputing across data streams (both of which ignore important information) or ignoring the temporal aspect of the data and imposing strong assumptions about the nature of the data-generating process and/or the pattern of missing data (both of which are especially problematic for medical data). We propose a new approach, based on a novel deep learning architecture that we call a Multi-directional Recurrent Neural Network that interpolates within data streams and imputes across data streams. We demonstrate the power of our approach by applying it to five real-world medical datasets. We show that it provides dramatically improved estimation of missing measurements in comparison to 11 state-of-the-art benchmarks (including Spline and Cubic Interpolations, MICE, MissForest, matrix completion, and several RNN methods); typical improvements in Root Mean Squared Error are between 35%-50%. Additional experiments based on the same five datasets demonstrate that the improvements provided by our method are extremely robust.
引用
收藏
页码:1477 / 1490
页数:14
相关论文
共 50 条
  • [41] Product failure prediction with missing data using graph neural networks
    Kang, Seokho
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12): : 7225 - 7234
  • [42] Missing data interpolation by using local-global neural networks
    Fariñas, MS
    Pedreira, CE
    ENGINEERING INTELLIGENT SYSTEMS FOR ELECTRICAL ENGINEERING AND COMMUNICATIONS, 2002, 10 (02): : 85 - 91
  • [43] Missing Pavement Performance Data Imputation Using Graph Neural Networks
    Gao, Lu
    Yu, Ke
    Lu, Pan
    TRANSPORTATION RESEARCH RECORD, 2022, 2676 (12) : 409 - 419
  • [44] Product failure prediction with missing data using graph neural networks
    Seokho Kang
    Neural Computing and Applications, 2021, 33 : 7225 - 7234
  • [45] Global aerosol distribution based on multi-directional data given by POLDER
    Sano, I
    Mukai, S
    Okada, Y
    POLARIZATION: MEASUREMENT, ANALYSIS, AND REMOTE SENSING II, 1999, 3754 : 392 - 398
  • [46] Estimating tie strength in social networks using temporal communication data
    Urena-Carrion, Javier
    Saramaki, Jari
    Kivela, Mikko
    EPJ DATA SCIENCE, 2020, 9 (01)
  • [47] Estimating tie strength in social networks using temporal communication data
    Javier Ureña-Carrion
    Jari Saramäki
    Mikko Kivelä
    EPJ Data Science, 9
  • [48] Meta-Sketch: A Neural Data Structure for Estimating Item Frequencies of Data Streams
    Cao, Yukun
    Feng, Yuan
    Xie, Xike
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6916 - +
  • [49] Estimating Lossy Compressibility of Scientific Data Using Deep Neural Networks
    Qin, Zhenlu
    Wang, Jinzhen
    Liu, Qing
    Chen, Jieyang
    Pugmire, Dave
    Podhorszki, Norbert
    Klasky, Scott
    IEEE Letters of the Computer Society, 2020, 3 (01): : 5 - 8
  • [50] Sampled-data Synchronization of Recurrent Neural Networks with Multi-GPUs
    Jin, Yongsik
    Han, Seungyong
    Park, Jongcheon
    Lee, S. M.
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2172 - 2177