Sparse multi-output Gaussian processes for online medical time series prediction

被引:20
|
作者
Cheng, Li-Fang [1 ]
Dumitrascu, Bianca [2 ]
Darnell, Gregory [2 ]
Chivers, Corey [3 ]
Draugelis, Michael [3 ]
Li, Kai [4 ]
Engelhardt, Barbara E. [4 ,5 ]
机构
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] Princeton Univ, Lewis Sigler Inst, Princeton, NJ 08544 USA
[3] Univ Penn Hlth Syst, Philadelphia, PA USA
[4] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[5] Princeton Univ, Ctr Stat & Machine Learning, Princeton, NJ 08544 USA
关键词
Gaussian processes; Electronic health records; Sparse time series; Spectral mixture kernel; MODELS;
D O I
10.1186/s12911-020-1069-4
中图分类号
R-058 [];
学科分类号
摘要
Background For real-time monitoring of hospital patients, high-quality inference of patients' health status using all information available from clinical covariates and lab test results is essential to enable successful medical interventions and improve patient outcomes. Developing a computational framework that can learn from observational large-scale electronic health records (EHRs) and make accurate real-time predictions is a critical step. In this work, we develop and explore a Bayesian nonparametric model based on multi-output Gaussian process (GP) regression for hospital patient monitoring. Methods We propose MedGP, a statistical framework that incorporates 24 clinical covariates and supports a rich reference data set from which relationships between observed covariates may be inferred and exploited for high-quality inference of patient state over time. To do this, we develop a highly structured sparse GP kernel to enable tractable computation over tens of thousands of time points while estimating correlations among clinical covariates, patients, and periodicity in patient observations. MedGP has a number of benefits over current methods, including (i) not requiring an alignment of the time series data, (ii) quantifying confidence regions in the predictions, (iii) exploiting a vast and rich database of patients, and (iv) inferring interpretable relationships among clinical covariates. Results We evaluate and compare results from MedGP on the task of online prediction for three patient subgroups from two medical data sets across 8,043 patients. We find MedGP improves online prediction over baseline and state-of-the-art methods for nearly all covariates across different disease subgroups and hospitals. Conclusions The MedGP framework is robust and efficient in estimating the temporal dependencies from sparse and irregularly sampled medical time series data for online prediction. The publicly available code is at.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Sparse multi-output Gaussian processes for online medical time series prediction
    Li-Fang Cheng
    Bianca Dumitrascu
    Gregory Darnell
    Corey Chivers
    Michael Draugelis
    Kai Li
    Barbara E Engelhardt
    [J]. BMC Medical Informatics and Decision Making, 20
  • [2] Online Sparse Multi-Output Gaussian Process Regression and Learning
    Yang, Le
    Wang, Ke
    Mihaylova, Lyudmila
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2019, 5 (02): : 258 - 272
  • [3] GAP FILLING OF BIOPHYSICAL PARAMETER TIME SERIES WITH MULTI-OUTPUT GAUSSIAN PROCESSES
    Mateo-Sanchis, Anna
    Munoz-Mari, Jordi
    Campos-Taberner, Manuel
    Garcia-Haro, Javier
    Camps-Valls, Gustau
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4039 - 4042
  • [4] Collaborative Multi-output Gaussian Processes
    Nguyen, Trung V.
    Bonilla, Edwin V.
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 643 - 652
  • [5] Federated Multi-Output Gaussian Processes
    Chung, Seokhyun
    Al Kontar, Raed
    [J]. TECHNOMETRICS, 2024, 66 (01) : 90 - 103
  • [6] Urban Network Travel Time Prediction via Online Multi-Output Gaussian Process Regression
    Rodriguez-Deniz, Hector
    Jenelius, Erik
    Villani, Mattias
    [J]. 2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [7] Multi-output Infinite Horizon Gaussian Processes
    Lim, Jaehyun
    Park, Jehyun
    Nah, Sungjae
    Choi, Jongeun
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1542 - 1549
  • [8] Heterogeneous Multi-output Gaussian Process Prediction
    Moreno-Munoz, Pablo
    Artes-Rodriguez, Antonio
    Alvarez, Mauricio A.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [9] Scalable Exact Inference in Multi-Output Gaussian Processes
    Bruinsma, Wessel P.
    Perim, Eric
    Tebbutt, Will
    Hosking, J. Scott
    Solin, Arno
    Turner, Richard E.
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [10] Spectral Mixture Kernels for Multi-Output Gaussian Processes
    Parra, Gabriel
    Tobar, Felipe
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30