Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups

被引:3
|
作者
Merkelbach, Kilian [1 ]
Schaper, Steffen [2 ]
Diedrich, Christian [2 ]
Fritsch, Sebastian Johannes [3 ,4 ]
Schuppert, Andreas [1 ]
机构
[1] Rhein Westfal TH Aachen, JRC COMBINE, MTZ, Pauwelsstr 19,Level 3, D-52074 Aachen, Germany
[2] Bayer AG Pharmaceut, Pharmacometr Modeling & Simulat, Leverkusen, Germany
[3] Univ Hosp RWTH Aachen, Dept Intens Care Med, Pauwelsstr 30, D-52074 Aachen, Germany
[4] Forschungszentrum Julich, Juelich Supercomp Ctr, Wilhelm Johnen Str, D-52428 Julich, Germany
关键词
D O I
10.1038/s41598-023-30986-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, which could eventually lead to better treatment through personalized medicine. Patient data derived from EHRs is heterogeneous and temporally irregular. Therefore, traditional machine learning methods like PCA are ill-suited for analysis of EHR-derived patient data. We propose to address these issues with a new methodology based on training a gated recurrent unit (GRU) autoencoder directly on health record data. Our method learns a low-dimensional feature space by training on patient data time series, where the time of each data point is expressed explicitly. We use positional encodings for time, allowing our model to better handle the temporal irregularity of the data. We apply our method to data from the Medical Information Mart for Intensive Care (MIMIC-III). Using our data-derived feature space, we can cluster patients into groups representing major classes of disease patterns. Additionally, we show that our feature space exhibits a rich substructure at multiple scales.
引用
收藏
页数:22
相关论文
共 5 条
  • [1] Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
    Kilian Merkelbach
    Steffen Schaper
    Christian Diedrich
    Sebastian Johannes Fritsch
    Andreas Schuppert
    Scientific Reports, 13
  • [2] Dynamically Modeling Patient's Health State from Electronic Medical Records: A Time Series Approach
    Caballero, Karla
    Akella, Ram
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 69 - 78
  • [3] Learning and diSentangling patient static information from time-series Electronic hEalth Records (STEER)
    Liao, Wei
    Voldman, Joel
    PLOS DIGITAL HEALTH, 2024, 3 (10):
  • [4] Deep learning on time series laboratory test results from electronic health records for early detection of pancreatic cancer
    Park, Jiheum
    Artin, Michael G.
    Lee, Kate E.
    Pumpalova, Yoanna S.
    Ingram, Myles A.
    May, Benjamin L.
    Park, Michael
    Hur, Chin
    Tatonetti, Nicholas P.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 131
  • [5] Deep learning on time series laboratory test results from electronic health records for early detection of pancreatic cancer.
    Park, Jiheum
    Artin, Michael
    Lee, Kate E.
    Pumpalova, Yoanna S.
    Ingram, Myles
    May, Benjamin
    Park, Michael
    Hur, Chin
    Tatonetti, Nicholas
    JOURNAL OF CLINICAL ONCOLOGY, 2022, 40 (16) : E16268 - E16268