Unsupervised probabilistic models for sequential Electronic Health Records

被引:0
|
作者
Kaplan, Alan D. [1 ]
Greene, John D. [2 ]
Liu, Vincent X. [2 ]
Ray, Priyadip [1 ]
机构
[1] Lawrence Livermore Natl Lab, Computat Engn Div, 7000 East Ave, Livermore, CA 94550 USA
[2] Kasler Permanente Div Res, 2000 Broadway, Oakland, CA 94612 USA
关键词
Unsupervised learning; EHR data; Mixture modeling; Subgroup analysis; MIXTURE; SEPSIS; HMM;
D O I
10.1016/j.jbi.2022.104163
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We develop an unsupervised probabilistic model for heterogeneous Electronic Health Record (EHR) data. Utilizing a mixture model formulation, our approach directly models sequences of arbitrary length, such as medications and laboratory results. This allows for subgrouping and incorporation of the dynamics underlying heterogeneous data types. The model consists of a layered set of latent variables that encode underlying structure in the data. These variables represent subject subgroups at the top layer, and unobserved states for sequences in the second layer. We train this model on episodic data from subjects receiving medical care in the Kaiser Permanente Northern California integrated healthcare delivery system. The resulting properties of the trained model generate novel insight from these complex and multifaceted data. In addition, we show how the model can be used to analyze sequences that contribute to assessment of mortality likelihood.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Continuous-time probabilistic models for longitudinal electronic health records
    Kaplan, Alan D.
    Tipnis, Uttara
    Beckham, Jean C.
    Kimbrel, Nathan A.
    Oslin, David W.
    McMahon, Benjamin H.
    MVP Suicide Exemplar Workgroup, M. V. P. Suicide Exemplar Workgroup
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 130
  • [2] The application of unsupervised deep learning in predictive models using electronic health records
    Lei Wang
    Liping Tong
    Darcy Davis
    Tim Arnold
    Tina Esposito
    [J]. BMC Medical Research Methodology, 20
  • [3] The application of unsupervised deep learning in predictive models using electronic health records
    Wang, Lei
    Tong, Liping
    Davis, Darcy
    Arnold, Tim
    Esposito, Tina
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2020, 20 (01)
  • [4] Probabilistic modeling personalized treatment pathways using electronic health records
    Huang, Zhengxing
    Ge, Zhenxiao
    Dong, Wei
    He, Kunlun
    Duan, Huilong
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 86 : 33 - 48
  • [5] Scalable and Interpretable Predictive Models for Electronic Health Records
    Fejza, Amela
    Geneves, Pierre
    Layaida, Nabil
    Bosson, Jean-Luc
    [J]. 2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 341 - 350
  • [6] Unsupervised Learning to Subphenotype Heart Failure Patients from Electronic Health Records
    Hackl, Melanie
    Datta, Suparno
    Miotto, Riccardo
    Bottinger, Erwin
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2021), 2021, : 219 - 228
  • [7] Generating sequential electronic health records using dual adversarial autoencoder
    Lee, Dongha
    Yu, Hwanjo
    Jiang, Xiaoqian
    Rogith, Deevakar
    Gudala, Meghana
    Tejani, Mubeen
    Zhang, Qiuchen
    Xiong, Li
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1411 - 1419
  • [8] SEQUENTIAL MODELS IN PROBABILISTIC DEPRECIATION
    IJIRI, Y
    KAPLAN, RS
    [J]. JOURNAL OF ACCOUNTING RESEARCH, 1970, 8 (01) : 34 - 46
  • [9] A probabilistic topic model for clinical risk stratification from electronic health records
    Huang, Zhengxing
    Dong, Wei
    Duan, Hilong
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 : 28 - 36
  • [10] An Unsupervised Learning Approach to Identify Immunoglobulin Utilization Patterns Using Electronic Health Records
    Li, N.
    Riazi, K.
    Arnold, D.
    Sidhu, D.
    Barty, R.
    Heddle, N.
    Callum, J.
    Down, D.
    [J]. TRANSFUSION, 2023, 63 : 261A - 262A