Learning and visualizing chronic latent representations using electronic health records

被引:3
|
作者
Chushig-Muzo, David [1 ]
Soguero-Ruiz, Cristina [1 ]
Bohoyo, Pablo de Miguel [2 ]
Mora-Jimenez, Inmaculada [1 ]
机构
[1] Rey Juan Carlos Univ, Dept Signal Theory & Commun Telemat & Comp Syst, Madrid, Spain
[2] Univ Hosp Fuenlabrada, Madrid, Spain
关键词
Denoising Autoencoder; Chronic diseases; Diabetes; Hypertension; Clustering; Patient representation; Synthetic patient; Health status trajectory; DIABETES-MELLITUS; RISK; CLASSIFICATION; DIMENSIONALITY; MULTIMORBIDITY; PATTERNS; DISEASE;
D O I
10.1186/s13040-022-00303-z
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Nowadays, patients with chronic diseases such as diabetes and hypertension have reached alarming numbers worldwide. These diseases increase the risk of developing acute complications and involve a substantial economic burden and demand for health resources. The widespread adoption of Electronic Health Records (EHRs) is opening great opportunities for supporting decision-making. Nevertheless, data extracted from EHRs are complex (heterogeneous, high-dimensional and usually noisy), hampering the knowledge extraction with conventional approaches. Methods We propose the use of the Denoising Autoencoder (DAE), a Machine Learning (ML) technique allowing to transform high-dimensional data into latent representations (LRs), thus addressing the main challenges with clinical data. We explore in this work how the combination of LRs with a visualization method can be used to map the patient data in a two-dimensional space, gaining knowledge about the distribution of patients with different chronic conditions. Furthermore, this representation can be also used to characterize the patient's health status evolution, which is of paramount importance in the clinical setting. Results To obtain clinical LRs, we considered real-world data extracted from EHRs linked to the University Hospital of Fuenlabrada in Spain. Experimental results showed the great potential of DAEs to identify patients with clinical patterns linked to hypertension, diabetes and multimorbidity. The procedure allowed us to find patients with the same main chronic disease but different clinical characteristics. Thus, we identified two kinds of diabetic patients with differences in their drug therapy (insulin and non-insulin dependant), and also a group of women affected by hypertension and gestational diabetes. We also present a proof of concept for mapping the health status evolution of synthetic patients when considering the most significant diagnoses and drugs associated with chronic patients. Conclusion Our results highlighted the value of ML techniques to extract clinical knowledge, supporting the identification of patients with certain chronic conditions. Furthermore, the patient's health status progression on the two-dimensional space might be used as a tool for clinicians aiming to characterize health conditions and identify their more relevant clinical codes.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Learning and visualizing chronic latent representations using electronic health records
    David Chushig-Muzo
    Cristina Soguero-Ruiz
    Pablo de Miguel Bohoyo
    Inmaculada Mora-Jiménez
    BioData Mining, 15
  • [2] Unsupervised Annotation of Phenotypic Abnormalities via Semantic Latent Representations on Electronic Health Records
    Zhang, Jingqing
    Zhang, Xiaoyu
    Sun, Kai
    Yang, Xian
    Dai, Chengliang
    Guo, Yike
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 598 - 603
  • [3] Learning latent heterogeneity for type 2 diabetes patients using longitudinal health markers in electronic health records
    Lou, Jitong
    Wang, Yuanjia
    Li, Lang
    Zeng, Donglin
    STATISTICS IN MEDICINE, 2021, 40 (08) : 1930 - 1946
  • [4] Unsupervised machine learning for the discovery of latent disease clusters and patient subgroups using electronic health records
    Wang, Yanshan
    Zhao, Yiqing
    Therneau, Terry M.
    Atkinson, Elizabeth J.
    Tafti, Ahmad P.
    Zhang, Nan
    Amin, Shreyasee
    Limper, Andrew H.
    Khosla, Sundeep
    Liu, Hongfang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 102
  • [5] Unsupervised Machine Learning for the Discovery of Latent Clusters in COVID-19 Patients Using Electronic Health Records
    Cui, Wanting
    Robins, Daniel
    Finkelstein, Joseph
    IMPORTANCE OF HEALTH INFORMATICS IN PUBLIC HEALTH DURING A PANDEMIC, 2020, 272 : 1 - 4
  • [6] Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations
    Weegar, Rebecka
    Sundstrom, Karin
    PLOS ONE, 2020, 15 (08):
  • [7] Subphenotyping depression using machine learning and electronic health records
    Xu, Zhenxing
    Wang, Fei
    Adekkanattu, Prakash
    Bose, Budhaditya
    Vekaria, Veer
    Brandt, Pascal
    Jiang, Guoqian
    Kiefer, Richard C.
    Luo, Yuan
    Pacheco, Jennifer A.
    Rasmussen, Luke V.
    Xu, Jie
    Alexopoulos, George
    Pathak, Jyotishman
    LEARNING HEALTH SYSTEMS, 2020, 4 (04):
  • [8] Readmission prediction using deep learning on electronic health records
    Ashfaq, Awais
    Sant'Anna, Anita
    Lingman, Markus
    Nowaczyk, Slawomir
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 97
  • [9] Federated Learning for Electronic Health Records
    Dang, Trung Kien
    Lan, Xiang
    Weng, Jianshu
    Feng, Mengling
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (05)
  • [10] Comparing Machine Learning Models for Identifying Chronic Cough Using Diagnosis and Medication in the Electronic Health Records
    Bali, Vishal
    Luo, Xiao
    Gandhi, Priyanka
    Zhang, Zuoyi
    Shao, Wei
    Han, Zhi
    Chandrasekaran, Vasu
    Turzhitsky, Vladimir
    Roberts, Anna
    Metzger, Megan
    Baker, Jarod
    La Rosa, Carmen
    Weaver, Jessica
    Dexter, Paul
    Huang, Kun
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2021, 147 (02) : AB61 - AB61