Exploratory Data Analysis in Electronic Health Records Graphs: Intuitive Features and Visualization Tools

被引:1
|
作者
Cazzolato, Mirela T. [1 ,2 ]
Gutierrez, Marco Antonio [2 ]
Traina, Cactano, Jr. [1 ]
Faloutsos, Christos [3 ]
Traina, Agma J. M. [1 ]
机构
[1] Univ Sao Paulo ICMC USP, Inst Math & Comp Sci, Sao Carlos, Brazil
[2] Univ Sao Paulo HC FMUSP, Heart Inst InCor, Clin Hosp, Fac Med, Sao Paulo, Brazil
[3] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
巴西圣保罗研究基金会;
关键词
Exploratory data analysis; electronic health records; graph mining; visualization; features; VISUAL ANALYTICS;
D O I
10.1109/CBMS58004.2023.00202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a large, unlabeled set of Electronic Health Records (EHRs) acquired from multiple hospitals, how can we analyze the available entities and identify relationships in the data? Also, how can we perform Exploratory Data Analysis (EDA) over such EHR data? Many medical institutions generate EHRs as tabular data with entities and attributes in common. However, due to a large number of records, attributes, and high cardinality, exploring the different datasets and finding patterns and insights become laborious and prone to errors. In this work, we propose GraF-EDA for EDA over EHR data from different institutions. GraF-EDA models EHRs as time-evolving graphs, allowing the interoperability of such data into a single representation. We extract meaningful features from the graph nodes and provide intuitive visualizations to improve data explainability. We evaluate GraF-EDA with four COVID-19 datasets from hospitals of the Sao Paulo state, Brazil, resulting in million-scale graphs. Our method identified correlations, similarities and dissimilarities among medical treatments, exams, clinics, and outcomes. With the visual tools provided by GraF-EDA, we were able to spot cases of interest and check more details about them. Our results indicate that GraF-EDA is a fast, effective, open-sourced tool for EDA of EHRs from multiple institutions.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 50 条
  • [1] A richly interactive exploratory data analysis and visualization tool using electronic medical records
    Huang, Chih-Wei
    Lu, Richard
    Iqbal, Usman
    Lin, Shen-Hsien
    Phung Anh Nguyen
    Yang, Hsuan-Chia
    Wang, Chun-Fu
    Li, Jianping
    Ma, Kwan-Liu
    Li, Yu-Chuan
    Jian, Wen-Shan
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15
  • [2] A richly interactive exploratory data analysis and visualization tool using electronic medical records
    Chih-Wei Huang
    Richard Lu
    Usman Iqbal
    Shen-Hsien Lin
    Phung Anh (Alex) Nguyen
    Hsuan-Chia Yang
    Chun-Fu Wang
    Jianping Li
    Kwan-Liu Ma
    Yu-Chuan (Jack) Li
    Wen-Shan Jian
    [J]. BMC Medical Informatics and Decision Making, 15
  • [3] Multimodal Data Analysis and Visualization to Study the Usage of Electronic Health Records
    Weibel, Nadir
    Ashfaq, Shazia
    Calvitti, Alan
    Hollan, James D.
    Agha, Zia
    [J]. PROCEEDINGS OF THE 2013 7TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING TECHNOLOGIES FOR HEALTHCARE AND WORKSHOPS (PERVASIVEHEALTH 2013), 2013, : 282 - 283
  • [4] Data-Driven Activities Involving Electronic Health Records: An Activity and Task Analysis Framework for Interactive Visualization Tools
    Rostamzadeh, Neda
    Abdullah, Sheikh S.
    Sedig, Kamran
    [J]. MULTIMODAL TECHNOLOGIES AND INTERACTION, 2020, 4 (01)
  • [5] Textual analysis and visualization of research trends in data mining for electronic health records
    Chen, Jingfeng
    Wei, Wei
    Guo, Chonghui
    Tang, Lin
    Sun, Leilei
    [J]. HEALTH POLICY AND TECHNOLOGY, 2017, 6 (04) : 389 - 400
  • [6] Heimdall, a Computer Program for Electronic Health Records Data Visualization
    Martignene, Niels
    Balcaen, Thibaut
    Bouzille, Guillaume
    Calafiore, Matthieu
    Beuscart, Jean-Baptiste
    Lamer, Antoine
    Legrand, Bertrand
    Ficheur, Gregoire
    Chazard, Emmanuel
    [J]. DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 247 - 251
  • [7] A Detailed Study on Temporal Data Visualization Techniques in Electronic Health Records
    Younas, Aisha
    Malik, Muhammad Sheraz Arhsad
    Khalil-ur-Rehman
    Shahid, Rabia
    [J]. 2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 638 - 643
  • [8] Divvy: Fast and Intuitive Exploratory Data Analysis
    Lewis, Joshua M.
    de Sa, Virginia R.
    van der Maaten, Laurens
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 3159 - 3163
  • [9] RAVEL: Retrieval And Visualization in ELectronic health records
    Thiessard, Frantz
    Mougin, Fleur
    Diallo, Gayo
    Jouhet, Vianney
    Cossin, Sebastien
    Garcelon, Nicolas
    Campillo, Boris
    Jouini, Wassim
    Grosjean, Julien
    Massari, Philippe
    Griffon, Nicolas
    Dupuch, Marie
    Tayalati, Fayssal
    Dugas, Edwige
    Balvet, Antonio
    Grabar, Natalia
    Pereira, Suzanne
    Frandji, Bruno
    Darmoni, Stefan
    Cuggia, Marc
    [J]. QUALITY OF LIFE THROUGH QUALITY OF INFORMATION, 2012, 180 : 194 - 198
  • [10] Exploratory Analysis of Animal Bites Events in the City of Buenos Aires Using Data from Electronic Health Records
    Nicolas Quintana, Gaston
    Esteban, Santiago
    [J]. DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 1283 - 1284