Exploratory Data Analysis in Electronic Health Records Graphs: Intuitive Features and Visualization Tools

被引:1
|
作者
Cazzolato, Mirela T. [1 ,2 ]
Gutierrez, Marco Antonio [2 ]
Traina, Cactano, Jr. [1 ]
Faloutsos, Christos [3 ]
Traina, Agma J. M. [1 ]
机构
[1] Univ Sao Paulo ICMC USP, Inst Math & Comp Sci, Sao Carlos, Brazil
[2] Univ Sao Paulo HC FMUSP, Heart Inst InCor, Clin Hosp, Fac Med, Sao Paulo, Brazil
[3] Carnegie Mellon Univ, Pittsburgh, PA USA
基金
巴西圣保罗研究基金会;
关键词
Exploratory data analysis; electronic health records; graph mining; visualization; features; VISUAL ANALYTICS;
D O I
10.1109/CBMS58004.2023.00202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a large, unlabeled set of Electronic Health Records (EHRs) acquired from multiple hospitals, how can we analyze the available entities and identify relationships in the data? Also, how can we perform Exploratory Data Analysis (EDA) over such EHR data? Many medical institutions generate EHRs as tabular data with entities and attributes in common. However, due to a large number of records, attributes, and high cardinality, exploring the different datasets and finding patterns and insights become laborious and prone to errors. In this work, we propose GraF-EDA for EDA over EHR data from different institutions. GraF-EDA models EHRs as time-evolving graphs, allowing the interoperability of such data into a single representation. We extract meaningful features from the graph nodes and provide intuitive visualizations to improve data explainability. We evaluate GraF-EDA with four COVID-19 datasets from hospitals of the Sao Paulo state, Brazil, resulting in million-scale graphs. Our method identified correlations, similarities and dissimilarities among medical treatments, exams, clinics, and outcomes. With the visual tools provided by GraF-EDA, we were able to spot cases of interest and check more details about them. Our results indicate that GraF-EDA is a fast, effective, open-sourced tool for EDA of EHRs from multiple institutions.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 50 条
  • [41] Stochastic algorithms for exploratory data analysis: Data clustering and data visualization
    Buhmann, JM
    [J]. LEARNING IN GRAPHICAL MODELS, 1998, 89 : 405 - 419
  • [42] VitaPad: visualization tools for the analysis of pathway data
    Holford, M
    Li, NX
    Nadkarni, P
    Zhao, HY
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1596 - 1602
  • [43] Software tools for analysis and visualization of fMRI data
    Cox, RW
    Hyde, JS
    [J]. NMR IN BIOMEDICINE, 1997, 10 (4-5) : 171 - 178
  • [44] Harnessing modern web application technology to create intuitive and efficient data visualization and sharing tools
    Wood, Dylan
    King, Margaret
    Landis, Drew
    Courtney, William
    Wang, Runtang
    Kelly, Ross
    Turner, Jessica A.
    Calhoun, Vince D.
    [J]. FRONTIERS IN NEUROINFORMATICS, 2014, 8
  • [45] An Analytical Evaluation of Information Visualization Factors for Multiple Electronic Health Records
    Malik, Muhammad Sheraz Arshad
    Sulaiman, Suziah
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2016, : 126 - 131
  • [46] Comparative adherence to diabetes drugs: An analysis of electronic health records and claims data
    Flory, James
    Gerhard, Tobias
    Stempniewicz, Nikita
    Keating, Scott
    Rowan, Christopher G.
    [J]. DIABETES OBESITY & METABOLISM, 2017, 19 (08): : 1184 - 1187
  • [47] Patients' Adoption of Electronic Personal Health Records in England: Secondary Data Analysis
    Abd-Alrazaq, Alaa
    Alalwan, Ali Abdallah
    McMillan, Brian
    Bewick, Bridgette M.
    Househ, Mowafa
    Al-Zyadat, Alaa T.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (10)
  • [48] Social determinants of health: Data standardization in electronic health records
    Cummins, Mollie R.
    Hardiker, Nicholas
    Wang, Jing
    Wilson, Marisa
    Sward, Katherine
    Chernecky, Cynthia
    Roberts, Darryl
    Langford, Laura Heermann
    [J]. NURSING OUTLOOK, 2022, 70 (03) : 528 - 534
  • [49] Integrating Data On Social Determinants Of Health Into Electronic Health Records
    Cantor, Michael N.
    Thorpe, Lorna
    [J]. HEALTH AFFAIRS, 2018, 37 (04) : 585 - 590
  • [50] What is the impact of electronic health records on the quality of health data?
    Callen, Joanne
    [J]. HEALTH INFORMATION MANAGEMENT JOURNAL, 2014, 43 (01) : 42 - 43