The Secondary Use of Electronic Health Records for Data Mining: Data Characteristics and Challenges

被引:33
|
作者
Sarwar, Tabinda [1 ]
Seifollahi, Sattar [1 ]
Chan, Jeffrey [1 ]
Zhang, Xiuzhen [1 ]
Aksakalli, Vural [1 ]
Hudson, Irene [1 ]
Verspoor, Karin [1 ]
Cavedon, Lawrence [1 ]
机构
[1] RMIT Univ, 124 La Trobe St, Melbourne, Vic 3000, Australia
关键词
EHR; data types; data characteristic; data challenges; data mining; health analytics; CHRONIC KIDNEY-DISEASE; MISSING DATA; BIG DATA; CLINICAL NOTES; OLDER-ADULTS; PRIMARY-CARE; DATA-DRIVEN; AUTOMATIC IDENTIFICATION; ARTIFICIAL-INTELLIGENCE; RISK STRATIFICATION;
D O I
10.1145/3490234
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The primary objective of implementing Electronic Health Records (EHRs) is to improve the management of patients' health-related information. However, these records have also been extensively used for the secondary purpose of clinical research and to improve healthcare practice. EHRs provide a rich set of information that includes demographics, medical history, medications, laboratory test results, and diagnosis. Data mining and analytics techniques have extensively exploited EHR information to study patient cohorts for various clinical and research applications, such as phenotype extraction, precision medicine, intervention evaluation, disease prediction, detection, and progression. But the presence of diverse data types and associated characteristics poses many challenges to the use of EHR data. In this article, we provide an overview of information found in EHR systems and their characteristics that could be utilized for secondary applications. We first discuss the different types of data stored in EHRs, followed by the data transformations necessary for data analysis and mining. Later, we discuss the data quality issues and characteristics of the EHRs along with the relevant methods used to address them. Moreover, this survey also highlights the usage of various data types for different applications. Hence, this article can serve as a primer for researchers to understand the use of EHRs for data mining and analytics purposes.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Challenges in data quality assurance for electronic health records
    Shabestari, Omid
    Roudsari, Abdul
    [J]. Studies in Health Technology and Informatics, 2013, 183 : 37 - 41
  • [2] ELECTRONIC HEALTH RECORDS DATA AND METADATA: Challenges for Big Data in the United States
    Sweet, Lauren E.
    Moulaison, Heather Lea
    [J]. BIG DATA, 2013, 1 (04) : BD245 - BD251
  • [3] Explore Data Quality Challenges Based on Data Structure of Electronic Health Records
    Liu, Caihua
    Peng, Guochao
    Lan, Chaowang
    Kong, Shufeng
    [J]. HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION, HIMI 2023, PT I, 2023, 14015 : 236 - 247
  • [4] Mining electronic health records: challenges and impact
    Menasalvas, Ernestina
    Rodriguez-Gonzalez, Alejandro
    Gonzalo, Consuelo
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 747 - 754
  • [5] Use of Data from Electronic Health Records for Pharmacoepidemiology
    Michael D. Murray
    [J]. Current Epidemiology Reports, 2014, 1 (4) : 186 - 193
  • [6] Mining Electronic Health Records Data: Domestic Violence and Adverse Health Effects
    Karakurt, Gunnur
    Patel, Vishal
    Whiting, Kathleen
    Koyuturk, Mehmet
    [J]. JOURNAL OF FAMILY VIOLENCE, 2017, 32 (01) : 79 - 87
  • [7] Mining for equitable health: Assessing the impact of missing data in electronic health records
    Getzen, Emily
    Ungar, Lyle
    Mowery, Danielle
    Jiang, Xiaoqian
    Long, Qi
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 139
  • [8] Mining Electronic Health Records Data: Domestic Violence and Adverse Health Effects
    Gunnur Karakurt
    Vishal Patel
    Kathleen Whiting
    Mehmet Koyutürk
    [J]. Journal of Family Violence, 2017, 32 : 79 - 87
  • [9] Challenges and opportunities beyond structured data in analysis of electronic health records
    Tayefi, Maryam
    Ngo, Phuong
    Chomutare, Taridzo
    Dalianis, Hercules
    Salvi, Elisa
    Budrionis, Andrius
    Godtliebsen, Fred
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (06)
  • [10] Applying Data Mining Techniques to Standardized Electronic Health Records for Decision Support
    Batra, Shivani
    Sachdeva, Shelly
    Parashar, Hem Jyotsana
    Mehndiratta, Pulkit
    [J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2013, : 510 - 515