quEHRy: a question answering system to query electronic health records

被引:3
|
作者
Soni, Sarvesh [1 ]
Datta, Surabhi [1 ]
Roberts, Kirk [1 ,2 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, 7000 Fannin St,Suite 600, Houston, TX 77030 USA
关键词
question answering; electronic health records; natural language processing; artificial intelligence; machine learning; FHIR; CLINICAL QUESTIONS; CARE;
D O I
10.1093/jamia/ocad050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective We propose a system, quEHRy, to retrieve precise, interpretable answers to natural language questions from structured data in electronic health records (EHRs). Materials and Methods We develop/synthesize the main components of quEHRy: concept normalization (MetaMap), time frame classification (new), semantic parsing (existing), visualization with question understanding (new), and query module for FHIR mapping/processing (new). We evaluate quEHRy on 2 clinical question answering (QA) datasets. We evaluate each component separately as well as holistically to gain deeper insights. We also conduct a thorough error analysis for a crucial subcomponent, medical concept normalization. Results Using gold concepts, the precision of quEHRy is 98.33% and 90.91% for the 2 datasets, while the overall accuracy was 97.41% and 87.75%. Precision was 94.03% and 87.79% even after employing an automated medical concept extraction system (MetaMap). Most incorrectly predicted medical concepts were broader in nature than gold-annotated concepts (representative of the ones present in EHRs), eg, Diabetes versus Diabetes Mellitus, Non-Insulin-Dependent. Discussion The primary performance barrier to deployment of the system is due to errors in medical concept extraction (a component not studied in this article), which affects the downstream generation of correct logical structures. This indicates the need to build QA-specific clinical concept normalizers that understand EHR context to extract the "relevant" medical concepts from questions. Conclusion We present an end-to-end QA system that allows information access from EHRs using natural language and returns an exact, verifiable answer. Our proposed system is high-precision and interpretable, checking off the requirements for clinical use.
引用
收藏
页码:1091 / 1102
页数:12
相关论文
共 50 条
  • [1] Question Answering for Electronic Health Records: Scoping Review of Datasets and Models
    Bardhan, Jayetri
    Roberts, Kirk
    Wang, Daisy Zhe
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [2] Clinical Judgement Study using Question Answering from Electronic Health Records
    Rawat, Bhanu Pratap Singh
    Li, Fei
    Yu, Hong
    MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 106, 2019, 106
  • [3] Improving Fine-Tuned Question Answering Models for Electronic Health Records
    Mairittha, Tittaya
    Mairittha, Nattaya
    Inoue, Sozo
    UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 688 - 691
  • [4] The research on query expansion for chinese question answering system
    Yu, ZT
    Fan, XZ
    Song, LR
    Guo, JY
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 1, PROCEEDINGS, 2005, 3613 : 571 - 579
  • [5] emrQA: A Large Corpus for Question Answering on Electronic Medical Records
    Pampari, Anusri
    Raghavan, Preethi
    Liang, Jennifer
    Peng, Jian
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2357 - 2368
  • [6] Uncertainty-Aware Text-to-Program for Question Answering on Structured Electronic Health Records
    Kim, Daeyoung
    Bae, Seongsu
    Kim, Seungho
    Choi, Edward
    CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 138 - 151
  • [7] DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
    Bardhan, Jayetri
    Colas, Anthony
    Roberts, Kirk
    Wang, Daisy Zhe
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1083 - 1097
  • [8] Study and Development of Question Answering System based on Ontology Query
    Liu, Xiaoqiang
    Guo, Zhenbo
    Wang, Kaixi
    Jiang, Wenxu
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND COMPUTER APPLICATION, 2016, 30 : 430 - 432
  • [9] Text-to-SQL Generation for Question Answering on Electronic Medical Records
    Wang, Ping
    Shi, Tian
    Reddy, Chandan K.
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 350 - 361
  • [10] Question Answering for Complex Electronic Health Records Database using Unified Encoder-Decoder Architecture
    Bae, Seongsu
    Kim, Daeyoung
    Kim, Jiho
    Choi, Edward
    MACHINE LEARNING FOR HEALTH, VOL 158, 2021, 158 : 13 - 25