Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0)

被引:97
|
作者
Jagannatha, Abhyuday [1 ]
Liu, Feifan [2 ]
Liu, Weisong [3 ,4 ]
Yu, Hong [1 ,3 ,4 ,5 ]
机构
[1] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA
[2] Univ Massachusetts, Med Sch, Dept Quantitat Hlth Sci & Radiol, Worcester, MA 01605 USA
[3] Univ Massachusetts, Dept Comp Sci, 220 Pawtucket St, Lowell, MA 01854 USA
[4] Univ Massachusetts, Med Sch, Dept Med, Worcester, MA 01605 USA
[5] Bedford VAMC, Bedford, MA 01730 USA
基金
美国国家卫生研究院;
关键词
SYSTEM; INFORMATION; PHARMACOVIGILANCE; PREDICTION; SAFETY; CORPUS; UMLS;
D O I
10.1007/s40264-018-0762-z
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
IntroductionThis work describes the Medication and Adverse Drug Events from Electronic Health Records (MADE 1.0) corpus and provides an overview of the MADE 1.0 2018 challenge for extracting medication, indication, and adverse drug events (ADEs) from electronic health record (EHR) notes.ObjectiveThe goal of MADE is to provide a set of common evaluation tasks to assess the state of the art for natural language processing (NLP) systems applied to EHRs supporting drug safety surveillance and pharmacovigilance. We also provide benchmarks on the MADE dataset using the system submissions received in the MADE 2018 challenge.MethodsThe MADE 1.0 challenge has released an expert-annotated cohort of medication and ADE information comprising 1089 fully de-identified longitudinal EHR notes from 21 randomly selected patients with cancer at the University of Massachusetts Memorial Hospital. Using this cohort as a benchmark, the MADE 1.0 challenge designed three shared NLP tasks. The named entity recognition (NER) task identifies medications and their attributes (dosage, route, duration, and frequency), indications, ADEs, and severity. The relation identification (RI) task identifies relations between the named entities: medication-indication, medication-ADE, and attribute relations. The third shared task (NER-RI) evaluates NLP models that perform the NER and RI tasks jointly. In total, 11 teams from four countries participated in at least one of the three shared tasks, and 41 system submissions were received in total.ResultsThe best systems F-1 scores for NER, RI, and NER-RI were 0.82, 0.86, and 0.61, respectively. Ensemble classifiers using the team submissions improved the performance further, with an F-1 score of 0.85, 0.87, and 0.66 for the three tasks, respectively.ConclusionMADE results show that recent progress in NLP has led to remarkable improvements in NER and RI tasks for the clinical domain. However, some room for improvement remains, particularly in the NER-RI task.
引用
收藏
页码:99 / 111
页数:13
相关论文
共 50 条
  • [11] Extracting Family History Information From Electronic Health Records: Natural Language Processing Analysis
    Rybinski, Maciej
    Dai, Xiang
    Singh, Sonit
    Karimi, Sarvnaz
    Nguyen, Anthony
    JMIR MEDICAL INFORMATICS, 2021, 9 (04)
  • [12] A Rules-Based, Natural Language Processing Approach to Identifying Trauma History From Psychiatric Notes in the Electronic Health Record
    Lepow, Lauren
    Patra, Braja
    Landi, Isotta
    Nadkarni, Girish
    Pathak, Jyotishman
    Yehuda, Rachel
    Glicksberg, Benjamin
    Charney, Alexander
    NEUROPSYCHOPHARMACOLOGY, 2021, 46 (SUPPL 1) : 103 - 104
  • [13] Using Natural Language Processing Techniques to Detect Adverse Events From Progress Notes Due to Chemotherapy
    Mashima, Yukinori
    Tamura, Takashi
    Kunikata, Jun
    Tada, Shinobu
    Yamada, Akiko
    Tanigawa, Masatoshi
    Hayakawa, Akiko
    Tanabe, Hirokazu
    Yokoi, Hideto
    CANCER INFORMATICS, 2022, 21
  • [14] Using Natural Language Processing Techniques to Detect Adverse Events From Progress Notes Due to Chemotherapy
    Mashima, Yukinori
    Tamura, Takashi
    Kunikata, Jun
    Tada, Shinobu
    Yamada, Akiko
    Tanigawa, Masatoshi
    Hayakawa, Akiko
    Tanabe, Hirokazu
    Yokoi, Hideto
    CANCER INFORMATICS, 2022, 21
  • [15] Identifying adverse drug reactions from free-text electronic hospital health record notes
    Wasylewicz, Arthur
    van de Burgt, Britt
    Weterings, Aniek
    Jessurun, Naomi
    Korsten, Erik
    Egberts, Toine
    Bouwman, Arthur
    Kerskes, Marieke
    Grouls, Rene
    van der Linden, Carolien
    BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2022, 88 (03) : 1235 - 1245
  • [16] Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning
    Chen, Long
    Gu, Yu
    Ji, Xin
    Sun, Zhiyong
    Li, Haodan
    Gao, Yuan
    Huang, Yang
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) : 56 - 64
  • [17] Extracting social determinants of health from electronic health records using natural language processing: a systematic review
    Patra, Braja G.
    Sharma, Mohit M.
    Vekaria, Veer
    Adekkanattu, Prakash
    Patterson, Olga, V
    Glicksberg, Benjamin
    Lepow, Lauren A.
    Ryu, Euijung
    Biernacka, Joanna M.
    Furmanchuk, Al'ona
    George, Thomas J.
    Hogan, William
    Wu, Yonghui
    Yang, Xi
    Bian, Jiang
    Weissman, Myrna
    Wickramaratne, Priya
    Mann, J. John
    Olfson, Mark
    Campion, Thomas R., Jr.
    Weiner, Mark
    Pathak, Jyotishman
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (12) : 2716 - 2727
  • [18] Statistical inference for natural language processing algorithms with a demonstration using type 2 diabetes prediction from electronic health record notes
    Egleston, Brian L.
    Bai, Tian
    Bleicher, Richard J.
    Taylor, Stanford J.
    Lutz, Michael H.
    Vucetic, Slobodan
    BIOMETRICS, 2021, 77 (03) : 1089 - 1100
  • [19] Performance of a Natural Language Processing Method to Extract Stone Composition From the Electronic Health Record
    Bejan, Cosmin A.
    Lee, Daniel J.
    Xu, Yaomin
    Hsi, Ryan S.
    UROLOGY, 2019, 132 : 56 - 62