Contextualized race and ethnicity annotations for clinical text from MIMIC-III

被引:1
|
作者
Oliver J. Bear Don’t Walk [1 ]
Adrienne Pichon [2 ]
Harry Reyes Nieva [2 ]
Tony Sun [3 ]
Jaan Li [2 ]
Josh Joseph [4 ]
Sivan Kinberg [5 ]
Lauren R. Richter [3 ]
Salvatore Crusco [6 ]
Kyle Kulas [2 ]
Shaan A. Ahmed [2 ]
Daniel Snyder [2 ]
Ashkon Rahbari [7 ]
Benjamin L. Ranard [2 ]
Pallavi Juneja [2 ]
Dina Demner-Fushman [2 ]
Noémie Elhadad [2 ]
机构
[1] University of Washington,
[2] Columbia University Irving Medical Center,undefined
[3] Harvard Medical School,undefined
[4] One Fact Foundation,undefined
[5] University of Tartu,undefined
[6] Brigham and Women’s Hospital,undefined
[7] NewYork-Presbyterian Hospital,undefined
[8] US National Library of Medicine,undefined
关键词
D O I
10.1038/s41597-024-04183-2
中图分类号
学科分类号
摘要
Observational health research often relies on accurate and complete race and ethnicity (RE) patient information, such as characterizing cohorts, assessing quality/performance metrics of hospitals and health systems, and identifying health disparities. While the electronic health record contains structured data such as accessible patient-level RE data, it is often missing, inaccurate, or lacking granular details. Natural language processing models can be trained to identify RE in clinical text which can supplement missing RE data in clinical data repositories. Here we describe the Contextualized Race and Ethnicity Annotations for Clinical Text (C-REACT) Dataset, which comprises 12,000 patients and 17,281 sentences from their clinical notes in the MIMIC-III dataset. Using these sentences, two sets of reference standard annotations for RE data are made available with annotation guidelines. The first set of annotations comprise highly granular information related to RE, such as preferred language and country of origin, while the second set contains RE labels annotated by physicians. This dataset can support health systems’ ability to use RE data to serve health equity goals.
引用
收藏
相关论文
共 50 条
  • [1] Extracting Alarm Events from the MIMIC-III Clinical Database
    Chromik, Jonas
    Pfitzner, Bjarne
    Ihde, Nina
    Michaelis, Marius
    Schmidt, Denise
    Klopfenstein, Sophie Anne Ines
    Poncette, Akira-Sebastian
    Balzer, Felix
    Arnrich, Bert
    HEALTHINF: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 5: HEALTHINF, 2021, : 328 - 335
  • [2] Clinical Characteristics of Aortic Aneurysm in MIMIC-III
    Song, Kun
    Guo, Cuirong
    Yang, Kongzhi
    Li, Changluo
    Ding, Ning
    HEART SURGERY FORUM, 2021, 24 (02): : E351 - E358
  • [3] Strategies of Predictive Schemes and Clinical Diagnosis for Prognosis Using MIMIC-III: A Systematic Review
    Khope, Sarika R.
    Elias, Susan
    HEALTHCARE, 2023, 11 (05)
  • [4] Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset
    Searle, Thomas
    Ibrahim, Zina
    Dobson, Richard J. B.
    19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020), 2020, : 76 - 85
  • [5] Platelets as a prognostic marker for sepsis A cohort study from the MIMIC-III database
    Zhao, Lina
    Zhao, Lijiao
    Wang, Yun Ying
    Yang, Fei
    Chen, Zhuang
    Yu, Qing
    Shi, Hui
    Huang, Shiying
    Zhao, Xiaoli
    Xiu, Limei
    Li, Xiaolu
    Li, Yun
    MEDICINE, 2020, 99 (45)
  • [6] THE CORRELATION BETWEEN THE HYPOALBUMINAEMIA AND HYPOCALCAEMIA IN SEPSIS PATIENTS: A RETROSPECTIVE STUDY FROM MIMIC-III
    Li, Weijia
    Huang, Lei
    Luo, Hua
    Zhang, Weixing
    He, Wencheng
    ACTA MEDICA MEDITERRANEA, 2022, 38 (05): : 3229 - 3237
  • [7] A novel nomogram to predict mortality in patients with stroke: a survival analysis based on the MIMIC-III clinical database
    Xiao-Dan Li
    Min-Min Li
    BMC Medical Informatics and Decision Making, 22
  • [8] A novel nomogram to predict mortality in patients with stroke: a survival analysis based on the MIMIC-III clinical database
    Li, Xiao-Dan
    Li, Min-Min
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [9] ASSOCIATION OF SEX WITH CLINICAL OUTCOME IN CRITICALLY ILL SEPSIS PATIENTS: A RETROSPECTIVE ANALYSIS OF THE LARGE CLINICAL DATABASE MIMIC-III
    Xu, Jinghong
    Tong, Li
    Yao, Jiyou
    Guo, Zilu
    Lui, Ka Yin
    Hu, XiaoGuang
    Cao, Lu
    Zhu, Yanping
    Huang, Fa
    Guan, Xiangdong
    Cai, Changjie
    SHOCK, 2019, 52 (02): : 146 - 151
  • [10] Prognostic Value of Blood Urea Nitrogen/Creatinine Ratio for Septic Shock: An Analysis of the MIMIC-III Clinical Database
    Han, Didi
    Zhang, Luming
    Zheng, Shuai
    Xu, Fengshuo
    Li, Chengzhuo
    Yang, Rui
    Ma, Wen
    Yin, Haiyan
    Lyu, Jun
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021