Contextualized race and ethnicity annotations for clinical text from MIMIC-III

被引:1
|
作者
Oliver J. Bear Don’t Walk [1 ]
Adrienne Pichon [2 ]
Harry Reyes Nieva [2 ]
Tony Sun [3 ]
Jaan Li [2 ]
Josh Joseph [4 ]
Sivan Kinberg [5 ]
Lauren R. Richter [3 ]
Salvatore Crusco [6 ]
Kyle Kulas [2 ]
Shaan A. Ahmed [2 ]
Daniel Snyder [2 ]
Ashkon Rahbari [7 ]
Benjamin L. Ranard [2 ]
Pallavi Juneja [2 ]
Dina Demner-Fushman [2 ]
Noémie Elhadad [2 ]
机构
[1] University of Washington,
[2] Columbia University Irving Medical Center,undefined
[3] Harvard Medical School,undefined
[4] One Fact Foundation,undefined
[5] University of Tartu,undefined
[6] Brigham and Women’s Hospital,undefined
[7] NewYork-Presbyterian Hospital,undefined
[8] US National Library of Medicine,undefined
关键词
D O I
10.1038/s41597-024-04183-2
中图分类号
学科分类号
摘要
Observational health research often relies on accurate and complete race and ethnicity (RE) patient information, such as characterizing cohorts, assessing quality/performance metrics of hospitals and health systems, and identifying health disparities. While the electronic health record contains structured data such as accessible patient-level RE data, it is often missing, inaccurate, or lacking granular details. Natural language processing models can be trained to identify RE in clinical text which can supplement missing RE data in clinical data repositories. Here we describe the Contextualized Race and Ethnicity Annotations for Clinical Text (C-REACT) Dataset, which comprises 12,000 patients and 17,281 sentences from their clinical notes in the MIMIC-III dataset. Using these sentences, two sets of reference standard annotations for RE data are made available with annotation guidelines. The first set of annotations comprise highly granular information related to RE, such as preferred language and country of origin, while the second set contains RE labels annotated by physicians. This dataset can support health systems’ ability to use RE data to serve health equity goals.
引用
收藏
相关论文
共 50 条
  • [31] Reporting Race and Ethnicity in Population Health and Clinical Research From Japan
    Saeki, Soichiro
    Kusumoto, Misa
    JOURNAL OF EPIDEMIOLOGY, 2023, 33 (11) : 589 - 590
  • [32] CLINICAL TRIAL EXPERIENCE WITH THE 9-VALENT HPV VACCINE BY RACE/ETHNICITY: A COMBINED ANALYSIS FROM SEVEN PHASE III CLINICAL STUDIES
    Clark, Liana R.
    Luxembourg, Alain T.
    JOURNAL OF ADOLESCENT HEALTH, 2016, 58 (02) : S118 - S118
  • [33] Lactate dehydrogenase to albumin ratio is associated with in-hospital mortality in patients with acute heart failure: Data from the MIMIC-III database
    Xia, Xiangjun
    Tan, Suisai
    Zeng, Runhong
    Ouyang, Can
    Huang, Xiabin
    OPEN MEDICINE, 2024, 19 (01):
  • [34] Relationship between mean corpuscular volume and 30-day mortality in patients with intracerebral hemorrhage Evidence from the MIMIC-III database
    Zhang, Lu
    Yin, Jiahui
    Sun, Haiyang
    Li, Jinling
    Zhao, Xuelian
    Liu, Yuanxiang
    Yang, Jiguo
    MEDICINE, 2022, 101 (44) : E31415
  • [35] The Positive and Negative Effects of Calcium Supplementation on Mortality in Septic ICU Patients Depend on Disease Severity: A Retrospective Study from the MIMIC-III
    He, Wencheng
    Huang, Lei
    Luo, Hua
    Chen, Jingying
    Li, Weijia
    Zhang, Yiming
    An, Youzhong
    Zhang, Weixing
    CRITICAL CARE RESEARCH AND PRACTICE, 2022, 2022
  • [36] A Novel Nomogram for Predicting Morbidity Risk in Patients with Secondary Malignant Neoplasm of Bone and Bone Marrow: An Analysis Based on the Large MIMIC-III Clinical Database
    Miao, Guiqiang
    Li, Zhaohui
    Chen, Linjian
    Li, Wenyong
    Lan, Guobo
    Chen, Qiyuan
    Luo, Zhen
    Liu, Ruijia
    Zhao, Xiaodong
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2022, 15 : 3255 - 3264
  • [37] Association between Mean Arterial Pressure during the First 24 Hours and Clinical Outcome in Critically Ill Stroke Patients: An Analysis of the MIMIC-III Database
    Zhang, Sheng
    Cui, Yun-Liang
    Yu, Sheng
    Shang, Wei-Feng
    Li, Jie
    Pan, Xiao-Jun
    Wen, Zhen-Liang
    Huang, Si-Si
    Chen, Li-Min
    Shen, Xuan
    Yu, Yue-Tian
    Liu, Jiao
    Chen, De-Chang
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (04)
  • [38] Prevalence and Prognostic Impact of Malnutrition in Critical Patients With Acute Myocardial Infarction: Results From Chinese CIN Cohort and American MIMIC-III Database
    Lu, Jin
    Huang, Zhidong
    Wang, Junjie
    Zhao, Xiaoli
    Yang, Yanfang
    Wu, Bo
    Kang, Yu
    Xiu, Jiaming
    Tu, Jiabin
    Pan, Yuxiong
    Chen, Weihua
    Bao, Kunming
    Chen, Liling
    Liu, Jin
    Liu, Yong
    Chen, Shiqun
    Fang, Yong
    Chen, Kaihong
    FRONTIERS IN NUTRITION, 2022, 9
  • [39] Admission oxygen saturation and all-cause in-hospital mortality in acute myocardial infarction patients: data from the MIMIC-III database
    Yu, Yue
    Wang, Jun
    Wang, Qing
    Wang, Junnan
    Min, Jie
    Wang, Suyu
    Wang, Pei
    Huang, Renhong
    Xiao, Jian
    Zhang, Yufeng
    Wang, Zhinong
    ANNALS OF TRANSLATIONAL MEDICINE, 2020, 8 (21)
  • [40] Association between platelet-lymphocyte ratio and 90-day mortality in patients with intracerebral hemorrhage: data from the MIMIC-III database
    Yuan, Min
    Xiao, Zhilong
    Zhou, Huangyan
    Fu, Anxia
    Pei, Zhimin
    FRONTIERS IN NEUROLOGY, 2023, 14