Evaluation of a Natural Language Processing Approach to Identify Social Determinants of Health in Electronic Health Records in a Diverse Community Cohort

被引:0
|
作者
Rouillard, Christopher J. [1 ,2 ]
Nasser, Mahmoud A. [2 ]
Hu, Haihong [2 ]
Roblin, Douglas W. [2 ]
机构
[1] Univ Illinois, Carle Illinois Coll Med, Champaign, IL USA
[2] Kaiser Permanente Midatlantic States, Midatlant Permanente Med Grp Pc, Rockville, MD USA
关键词
social determinants of health; social needs; natural language processing; CARE; TOOLS; NEEDS;
D O I
暂无
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Health care systems in the United States are increasingly interested in measuring and addressing social determinants of health (SDoH). Advances in electronic health record systems and Natural Language Processing (NLP) create a unique opportunity to systematically document patient SDoH from digitized free-text provider notes. Methods: Patient SDoH status [recorded by Your Current Life Situation (YCLS) Survey] and associated provider notes recorded between March 2017 and June 2020 were extracted (32,261 beneficiaries; 50,722 YCLS surveys; 485,425 provider notes). NLP patterns were generated using a machine learning test statistic (Term Frequency-Inverse Document Frequency). Patterns were developed and assessed in a training, training validation, and final validation dataset (64%, 16%, and 20% of total data, respectively). NLP models analyzed SDoH-specific categories (housing, medical care, and transportation needs) and a combined SDoH metric. Model performance was assessed using sensitivity, specificity, and Cohen kappa statistic, assuming the YCLS Survey to be the gold standard. Results: Within the training validation dataset, NLP models showed strong sensitivity and specificity, with moderate agreement with the YCLS Survey (Housing: sensitivity = 0.67, specificity = 0.89, kappa = 0.51; Medical care: sensitivity = 0.55, specificity = 0.73, lc = 0.20; Transportation: sensitivity = 0.79, specificity = 0.87, kappa = 0.58). Model performance in the training and training validation datasets were comparable. In the final validation dataset, a combined SDoH prediction metric showed sensitivity = 0.77, specificity = 0.69, kappa = 0.45. Conclusion: This NLP algorithm demonstrated moderate performance in identification of unmet patient social needs. This novel approach may enable improved targeting of interventions, allocation of limited resources and monitoring a health care system's addressing its patients' SDoH needs.
引用
收藏
页码:248 / 255
页数:8
相关论文
共 50 条
  • [32] Applying Natural Language Processing Toolkits to Electronic Health Records - An Experience Report
    Barrett, Neil
    Weber-Jahnke, Jens H.
    [J]. ADVANCES IN INFORMATION TECHNOLOGY AND COMMUNICATION IN HEALTH, 2009, 143 : 441 - 446
  • [33] Neural Natural Language Processing for unstructured data in electronic health records: A review
    Li, Irene
    Pan, Jessica
    Goldwasser, Jeremy
    Verma, Neha
    Wong, Wai Pan
    Nuzumlali, Muhammed Yavuz
    Rosand, Benjamin
    Li, Yixin
    Zhang, Matthew
    Chang, David
    Taylor, R. Andrew
    Krumholz, Harlan M.
    Radev, Dragomir
    [J]. COMPUTER SCIENCE REVIEW, 2022, 46
  • [34] Using Natural Language Processing of Electronic Health Records to Identify Patients with ANCA-Associated Vasculitides in the Veterans Affairs
    DuVall, Scott L.
    Kamauu, Aaron W. C.
    Napalkov, Pavel
    Anglemyer, Andrew T.
    Cantrell, Ronald A.
    Koening, Curry L.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2012, 21 : 118 - 118
  • [36] Natural Language Processing to Identify Cancer Treatments With Electronic Medical Records
    Zeng, Jiaming
    Banerjee, Imon
    Henry, A. Solomon
    Wood, Douglas J.
    Shachter, Ross D.
    Gensheimer, Michael F.
    Rubin, Daniel L.
    [J]. JCO CLINICAL CANCER INFORMATICS, 2021, 5 : 379 - 393
  • [37] A Road Map to Integrate Social Determinants of Health into Electronic Health Records
    Palacio, Ana
    Suarez, Maritza
    Tamariz, Leonardo
    Seo, David
    [J]. POPULATION HEALTH MANAGEMENT, 2017, 20 (06) : 424 - 426
  • [38] Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality
    Romain Bey
    Ariel Cohen
    Vincent Trebossen
    Basile Dura
    Pierre-Alexis Geoffroy
    Charline Jean
    Benjamin Landman
    Thomas Petit-Jean
    Gilles Chatellier
    Kankoe Sallah
    Xavier Tannier
    Aurelie Bourmaud
    Richard Delorme
    [J]. npj Mental Health Research, 3 (1):
  • [39] Prediction and evaluation of combination pharmacotherapy using natural language processing, machine learning and patient electronic health records
    Ding, Pingjian
    Pan, Yiheng
    Wang, Quanqiu
    Xu, Rong
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 133
  • [40] NATURAL LANGUAGE PROCESSING METHODS ENHANCE MACE IDENTIFICATION FROM ELECTRONIC HEALTH RECORDS
    St Laurent, S.
    Guo, M.
    Alfonso, R.
    Okoro, T.
    Johansen, K.
    Dember, L.
    Lindsay, A.
    [J]. VALUE IN HEALTH, 2018, 21 : S217 - S217