Evaluation of a Natural Language Processing Approach to Identify Social Determinants of Health in Electronic Health Records in a Diverse Community Cohort

被引:0
|
作者
Rouillard, Christopher J. [1 ,2 ]
Nasser, Mahmoud A. [2 ]
Hu, Haihong [2 ]
Roblin, Douglas W. [2 ]
机构
[1] Univ Illinois, Carle Illinois Coll Med, Champaign, IL USA
[2] Kaiser Permanente Midatlantic States, Midatlant Permanente Med Grp Pc, Rockville, MD USA
关键词
social determinants of health; social needs; natural language processing; CARE; TOOLS; NEEDS;
D O I
暂无
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Health care systems in the United States are increasingly interested in measuring and addressing social determinants of health (SDoH). Advances in electronic health record systems and Natural Language Processing (NLP) create a unique opportunity to systematically document patient SDoH from digitized free-text provider notes. Methods: Patient SDoH status [recorded by Your Current Life Situation (YCLS) Survey] and associated provider notes recorded between March 2017 and June 2020 were extracted (32,261 beneficiaries; 50,722 YCLS surveys; 485,425 provider notes). NLP patterns were generated using a machine learning test statistic (Term Frequency-Inverse Document Frequency). Patterns were developed and assessed in a training, training validation, and final validation dataset (64%, 16%, and 20% of total data, respectively). NLP models analyzed SDoH-specific categories (housing, medical care, and transportation needs) and a combined SDoH metric. Model performance was assessed using sensitivity, specificity, and Cohen kappa statistic, assuming the YCLS Survey to be the gold standard. Results: Within the training validation dataset, NLP models showed strong sensitivity and specificity, with moderate agreement with the YCLS Survey (Housing: sensitivity = 0.67, specificity = 0.89, kappa = 0.51; Medical care: sensitivity = 0.55, specificity = 0.73, lc = 0.20; Transportation: sensitivity = 0.79, specificity = 0.87, kappa = 0.58). Model performance in the training and training validation datasets were comparable. In the final validation dataset, a combined SDoH prediction metric showed sensitivity = 0.77, specificity = 0.69, kappa = 0.45. Conclusion: This NLP algorithm demonstrated moderate performance in identification of unmet patient social needs. This novel approach may enable improved targeting of interventions, allocation of limited resources and monitoring a health care system's addressing its patients' SDoH needs.
引用
收藏
页码:248 / 255
页数:8
相关论文
共 50 条
  • [41] NATURAL LANGUAGE PROCESSING METHODS ENHANCE MACE IDENTIFICATION FROM ELECTRONIC HEALTH RECORDS
    St Laurent, S.
    Guo, M.
    Alfonso, R.
    Okoro, T.
    Johansen, K.
    Dember, L.
    Lindsay, A.
    [J]. VALUE IN HEALTH, 2018, 21 : S217 - S217
  • [42] Development of a natural language processing algorithm to detect chronic cough in electronic health records
    Bali, Vishal
    Weaver, Jessica
    Turzhitsky, Vladimir
    Schelfhout, Jonathan
    Paudel, Misti L.
    Hulbert, Erin
    Peterson-Brandt, Jesse
    Currie, Anne-Marie Guerra
    Bakka, Dylan
    [J]. BMC PULMONARY MEDICINE, 2022, 22 (01)
  • [43] Development of a natural language processing algorithm to detect chronic cough in electronic health records
    Vishal Bali
    Jessica Weaver
    Vladimir Turzhitsky
    Jonathan Schelfhout
    Misti L. Paudel
    Erin Hulbert
    Jesse Peterson-Brandt
    Anne-Marie Guerra Currie
    Dylan Bakka
    [J]. BMC Pulmonary Medicine, 22
  • [44] Natural Language Processing of Clinical Notes in Electronic Health Records to Improve Capture of Hypoglycemia
    Nunes, Anthony P.
    Yu, Shengsheng
    Kurtyka, Karen
    Senerchia, Cynthia
    Hill, Jefffrey
    Brodovicz, Kimberly G.
    Radican, Larry
    Engel, Samuel S.
    Calvo, Sean R.
    Dore, David D.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2014, 23 : 494 - 494
  • [45] Natural language processing for electronic health records in anaesthesiology: an introduction to clinicians with recommendations and pitfalls
    Bernstorff, Martin
    Vistisen, Simon Tilma
    Enevoldsen, Kenneth C.
    [J]. JOURNAL OF CLINICAL MONITORING AND COMPUTING, 2024, 38 (02) : 241 - 245
  • [46] Ascertainment of Delirium Status Using Natural Language Processing From Electronic Health Records
    Fu, Sunyang
    Lopes, Guilherme S.
    Pagali, Sandeep R.
    Thorsteinsdottir, Bjoerg
    LeBrasseur, Nathan K.
    Wen, Andrew
    Liu, Hongfang
    Rocca, Walter A.
    Olson, Janet E.
    St Sauver, Jennifer
    Sohn, Sunghwan
    [J]. JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2022, 77 (03): : 524 - 530
  • [47] Colonoscopy quality, quality measures, and a natural language processing tool for electronic health records
    Deutsch, John C.
    [J]. GASTROINTESTINAL ENDOSCOPY, 2012, 75 (06) : 1240 - 1242
  • [48] Relevant Word Order Vectorization for Improved Natural Language Processing in Electronic Health Records
    Jeffrey Thompson
    Jinxiang Hu
    Dinesh Pal Mudaranthakam
    David Streeter
    Lisa Neums
    Michele Park
    Devin C. Koestler
    Byron Gajewski
    Roy Jensen
    Matthew S. Mayo
    [J]. Scientific Reports, 9
  • [49] Relevant Word Order Vectorization for Improved Natural Language Processing in Electronic Health Records
    Thompson, Jeffrey
    Hu, Jinxiang
    Mudaranthakam, Dinesh Pal
    Streeter, David
    Neums, Lisa
    Park, Michele
    Koestler, Devin C.
    Gajewski, Byron
    Jensen, Roy
    Mayo, Matthew S.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [50] IDENTIFYING REASONS FOR STATIN NONADHERENCE IN A DIVERSE, REAL-WORLD POPULATION USING ELECTRONIC HEALTH RECORDS AND NATURAL LANGUAGE PROCESSING
    Sarraju, Ashish
    Coquet, Jean
    Chan, Antonia
    Ngo, Summer
    Lossio-Ventura, Juan Antonio
    Hernandez-Boussard, Tina
    Rodriguez, Fatima
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2021, 77 (18) : 1665 - 1665