Using natural language processing to identify opioid use disorder in electronic health record data

Cited by: 8
Authors
Singleton, Jade [1, 2]
Li, Chengxi [3]
Akpunonu, Peter D. [4]
Abner, Erin L. [1]
Kucharska-Newton, Anna M. [1, 5]
Affiliations
[1] Univ Kentucky, Coll Publ Hlth, Dept Epidemiol, Lexington, KY 40536 USA
[2] Univ Kentucky Healthcare IT Dept, Business Intelligence, Lexington, KY 40517 USA
[3] Univ Kentucky, Coll Engn, Dept Comp Sci, Lexington, KY 40526 USA
[4] Univ Kentucky Hosp, Emergency Med & Med Toxicol, Lexington, KY 40536 USA
[5] Univ North Carolina Chapel Hill, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC 27514 USA
Keywords
Opioid use disorder; Natural language processing; Electronic healthcare records; ICD-10; Abuse; Pain
DOI
10.1016/j.ijmedinf.2022.104963
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: As opioid prescriptions have risen, opioid use disorder (OUD) and its adverse outcomes have also increased. Accurate and complete epidemiologic surveillance of OUD, needed to inform prevention strategies, remains challenging. The objective of this study was to ascertain the prevalence of OUD using two methods of identifying OUD in electronic health records (EHR): natural language processing (NLP) for text mining of unstructured clinical notes, and ICD-10-CM diagnostic codes.

Methods: Data were drawn from EHR for hospital and emergency department patient visits to a large regional academic medical center from 2017 to 2019. International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) discharge codes were extracted for each visit. The rule-based NLP algorithm was developed in a stepwise process. First, a small sample of visits from 2017 was used to develop initial dictionaries. Next, EHR corresponding to 30,124 visits from 2018 were used to develop and evaluate the rule-based algorithm. A random sample of the results was manually reviewed to identify and address shortcomings in the algorithm and to estimate the sensitivity and specificity of the two methods of ascertainment. Last, the final algorithm was applied to 29,212 visits from 2019 to estimate OUD prevalence.

Results: Overall, n = 2,332 unique visits were identified, with substantial overlap between the two methods (n = 1,381 [59.2%] identified by both). Of the total unique visits, 430 (18.4%) were identified only by ICD-10-CM codes, and 521 (22.3%) were identified only by NLP. The prevalence of visits with evidence of an OUD diagnosis in this sample, ascertained using only ICD-10-CM codes, was 1,811/29,212 (6.1%). Including the additional 521 visits identified only by NLP, the estimated prevalence of OUD was 2,332/29,212 (7.9%), an increase of 29.5% compared with the use of ICD-10-CM codes alone. The estimated sensitivity and specificity of the NLP-based OUD classification were 81.8% and 97.5%, respectively, relative to gold-standard manual review by an expert addiction medicine physician.

Conclusion: NLP-based algorithms can automate data extraction and identify evidence of opioid use disorder from unstructured electronic healthcare records. The most complete ascertainment of OUD in EHR was achieved by combining NLP with ICD-10-CM codes. NLP should be considered for epidemiological studies involving EHR data.
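The visit counts reported in the Results can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' code: the constant and function names are assumptions introduced for illustration, and only the four counts are taken from the abstract. Percentages recomputed from the raw counts may differ slightly from the figures quoted above, depending on where rounding is applied.

```python
# Counts taken from the abstract (2019 sample). All names are illustrative.
TOTAL_VISITS = 29_212   # visits screened in 2019
BOTH = 1_381            # identified by both ICD-10-CM codes and NLP
ICD_ONLY = 430          # identified only by ICD-10-CM codes
NLP_ONLY = 521          # identified only by NLP


def summarize(total, both, icd_only, nlp_only):
    """Return overlap shares and prevalence estimates as fractions."""
    icd_total = both + icd_only               # any visit flagged by ICD-10-CM
    combined = both + icd_only + nlp_only     # union of the two methods
    prevalence_icd = icd_total / total
    prevalence_combined = combined / total
    return {
        "icd_total": icd_total,
        "combined_total": combined,
        "share_both": both / combined,
        "share_icd_only": icd_only / combined,
        "share_nlp_only": nlp_only / combined,
        "prevalence_icd": prevalence_icd,
        "prevalence_combined": prevalence_combined,
        # Relative gain in identified visits from adding NLP to ICD-10-CM.
        "relative_increase": nlp_only / icd_total,
    }


summary = summarize(TOTAL_VISITS, BOTH, ICD_ONLY, NLP_ONLY)

if __name__ == "__main__":
    for name, value in summary.items():
        if isinstance(value, int):
            print(f"{name}: {value}")
        else:
            print(f"{name}: {value:.1%}")
```

The three shares are computed over the union of both methods, so they sum to one, and the ICD-10-CM total and the combined total reproduce the numerators given in the Results.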
Pages: 7