Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes

被引:3
|
作者
Sajdeya, Ruba [1 ,2 ,7 ]
Mardini, Mamoun T. [3 ]
Tighe, Patrick J. [4 ]
Ison, Ronald L. [4 ]
Bai, Chen [3 ]
Jugl, Sebastian [5 ]
Hanzhi, Gao [6 ]
Zandbiglari, Kimia [5 ]
Adiba, Farzana, I [5 ]
Winterstein, Almut G. [5 ]
Pearson, Thomas A. [1 ,2 ]
Cook, Robert L. [1 ,2 ]
Rouhizadeh, Masoud [2 ,5 ]
机构
[1] Univ Florida, Coll Publ Hlth & Hlth Profess, Dept Epidemiol, Gainesville, FL USA
[2] Univ Florida, Coll Med, Gainesville, FL USA
[3] Univ Florida, Coll Med, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[4] Univ Florida, Coll Med, Dept Anesthesiol, Gainesville, FL USA
[5] Univ Florida, Ctr Drug Evaluat & Safety CoDES, Dept Pharmaceut Outcomes & Policy, Gainesville, FL USA
[6] Univ Florida, Dept Biostat, Gainesville, FL USA
[7] Univ Florida, Emerging Pathogens Inst, Coll Publ Hlth & Hlth Profess, Coll Med, 2055 Mowry Rd,POB 100009, Gainesville, FL 32610 USA
关键词
cannabis; perioperative outcomes; natural language processing; NLP; substance use; social determinants of health; HEALTH; IMPACT; CARE;
D O I
10.1093/jamia/ocad080
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective This study aimed to develop a natural language processing algorithm (NLP) using machine learning (ML) techniques to identify and classify documentation of preoperative cannabis use status. Materials and Methods We developed and applied a keyword search strategy to identify documentation of preoperative cannabis use status in clinical documentation within 60 days of surgery. We manually reviewed matching notes to classify each documentation into 8 different categories based on context, time, and certainty of cannabis use documentation. We applied 2 conventional ML and 3 deep learning models against manual annotation. We externally validated our model using the MIMIC-III dataset. Results The tested classifiers achieved classification results close to human performance with up to 93% and 94% precision and 95% recall of preoperative cannabis use status documentation. External validation showed consistent results with up to 94% precision and recall. Discussion Our NLP model successfully replicated human annotation of preoperative cannabis use documentation, providing a baseline framework for identifying and classifying documentation of cannabis use. We add to NLP methods applied in healthcare for clinical concept extraction and classification, mainly concerning social determinants of health and substance use. Our systematically developed lexicon provides a comprehensive knowledge-based resource covering a wide range of cannabis-related concepts for future NLP applications. Conclusion We demonstrated that documentation of preoperative cannabis use status could be accurately identified using an NLP algorithm. This approach can be employed to identify comparison groups based on cannabis exposure for growing research efforts aiming to guide cannabis-related clinical practices and policies.
引用
下载
收藏
页码:1418 / 1428
页数:11
相关论文
共 50 条
  • [31] Identifying generalized pustular psoriasis flares using natural language processing of unstructured clinical notes and structured procedure codes
    Rasouliyan, Lawrence
    Kumar, Vikas
    Walton, Sabrina E.
    Londhe, Ajit A.
    Feldman, Steven R.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 144 - 145
  • [32] Development of an algorithm using natural language processing to identify metastatic breast cancer patients from clinical notes.
    Swaminathan, Krishna Kumar
    Mendonca, Emma
    Mukherjee, Pranay
    Thirumalai, Karpagavalli
    Newsome, Rachel
    Narayanan, Babu
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (15)
  • [33] SEVERITY SCORE EXTRACTION FROM UNSTRUCTURED CLINICAL NOTES USING A DISEASE-AGNOSTIC NATURAL LANGUAGE PROCESSING QUESTION-ANSWERING PIPELINE
    Kumar, V
    Rasouliyan, L.
    Althoff, A.
    Chang, S.
    Long, S.
    VALUE IN HEALTH, 2023, 26 (06) : S5 - S5
  • [34] Use of Natural Language Processing to Assess Frequency of Functional Status Documentation for Patients Newly Diagnosed With Colorectal Cancer
    Agaronnik, Nicole
    Lindvall, Charlotta
    El-Jawahri, Areej
    He, Wei
    Iezzoni, Lisa
    JAMA ONCOLOGY, 2020, 6 (10) : 1628 - 1630
  • [35] A Natural Language Processing Tool to Extract Potential Blood Transfusion-associated Adverse Events in Clinical Notes
    Wang, Michelle
    Goldgof, Gregory
    Belov, Artur
    Whitaker, Barbee I.
    Anderson, Steven A.
    Butte, Atul
    TRANSFUSION, 2021, 61 : 217A - 218A
  • [36] Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing
    Philip Chung
    Christine T. Fong
    Andrew M. Walters
    Meliha Yetisgen
    Vikas N. O’Reilly-Shah
    BMC Anesthesiology, 23
  • [37] Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing
    Chung, Philip
    Fong, Christine T.
    Walters, Andrew M.
    Yetisgen, Meliha
    O'Reilly-Shah, Vikas N.
    BMC ANESTHESIOLOGY, 2023, 23 (01)
  • [38] Patient Dietary Supplements Use: Do Results from Natural Language Processing of Clinical Notes Agree with Survey Data?
    Redd, Douglas
    Workman, Terri Elizabeth
    Shao, Yijun
    Cheng, Yan
    Tekle, Senait
    Garvin, Jennifer H.
    Brandt, Cynthia A.
    Zeng-Treitler, Qing
    MEDICAL SCIENCES, 2023, 11 (02)
  • [39] Machine learning-based natural language processing to extract PD-L1 expression levels from clinical notes
    Lin, Eric
    Zwolinski, Robert
    Wu, Julie Tsu-Yu
    La, Jennifer
    Goryachev, Sergey
    Huhmann, Linden
    Yildrim, Cenk
    Tuck, David P.
    Elbers, Danne C.
    Brophy, Mary T.
    Do, Nhan, V
    Fillmore, Nathanael R.
    HEALTH INFORMATICS JOURNAL, 2023, 29 (03)
  • [40] Exploring natural language processing techniques to extract semantics from unstructured dataset which will aid in effective semantic interlinking
    Aladakatti, Shweta S.
    Kumar, S. Senthil
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2023, 14 (01)