Characterization of patients with idiopathic normal pressure hydrocephalus using natural language processing within an electronic healthcare record system

被引:5
|
作者
Funnell, Jonathan P. [1 ,2 ,12 ]
Noor, Kawsar [3 ,4 ]
Khan, Danyal Z. [1 ,2 ]
D'Antona, Linda [2 ,5 ]
Dobson, Richard J. B. [3 ,4 ,6 ,7 ,8 ,9 ]
Hanrahan, John G. [1 ,2 ]
Hepworth, Christopher [10 ]
Moncur, Eleanor M. [2 ,5 ]
Thomas, Benjamin M. [1 ,11 ]
Thorne, Lewis [2 ]
Watkins, Laurence D. [2 ]
Williams, Simon C. [1 ,2 ]
Wong, Wai Keong [4 ,6 ]
Toma, Ahmed K. [2 ,4 ,5 ]
Marcus, Hani J. [1 ,2 ,4 ,5 ]
机构
[1] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
[2] Natl Hosp Neurol & Neurosurg, London, England
[3] UCL, Inst Hlth Informat, London, England
[4] Univ Coll London Hosp NHS Fdn Trust, NIHR Biomed Res Ctr, London, England
[5] UCL, UCL Queen Sq Inst Neurol, London, England
[6] UCL, Hlth Data Res UK London, London, England
[7] South London & Maudsley NHS Fdn Trust, NIHR Biomed Res Ctr, London, England
[8] Kings Coll London, London, England
[9] Kings Coll London, Inst Psychiat Psychol & Neurosci IoPPN, Dept Biostat & Hlth Informat, London, England
[10] B Braun Med Ltd, Sheffield, England
[11] B Braun Med AB, Danderyd, Sweden
[12] UCL, London, England
关键词
artificial intelligence; cerebrospinal fluid diversion; machine learning; natural language processing; SHUNT SURGERY;
D O I
10.3171/2022.9.JNS221095
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
OBJECTIVE Idiopathic normal pressure hydrocephalus (iNPH) is an underdiagnosed, progressive, and disabling condi-tion. Early treatment is associated with better outcomes and improved quality of life. In this paper, the authors aimed to identify features associated with patients with iNPH using natural language processing (NLP) to characterize this cohort, with the intention to later target the development of artificial intelligence-driven tools for early detection. METHODS The electronic health records of patients with shunt-responsive iNPH were retrospectively reviewed using an NLP algorithm. Participants were selected from a prospectively maintained single-center database of patients under-going CSF diversion for probable iNPH (March 2008-July 2020). Analysis was conducted on preoperative health records including clinic letters, referrals, and radiology reports accessed through CogStack. Clinical features were extracted from these records as SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms) concepts using a named entity recognition machine learning model. In the first phase, a base model was generated using unsupervised training on 1 million electronic health records and supervised training with 500 double-annotated documents. The model was fine-tuned to improve accuracy using 300 records from patients with iNPH double annotated by two blinded assessors. Thematic analysis of the concepts identi-fied by the machine learning algorithm was performed, and the frequency and timing of terms were analyzed to describe this patient group. RESULTS In total, 293 eligible patients responsive to CSF diversion were identified. The median age at CSF diversion was 75 years, with a male predominance (69% male). The algorithm performed with a high degree of precision and recall (F1 score 0.92). Thematic analysis revealed the most frequently documented symptoms related to mobility, cognitive impairment, and falls or balance. The most frequent comorbidities were related to cardiovascular and hematological problems. CONCLUSIONS This model demonstrates accurate, automated recognition of iNPH features from medical records. Opportunities for translation include detecting patients with undiagnosed iNPH from primary care records, with the aim to
引用
收藏
页码:1731 / 1739
页数:9
相关论文
共 50 条
  • [31] Identifying Cancer Recurrence from the Electronic Medical Record: Using Natural Language Processing to Interpret Pathology Reports
    Bayona, D.
    Ebner, D. K.
    Weiskittle, T. M.
    Talom, B. C. Kamdem
    Kowalchuk, R. O.
    Breen, W.
    Routman, D. M.
    Waddle, M. R.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2024, 120 (02): : E611 - E611
  • [32] Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data
    Marafino, Ben J.
    Park, Miran
    Davies, Jason M.
    Thombley, Robert
    Luft, Harold S.
    Sing, David C.
    Kazi, Dhruv S.
    DeJong, Colette
    Boscardin, W. John
    Dean, Mitzi L.
    Dudley, R. Adams
    JAMA NETWORK OPEN, 2018, 1 (08)
  • [33] Comprehensive assessment of the true sepsis burden using electronic health record screening augmented by natural language processing
    R Arnold
    J Isserman
    S Smola
    E Jackson
    Critical Care, 18 (Suppl 1):
  • [34] IDENTIFYING PATIENTS WITH ACUTE GASTROINTESTINAL BLEEDING USING NOTE TEXT IN THE ELECTRONIC HEALTH RECORD: A HYBRID NATURAL LANGUAGE PROCESSING AND DEEP LEARNING APPROACH
    Dupont, Johannes
    Zheng, Neil S.
    Laine, Loren
    Hsiao, Allen
    Thomas, Prem
    Partridge, Caitlin M.
    Shung, Dennis
    GASTROENTEROLOGY, 2023, 164 (06) : S838 - S838
  • [35] Identification of Obstructive and Non-Obstructive Hypertrophic Cardiomyopathy Patients Using Natural Language Processing in a Large Integrated Healthcare System
    Sheridan, Ann
    Liu, Jane
    Tabada, Grace
    Tan, Thida
    Dubey, Anand
    Lucas, Matthew
    Zhong, Yue
    Okano, Gary
    Solomon, Matthew
    CIRCULATION, 2024, 150
  • [36] The quantitative motion analysis using portable gait rhythmogram after CSF drainage in the patients with idiopathic normal pressure hydrocephalus
    Makiko Yogo
    Shusaku Omoto
    Masayo Morita
    Masahiko Suzuki
    Fluids and Barriers of the CNS, 12 (Suppl 1)
  • [37] Using algorithms to identify high disease activity relapsing-remitting multiple sclerosis patients using electronic health record data with natural language processing
    Kamauu, A. W. C.
    Le, H. V.
    Truong, C. T. L.
    Crooke, H. R.
    Holmen, J. R.
    Fillmore, C. L.
    Sabido-Espin, M.
    Wong, S. L.
    MULTIPLE SCLEROSIS JOURNAL, 2017, 23 : 453 - 453
  • [38] Analysis of Primary Care Provider Electronic Health Record Notes for Discussions of Prediabetes Using Natural Language Processing Methods
    Tseng, Eva
    Schwartz, Jessica L.
    Rouhizadeh, Masoud
    Maruthur, Nisa M.
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2021,
  • [39] ANALYSIS OF PRIMARY CARE PROVIDER ELECTRONIC HEALTH RECORD NOTES FOR DISCUSSIONS OF PREDIABETES USING NATURAL LANGUAGE PROCESSING METHODS
    Tseng, Eva
    Schwartz, Jessica L.
    Rouhizadeh, Masoud
    Maruthur, Nisa
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2020, 35 (SUPPL 1) : S11 - S12
  • [40] Improving the Efficiency of Clinical Trial Recruitment Using Electronic Health Record Data, Natural Language Processing, and Machine Learning
    Cai, Tianrun
    Cai, Fiona
    Dahal, Kumar
    Hong, Chuan
    Liao, Katherine
    ARTHRITIS & RHEUMATOLOGY, 2019, 71