Using Natural Language Processing to Identify Different Lens Pathology in Electronic Health Records

被引:1
|
作者
Stein, Joshua d. [1 ,2 ]
Zhou, Yunshu [1 ]
Andrews, Chris a. [1 ]
Kim, Judy e. [3 ]
Addis, Victoria [4 ]
Bixler, Jill [1 ]
Grove, Nathan [5 ]
Mcmillan, Brian [6 ]
Munir, Saleha z. [7 ]
Pershing, Suzann [8 ,9 ]
Schultz, Jeffrey s. [10 ]
Stagg, Brian c. [11 ]
Wang, Sophia y. [8 ]
Woreta, Fasika [12 ]
机构
[1] Univ Michigan, WK Kellogg Eye Ctr, Dept Ophthalmol & Visual Sci, 1000 Wall St, Ann Arbor, MI 48105 USA
[2] Univ Michigan, Dept Hlth Management & Policy, Sch Publ Hlth, Ann Arbor, MI USA
[3] Med Coll Wisconsin, Dept Ophthalmol & Visual Sci, Milwaukee, WI USA
[4] Univ Penn, Dept Ophthalmol, Philadelphia, PA USA
[5] Univ Colorado, Dept Ophthalmol, Sch Med, Aurora, CO USA
[6] West Virginia Univ, Dept Ophthalmol & Visual Sci, Morgantown, WV USA
[7] Univ Maryland, Dept Ophthalmol & Visual Sci, Sch Med, Baltimore, MD USA
[8] Stanford Univ, Byers Eye Inst Stanford, Dept Ophthalmol, Stanford, CA USA
[9] VA Palo Alto Hlth Care Syst, , Califomia, Palo Alto, CA USA
[10] Montefiore Med Ctr, Dept Ophthalmol, New York, NY USA
[11] Univ Utah, Dept Ophthalmol, Salt Lake City, UT USA
[12] Johns Hopkins Univ, Dept Ophthalmol, Sch Med, Baltimore, MD USA
基金
美国国家卫生研究院;
关键词
CATARACT-SURGERY;
D O I
10.1016/j.ajo.2024.01.030
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
center dot PURPOSE: Nearly all published ophthalmology-related Big Data studies rely exclusively on International Classification of Diseases (ICD) billing codes to identify patients with particular ocular conditions. However, inaccurate or nonspecific codes may be used. We assessed whether natural language processing (NLP), as an alternative approach, could more accurately identify lens pathology. center dot DESIGN: Database study comparing the accuracy of NLP versus ICD billing codes to properly identify lens pathology. center dot METHODS: We developed an NLP algorithm capable of searching free-text lens exam data in the electronic health record (EHR) to identify the type(s) of cataract present, cataract density, presence of intraocular lenses, and other lens pathology. We applied our algorithm to 17.5 million lens exam records in the Sight Outcomes Research Collaborative (SOURCE) repository. We selected 4314 unique lens-exam entries and asked 11 clinicians to assess whether all pathology present in the entries had been correctly identified in the NLP algorithm output. The algorithm's sensitivity at accurately identifying lens pathology was compared with that of the ICD codes. center dot RESULTS: The NLP algorithm correctly identified all lens pathology present in 4104 of the 4314 lens-exam entries (95.1%). For less common lens pathology, algorithm findings were corroborated by reviewing clinicians for 100% of mentions of pseudoexfoliation material and 99.7% for phimosis, subluxation, and synechia. Sensitivity at identifying lens pathology was better for NLP (0.98 [0.96-0.99] than for billing codes (0.49 [0.46-0.53]). center dot CONCLUSIONS: Our NLP algorithm identifies and classifies lens abnormalities routinely documented by eyecare professionals with high accuracy. Such algorithms will help researchers to properly identify and classify ocular pathology, broadening the scope of feasible research using real-world data. (Am J Ophthalmol 2024;262: 153-160. (c) 2024 Elsevier Inc. All rights reserved.)
引用
收藏
页码:153 / 160
页数:8
相关论文
共 50 条
  • [32] Neural Natural Language Processing for unstructured data in electronic health records: A review
    Li, Irene
    Pan, Jessica
    Goldwasser, Jeremy
    Verma, Neha
    Wong, Wai Pan
    Nuzumlali, Muhammed Yavuz
    Rosand, Benjamin
    Li, Yixin
    Zhang, Matthew
    Chang, David
    Taylor, R. Andrew
    Krumholz, Harlan M.
    Radev, Dragomir
    COMPUTER SCIENCE REVIEW, 2022, 46
  • [33] Using an artificial intelligence tool incorporating natural language processing to identify patients with a diagnosis of ANCA-associated vasculitis in electronic health records
    van Leeuwen, Jolijn R.
    Penne, Erik L.
    Rabelink, Ton
    Knevel, Rachel
    Teng, Y. K. Onno
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [34] Natural language generation for electronic health records
    Lee, Scott H.
    NPJ DIGITAL MEDICINE, 2018, 1
  • [35] Natural language generation for electronic health records
    Scott H. Lee
    npj Digital Medicine, 1
  • [36] Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing
    Wang, Sophia Y.
    Tseng, Benjamin
    Hernandez-Boussard, Tina
    OPHTHALMOLOGY SCIENCE, 2022, 2 (02):
  • [37] Active Computerized Pharmacovigilance Using Natural Language Processing, Statistics, and Electronic Health Records: A Feasibility Study
    Wang, Xiaoyan
    Hripcsak, George
    Markatou, Marianthi
    Friedman, Carol
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2009, 16 (03) : 328 - 337
  • [38] Identifying ANCA-Associated Vasculitis Cases in Electronic Health Records Using Natural Language Processing
    Wallace, Zachary
    Stone, John H.
    Choi, Hyon K.
    ARTHRITIS & RHEUMATOLOGY, 2018, 70
  • [39] Development and Validation of an Algorithm to Identify Prostate Cancer Related Mortality in Electronic Medical Records Using Natural Language Processing
    DiBello, Julia R.
    Wallner, Lauren P.
    Zheng, Chengyi
    Yu, Wei
    Li, Bonnie H.
    VanDenEeden, Stephen K.
    Weinmann, Sheila
    Ritzwoller, Debra
    Richert-Boe, Kathryn
    Jacobsen, Stephen J.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2015, 24 : 418 - 419
  • [40] Natural Language Processing and Electronic Medical Records Reply
    Murff, Harvey J.
    FitzHenry, Fern
    Speroff, Theodore
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 306 (21): : 2325 - 2326