Using Natural Language Processing to Identify Different Lens Pathology in Electronic Health Records

Cited by: 1
Authors
Stein, Joshua D. [1,2]
Zhou, Yunshu [1]
Andrews, Chris A. [1]
Kim, Judy E. [3]
Addis, Victoria [4]
Bixler, Jill [1]
Grove, Nathan [5]
McMillan, Brian [6]
Munir, Saleha Z. [7]
Pershing, Suzann [8,9]
Schultz, Jeffrey S. [10]
Stagg, Brian C. [11]
Wang, Sophia Y. [8]
Woreta, Fasika [12]
Affiliations
[1] Univ Michigan, WK Kellogg Eye Ctr, Dept Ophthalmol & Visual Sci, 1000 Wall St, Ann Arbor, MI 48105 USA
[2] Univ Michigan, Dept Hlth Management & Policy, Sch Publ Hlth, Ann Arbor, MI USA
[3] Med Coll Wisconsin, Dept Ophthalmol & Visual Sci, Milwaukee, WI USA
[4] Univ Penn, Dept Ophthalmol, Philadelphia, PA USA
[5] Univ Colorado, Dept Ophthalmol, Sch Med, Aurora, CO USA
[6] West Virginia Univ, Dept Ophthalmol & Visual Sci, Morgantown, WV USA
[7] Univ Maryland, Dept Ophthalmol & Visual Sci, Sch Med, Baltimore, MD USA
[8] Stanford Univ, Byers Eye Inst Stanford, Dept Ophthalmol, Stanford, CA USA
[9] VA Palo Alto Hlth Care Syst, Palo Alto, CA USA
[10] Montefiore Med Ctr, Dept Ophthalmol, New York, NY USA
[11] Univ Utah, Dept Ophthalmol, Salt Lake City, UT USA
[12] Johns Hopkins Univ, Dept Ophthalmol, Sch Med, Baltimore, MD USA
Funding
National Institutes of Health (USA)
Keywords
CATARACT-SURGERY;
DOI
10.1016/j.ajo.2024.01.030
Chinese Library Classification number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
PURPOSE: Nearly all published ophthalmology-related Big Data studies rely exclusively on International Classification of Diseases (ICD) billing codes to identify patients with particular ocular conditions. However, inaccurate or nonspecific codes may be used. We assessed whether natural language processing (NLP), as an alternative approach, could more accurately identify lens pathology.
DESIGN: Database study comparing the accuracy of NLP versus ICD billing codes to properly identify lens pathology.
METHODS: We developed an NLP algorithm capable of searching free-text lens exam data in the electronic health record (EHR) to identify the type(s) of cataract present, cataract density, presence of intraocular lenses, and other lens pathology. We applied our algorithm to 17.5 million lens exam records in the Sight Outcomes Research Collaborative (SOURCE) repository. We selected 4314 unique lens-exam entries and asked 11 clinicians to assess whether all pathology present in the entries had been correctly identified in the NLP algorithm output. The algorithm's sensitivity at accurately identifying lens pathology was compared with that of the ICD codes.
RESULTS: The NLP algorithm correctly identified all lens pathology present in 4104 of the 4314 lens-exam entries (95.1%). For less common lens pathology, algorithm findings were corroborated by reviewing clinicians for 100% of mentions of pseudoexfoliation material and 99.7% for phimosis, subluxation, and synechia. Sensitivity at identifying lens pathology was better for NLP (0.98 [0.96-0.99]) than for billing codes (0.49 [0.46-0.53]).
CONCLUSIONS: Our NLP algorithm identifies and classifies lens abnormalities routinely documented by eyecare professionals with high accuracy. Such algorithms will help researchers to properly identify and classify ocular pathology, broadening the scope of feasible research using real-world data.
(Am J Ophthalmol 2024;262:153-160. © 2024 Elsevier Inc. All rights reserved.)
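As a rough illustration of the kind of approach the abstract describes, the sketch below flags lens findings in a free-text exam entry with keyword patterns and computes sensitivity from counts. It is not the authors' algorithm: the pattern table, the labels, and the function names `extract_lens_findings` and `sensitivity` are all hypothetical, and the sketch ignores negation (for example "no PSC"), laterality, and density grading.

```python
import re

# Hypothetical keyword patterns chosen for illustration only; the abstract
# does not describe the rules used by the published algorithm.
PATTERNS = {
    "nuclear_sclerosis": re.compile(r"\b(?:ns|nuclear sclero\w*)\b", re.I),
    "cortical_cataract": re.compile(r"\bcortical\b", re.I),
    "posterior_subcapsular": re.compile(r"\b(?:psc|posterior subcapsular)\b", re.I),
    "intraocular_lens": re.compile(r"\b(?:pciol|aciol|iol|pseudophak\w*)\b", re.I),
    "pseudoexfoliation": re.compile(r"\b(?:pxf|pseudoexfoliation)\b", re.I),
}


def extract_lens_findings(text: str) -> set[str]:
    """Return the labels whose pattern occurs anywhere in the entry."""
    return {label for label, pattern in PATTERNS.items() if pattern.search(text)}


def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Sensitivity = TP / (TP + FN): the share of real cases that were found."""
    return true_positives / (true_positives + false_negatives)


if __name__ == "__main__":
    print(sorted(extract_lens_findings("2+ NS, trace PSC")))
    print(sensitivity(98, 2))
```

Sensitivity here is the proportion of entries with true pathology that a method detects, which is the quantity the abstract compares between NLP and billing codes.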
Pages: 153-160
Page count: 8