Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing

被引:1
|
作者
Gandomi, Amir [1 ,2 ,7 ]
Hasan, Eusha [1 ,3 ]
Chusid, Jesse [1 ,3 ,4 ]
Paul, Subroto [1 ,3 ,5 ]
Inra, Matthew [1 ,3 ,5 ]
Makhnevich, Alex [1 ,2 ,3 ,4 ]
Raoof, Suhail [3 ,4 ,5 ]
Silvestri, Gerard [6 ]
Bade, Brett C. [1 ,2 ,3 ,5 ]
Cohen, Stuart L. [1 ,2 ,3 ,4 ]
机构
[1] Northwell, New Hyde Pk, NY USA
[2] Inst Hlth Syst Sci, Feinstein Inst Med Res, Manhasset, NY USA
[3] Donald & Barbara Zucker Sch Med Hofstra Northwell, Hempstead, NY USA
[4] North Shore Univ Hosp, Northwell, Manhasset, NY USA
[5] Lenox Hill Hosp, Northwell, New York, NY USA
[6] Med Univ South Carolina, Charleston, SC USA
[7] Hofstra Univ, Frank G Zarb Sch Business, Hempstead, NY USA
关键词
LC screening; Lung-RADS score; Follow-up; Manual entry; Natural language processing;
D O I
10.1016/j.ijmedinf.2024.105580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: Radiology scoring systems are critical to the success of lung cancer screening (LCS) programs, impacting patient care, adherence to follow-up, data management and reporting, and program evaluation. Lung CT Screening Reporting and Data System (Lung-RADS) is a structured radiology scoring system that provides recommendations for LCS follow-up that are utilized (a) in clinical care and (b) by LCS programs monitoring rates of adherence to follow-up. Thus, accurate reporting and reliable collection of Lung-RADS scores are fundamental components of LCS program evaluation and improvement. Unfortunately, due to variability in radiology reports, extraction of Lung-RADS scores is non-trivial, and best practices do not exist. The purpose of this project is to compare mechanisms to extract Lung-RADS scores from free-text radiology reports. Methods: We retrospectively analyzed reports of LCS low-dose computed tomography (LDCT) examinations performed at a multihospital integrated healthcare network in New York State between January 2016 and July 2023. We compared three methods of Lung-RADS score extraction: manual physician entry at time of report creation, manual LCS specialist entry after report creation, and an internally developed, rule-based natural language processing (NLP) algorithm. Accuracy, recall, precision, and completeness (i.e., the proportion of LCS exams to which a Lung-RADS score has been assigned) were compared between the three methods. Results: The dataset includes 24,060 LCS examinations on 14,243 unique patients. The mean patient age was 65 years, and most patients were male (54 %) and white (75 %). Completeness rate was 65 %, 68 %, and 99 % for radiologists' manual entry, LCS specialists' entry, and NLP algorithm, respectively. Accuracy, recall, and precision were high across all extraction methods (>94 %), though the NLP-based approach was consistently higher than both manual entries in all metrics. Discussion: An NLP-based method of LCS score determination is an efficient and more accurate means of extracting Lung-RADS scores than manual review and data entry. NLP-based methods should be considered best practice for extracting structured Lung-RADS scores from free-text radiology reports.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Automating Stroke Data Extraction From Free-Text Radiology Reports Using Natural Language Processing: Instrument Validation Study
    Yu, Amy Y. X.
    Liu, Zhongyu A.
    Pou-Prom, Chloe
    Lopes, Kaitlyn
    Kapral, Moira K.
    Aviv, Richard, I
    Mamdani, Muhammad
    JMIR MEDICAL INFORMATICS, 2021, 9 (05)
  • [22] An Integrated Voice Recognition and Natural Language Processing Platform to Automatically Extract Thoracolumbar Injury Classification Score (TLICS) Features from Radiology Reports
    Bhandarkar, Archis R.
    Onyedimma, Chiduziem
    Jarrah, Ryan
    Fu, Sunyang
    Liu, Hongfang
    Bydon, Mohamad
    NEUROSURGERY, 2022, 68 : 50 - 51
  • [23] Image Text Extraction and Natural Language Processing of Unstructured Data from Medical Reports
    Malashin, Ivan
    Masich, Igor
    Tynchenko, Vadim
    Gantimurov, Andrei
    Nelyub, Vladimir
    Borodulin, Aleksei
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (02): : 1361 - 1377
  • [24] Validation of a Natural Language Processing Algorithm for the Extraction of the Sleep Parameters from the Polysomnography Reports
    Rahman, Mahbubur
    Nowakowski, Sara
    Agrawal, Ritwick
    Naik, Aanand
    Sharafkhaneh, Amir
    Razjouyan, Javad
    HEALTHCARE, 2022, 10 (10)
  • [25] Harnessing Natural Language Processing for Structured Information Extraction from Radiology Reports in Crohn's Disease: A Nationwide Study From the epi-IIRN
    Hazan, L.
    Focht, G.
    Gavrielov, N.
    Reichart, R.
    Friss, C.
    Kuint, R. Cytter
    Turner, D.
    Freiman, M.
    JOURNAL OF CROHNS & COLITIS, 2024, 18 : I626 - I627
  • [26] Identifying Patients With Pulmonary Nodules From CT Radiology Reports Using Natural Language Processing (NLP)
    Dotson, T. L.
    Gasimova, A.
    Watkins, J.
    Chometon, Q.
    Bellinger, C. R.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2023, 207
  • [27] IDENTIFYING EROSIVE DISEASE FROM RADIOLOGY REPORTS OF VETERANS WITH INFLAMMATORY ARTHRITIS USING NATURAL LANGUAGE PROCESSING
    Penmetsa, G.
    Pei, S.
    Sauer, B.
    Walsh, J. A.
    Feng, B.
    Walker, J.
    Douglas, K.
    Clewell, J.
    ANNALS OF THE RHEUMATIC DISEASES, 2021, 80 : 353 - 354
  • [28] Automatic Lung Cancer Staging from Medical Reports Using Natural Language Processing
    Sui, X.
    Liu, T.
    Huang, Q.
    Hou, Y.
    Wang, Y.
    Kang, G.
    Guo, H.
    Li, N.
    Li, Y.
    Wang, Z.
    Wang, J.
    JOURNAL OF THORACIC ONCOLOGY, 2018, 13 (10) : S772 - S772
  • [29] Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
    Wadia, Roxanne
    Akgun, Kathleen
    Brandt, Cynthia
    Fenton, Brenda T.
    Levin, Woody
    Marple, Andrew H.
    Garla, Vijay
    Rose, Michal G.
    Taddei, Tamar
    Taylor, Caroline
    JCO CLINICAL CANCER INFORMATICS, 2018, 2 : 1 - 7
  • [30] Severity score extraction from clinical notes using natural language processing: Applications to dermatology
    Kumar, Vikas
    Rasouliyan, Lawrence
    Althoff, Amanda G.
    Chang, Stella
    Long, Stacey
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 125 - 125