Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing

被引:1
|
作者
Gandomi, Amir [1 ,2 ,7 ]
Hasan, Eusha [1 ,3 ]
Chusid, Jesse [1 ,3 ,4 ]
Paul, Subroto [1 ,3 ,5 ]
Inra, Matthew [1 ,3 ,5 ]
Makhnevich, Alex [1 ,2 ,3 ,4 ]
Raoof, Suhail [3 ,4 ,5 ]
Silvestri, Gerard [6 ]
Bade, Brett C. [1 ,2 ,3 ,5 ]
Cohen, Stuart L. [1 ,2 ,3 ,4 ]
机构
[1] Northwell, New Hyde Pk, NY USA
[2] Inst Hlth Syst Sci, Feinstein Inst Med Res, Manhasset, NY USA
[3] Donald & Barbara Zucker Sch Med Hofstra Northwell, Hempstead, NY USA
[4] North Shore Univ Hosp, Northwell, Manhasset, NY USA
[5] Lenox Hill Hosp, Northwell, New York, NY USA
[6] Med Univ South Carolina, Charleston, SC USA
[7] Hofstra Univ, Frank G Zarb Sch Business, Hempstead, NY USA
关键词
LC screening; Lung-RADS score; Follow-up; Manual entry; Natural language processing;
D O I
10.1016/j.ijmedinf.2024.105580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: Radiology scoring systems are critical to the success of lung cancer screening (LCS) programs, impacting patient care, adherence to follow-up, data management and reporting, and program evaluation. Lung CT Screening Reporting and Data System (Lung-RADS) is a structured radiology scoring system that provides recommendations for LCS follow-up that are utilized (a) in clinical care and (b) by LCS programs monitoring rates of adherence to follow-up. Thus, accurate reporting and reliable collection of Lung-RADS scores are fundamental components of LCS program evaluation and improvement. Unfortunately, due to variability in radiology reports, extraction of Lung-RADS scores is non-trivial, and best practices do not exist. The purpose of this project is to compare mechanisms to extract Lung-RADS scores from free-text radiology reports. Methods: We retrospectively analyzed reports of LCS low-dose computed tomography (LDCT) examinations performed at a multihospital integrated healthcare network in New York State between January 2016 and July 2023. We compared three methods of Lung-RADS score extraction: manual physician entry at time of report creation, manual LCS specialist entry after report creation, and an internally developed, rule-based natural language processing (NLP) algorithm. Accuracy, recall, precision, and completeness (i.e., the proportion of LCS exams to which a Lung-RADS score has been assigned) were compared between the three methods. Results: The dataset includes 24,060 LCS examinations on 14,243 unique patients. The mean patient age was 65 years, and most patients were male (54 %) and white (75 %). Completeness rate was 65 %, 68 %, and 99 % for radiologists' manual entry, LCS specialists' entry, and NLP algorithm, respectively. Accuracy, recall, and precision were high across all extraction methods (>94 %), though the NLP-based approach was consistently higher than both manual entries in all metrics. Discussion: An NLP-based method of LCS score determination is an efficient and more accurate means of extracting Lung-RADS scores than manual review and data entry. NLP-based methods should be considered best practice for extracting structured Lung-RADS scores from free-text radiology reports.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Automated Classification of Radiology Reports for Acute Lung Injury: Comparison of Keyword and Machine Learning Based Natural Language Processing Approaches
    Solti, Imre
    Cooke, Colin R.
    Xia, Fei
    Wurfel, Mark M.
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 308 - +
  • [42] General-Purpose Large Language Models Versus a Domain-Specific Natural Language Processing Tool for Label Extraction From Chest Radiograph Reports
    Savage, Cody H.
    Park, Hyoungsun
    Kwak, Kijung
    Smith, Andrew D.
    Rothenberg, Steven A.
    Parekh, Vishwa S.
    Doo, Florence X.
    Yi, Paul H.
    AMERICAN JOURNAL OF ROENTGENOLOGY, 2024, 222 (04)
  • [43] Anatomic stage extraction from medical reports of breast Cancer patients using natural language processing
    Pratiksha R. Deshmukh
    Rashmi Phalnikar
    Health and Technology, 2020, 10 : 1555 - 1570
  • [44] Anatomic stage extraction from medical reports of breast Cancer patients using natural language processing
    Deshmukh, Pratiksha R.
    Phalnikar, Rashmi
    HEALTH AND TECHNOLOGY, 2020, 10 (06) : 1555 - 1570
  • [45] NATURAL LANGUAGE PROCESSING ALLOWS FOR ACCURATE AND AUTOMATED EXTRACTION OF DATA FROM PROSTATE BIOPSY PATHOLOGY REPORTS
    Joice, Gregory
    Chee, Brant
    Gupta, Natasha
    Johnson, Michael
    JOURNAL OF UROLOGY, 2018, 199 (04): : E1025 - E1026
  • [46] Use of natural language processing techniques to predict patient selection for total hip and knee arthroplasty from radiology reports
    Farrow, L.
    Zhong, M.
    Anderson, L.
    BONE & JOINT JOURNAL, 2024, 106B (07): : 688 - 695
  • [47] A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records
    Emily Wheater
    Grant Mair
    Cathie Sudlow
    Beatrice Alex
    Claire Grover
    William Whiteley
    BMC Medical Informatics and Decision Making, 19
  • [48] A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records
    Wheater, Emily
    Mair, Grant
    Sudlow, Cathie
    Alex, Beatrice
    Grover, Claire
    Whiteley, William
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [49] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Shaina Raza
    Brian Schwartz
    BMC Medical Informatics and Decision Making, 23
  • [50] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Raza, Shaina
    Schwartz, Brian
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)