Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing

被引:1
|
作者
Gandomi, Amir [1 ,2 ,7 ]
Hasan, Eusha [1 ,3 ]
Chusid, Jesse [1 ,3 ,4 ]
Paul, Subroto [1 ,3 ,5 ]
Inra, Matthew [1 ,3 ,5 ]
Makhnevich, Alex [1 ,2 ,3 ,4 ]
Raoof, Suhail [3 ,4 ,5 ]
Silvestri, Gerard [6 ]
Bade, Brett C. [1 ,2 ,3 ,5 ]
Cohen, Stuart L. [1 ,2 ,3 ,4 ]
机构
[1] Northwell, New Hyde Pk, NY USA
[2] Inst Hlth Syst Sci, Feinstein Inst Med Res, Manhasset, NY USA
[3] Donald & Barbara Zucker Sch Med Hofstra Northwell, Hempstead, NY USA
[4] North Shore Univ Hosp, Northwell, Manhasset, NY USA
[5] Lenox Hill Hosp, Northwell, New York, NY USA
[6] Med Univ South Carolina, Charleston, SC USA
[7] Hofstra Univ, Frank G Zarb Sch Business, Hempstead, NY USA
关键词
LC screening; Lung-RADS score; Follow-up; Manual entry; Natural language processing;
D O I
10.1016/j.ijmedinf.2024.105580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: Radiology scoring systems are critical to the success of lung cancer screening (LCS) programs, impacting patient care, adherence to follow-up, data management and reporting, and program evaluation. Lung CT Screening Reporting and Data System (Lung-RADS) is a structured radiology scoring system that provides recommendations for LCS follow-up that are utilized (a) in clinical care and (b) by LCS programs monitoring rates of adherence to follow-up. Thus, accurate reporting and reliable collection of Lung-RADS scores are fundamental components of LCS program evaluation and improvement. Unfortunately, due to variability in radiology reports, extraction of Lung-RADS scores is non-trivial, and best practices do not exist. The purpose of this project is to compare mechanisms to extract Lung-RADS scores from free-text radiology reports. Methods: We retrospectively analyzed reports of LCS low-dose computed tomography (LDCT) examinations performed at a multihospital integrated healthcare network in New York State between January 2016 and July 2023. We compared three methods of Lung-RADS score extraction: manual physician entry at time of report creation, manual LCS specialist entry after report creation, and an internally developed, rule-based natural language processing (NLP) algorithm. Accuracy, recall, precision, and completeness (i.e., the proportion of LCS exams to which a Lung-RADS score has been assigned) were compared between the three methods. Results: The dataset includes 24,060 LCS examinations on 14,243 unique patients. The mean patient age was 65 years, and most patients were male (54 %) and white (75 %). Completeness rate was 65 %, 68 %, and 99 % for radiologists' manual entry, LCS specialists' entry, and NLP algorithm, respectively. Accuracy, recall, and precision were high across all extraction methods (>94 %), though the NLP-based approach was consistently higher than both manual entries in all metrics. Discussion: An NLP-based method of LCS score determination is an efficient and more accurate means of extracting Lung-RADS scores than manual review and data entry. NLP-based methods should be considered best practice for extracting structured Lung-RADS scores from free-text radiology reports.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] ChatGPT vs. Gemini: Comparative accuracy and efficiency in Lung-RADS score assignment from radiology reports
    Singh, Ria
    Hamouda, Mohamed
    Chamberlin, Jordan H.
    Toth, Adrienn
    Munford, James
    Silbergleit, Matthew
    Baruah, Dhiraj
    Burt, Jeremy R.
    Kabakus, Ismail M.
    CLINICAL IMAGING, 2025, 121
  • [2] Automatic Lung-RADS™ classification with a natural language processing system
    Beyer, Sebastian E.
    McKee, Brady J.
    Regis, Shawn M.
    McKee, Andrea B.
    Flacke, Sebastian
    El Saadawi, Gilan
    Wald, Christoph
    JOURNAL OF THORACIC DISEASE, 2017, 9 (09) : 3114 - +
  • [3] Automated Extraction of BI-RADS Final Assessment Categories from Radiology Reports with Natural Language Processing
    Dorothy A. Sippo
    Graham I. Warden
    Katherine P. Andriole
    Ronilda Lacson
    Ichiro Ikuta
    Robyn L. Birdwell
    Ramin Khorasani
    Journal of Digital Imaging, 2013, 26 : 989 - 994
  • [4] Automated Extraction of BI-RADS Final Assessment Categories from Radiology Reports with Natural Language Processing
    Sippo, Dorothy A.
    Warden, Graham I.
    Andriole, Katherine P.
    Lacson, Ronilda
    Ikuta, Ichiro
    Birdwell, Robyn L.
    Khorasani, Ramin
    JOURNAL OF DIGITAL IMAGING, 2013, 26 (05) : 989 - 994
  • [5] Automatic Extraction of Major Osteoporotic Fractures from Radiology Reports using Natural Language Processing
    Wang, Yanshan
    Mehrabi, Saeed
    Sohn, Sunghwan
    Atkinson, Elizabeth
    Amin, Shreyasee
    Liu, Hongfang
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS WORKSHOPS (ICHI-W), 2018, : 64 - 65
  • [6] Natural Language Processing to identify pneumonia from radiology reports
    Dublin, Sascha
    Baldwin, Eric
    Walker, Rod L.
    Christensen, Lee M.
    Haug, Peter J.
    Jackson, Michael L.
    Nelson, Jennifer C.
    Ferraro, Jeffrey
    Carrell, David
    Chapman, Wendy W.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 (08) : 834 - 841
  • [7] Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing
    Callen, Andrew L.
    Dupont, Sara M.
    Price, Adi
    Laguna, Ben
    McCoy, David
    Do, Bao
    Talbott, Jason
    Kohli, Marc
    Narvid, Jared
    JOURNAL OF DIGITAL IMAGING, 2020, 33 (05) : 1194 - 1201
  • [8] Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing
    Andrew L. Callen
    Sara M. Dupont
    Adi Price
    Ben Laguna
    David McCoy
    Bao Do
    Jason Talbott
    Marc Kohli
    Jared Narvid
    Journal of Digital Imaging, 2020, 33 : 1194 - 1201
  • [9] Identification of gallstones from radiology reports using natural language processing
    Fairfield, Cameron
    Ots, Riinu
    Antai, Roseline
    Drake, Tom
    Knight, Stephen
    Wigmore, Stephen
    Harrison, Ewen
    BRITISH JOURNAL OF SURGERY, 2018, 105 : 58 - 58
  • [10] ChatGPT vs Gemini: Comparative Accuracy and Efficiency in CAD-RADS Score Assignment from Radiology Reports
    Silbergleit, Matthew
    Toth, Adrienn
    Chamberlin, Jordan H.
    Hamouda, Mohamed
    Baruah, Dhiraj
    Derrick, Sydney
    Schoepf, U. Joseph
    Burt, Jeremy R.
    Kabakus, Ismail M.
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024,