AI-based improvement in lung cancer detection on chest radiographs: results of a multi-reader study in NLST dataset

被引:35
|
作者
Yoo, Hyunsuk [1 ]
Lee, Sang Hyup [1 ]
Arru, Chiara Daniela [2 ,3 ]
Doda Khera, Ruhani [2 ,3 ]
Singh, Ramandeep [2 ,3 ]
Siebert, Sean [2 ,3 ]
Kim, Dohoon [4 ]
Lee, Yuna [4 ]
Park, Ju Hyun [5 ]
Eom, Hye Joung [6 ]
Digumarthy, Subba R. [2 ,3 ]
Kalra, Mannudeep K. [2 ,3 ]
机构
[1] Lunit, Seoul, South Korea
[2] Massachusetts Gen Hosp, Div Thorac Imaging, Dept Radiol, 75 Blossom Court, Boston, MA 02114 USA
[3] Harvard Med Sch, Boston, MA 02115 USA
[4] Seoul Natl Univ, Dept Radiol, Coll Med, Seoul, South Korea
[5] Sungkyunkwan Univ, Sch Med, Kangbuk Samsung Hosp, Suwon Total Healthcare Ctr, Youngin Si 16954, Gyeongi Do, South Korea
[6] Cheju Halla Gen Hosp, 65 Doryeong Ro, Jeju Si, Jeju Do, South Korea
关键词
Radiography; Lung cancer; Mass screening; Artificial intelligence; Computer-assisted radiographic image interpretation; SPECIFICITY; SENSITIVITY;
D O I
10.1007/s00330-021-08074-7
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objective Assess if deep learning-based artificial intelligence (AI) algorithm improves reader performance for lung cancer detection on chest X-rays (CXRs). Methods This reader study included 173 images from cancer-positive patients (n = 98) and 346 images from cancer-negative patients (n = 196) selected from National Lung Screening Trial (NLST). Eight readers, including three radiology residents, and five board-certified radiologists, participated in the observer performance test. AI algorithm provided image-level probability of pulmonary nodule or mass on CXRs and a heatmap of detected lesions. Reader performance was compared with AUC, sensitivity, specificity, false-positives per image (FPPI), and rates of chest CT recommendations. Results With AI, the average sensitivity of readers for the detection of visible lung cancer increased for residents, but was similar for radiologists compared to that without AI (0.61 [95% CI, 0.55-0.67] vs. 0.72 [95% CI, 0.66-0.77], p = 0.016 for residents, and 0.76 [95% CI, 0.72-0.81] vs. 0.76 [95% CI, 0.72-0.81, p = 1.00 for radiologists), while false-positive findings per image (FPPI) was similar for residents, but decreased for radiologists (0.15 [95% CI, 0.11-0.18] vs. 0.12 [95% CI, 0.09-0.16], p = 0.13 for residents, and 0.24 [95% CI, 0.20-0.29] vs. 0.17 [95% CI, 0.13-0.20], p < 0.001 for radiologists). With AI, the average rate of chest CT recommendation in patients positive for visible cancer increased for residents, but was similar for radiologists (54.7% [95% CI, 48.2-61.2%] vs. 70.2% [95% CI, 64.2-76.2%], p < 0.001 for residents and 72.5% [95% CI, 68.0-77.1%] vs. 73.9% [95% CI, 69.4-78.3%], p = 0.68 for radiologists), while that in cancer-negative patients was similar for residents, but decreased for radiologists (11.2% [95% CI, 9.6-13.1%] vs. 9.8% [95% CI, 8.0-11.6%], p = 0.32 for residents and 16.4% [95% CI, 14.7-18.2%] vs. 11.7% [95% CI, 10.2-13.3%], p < 0.001 for radiologists). Conclusions AI algorithm can enhance the performance of readers for the detection of lung cancers on chest radiographs when used as second reader.
引用
收藏
页码:9664 / 9674
页数:11
相关论文
共 50 条
  • [1] AI-based improvement in lung cancer detection on chest radiographs: results of a multi-reader study in NLST dataset
    Hyunsuk Yoo
    Sang Hyup Lee
    Chiara Daniela Arru
    Ruhani Doda Khera
    Ramandeep Singh
    Sean Siebert
    Dohoon Kim
    Yuna Lee
    Ju Hyun Park
    Hye Joung Eom
    Subba R. Digumarthy
    Mannudeep K. Kalra
    [J]. European Radiology, 2021, 31 : 9664 - 9674
  • [2] Can incorrect artificial intelligence (AI) results impact radiologists, and if so, what can we do about it? A multi-reader pilot study of lung cancer detection with chest radiography
    Bernstein, Michael H.
    Atalay, Michael K.
    Dibble, Elizabeth H.
    Maxwell, Aaron W. P.
    Karam, Adib R.
    Agarwal, Saurabh
    Ward, Robert C.
    Healey, Terrance T.
    Baird, Grayson L.
    [J]. EUROPEAN RADIOLOGY, 2023, 33 (11) : 8263 - 8269
  • [3] Can incorrect artificial intelligence (AI) results impact radiologists, and if so, what can we do about it? A multi-reader pilot study of lung cancer detection with chest radiography
    Michael H. Bernstein
    Michael K. Atalay
    Elizabeth H. Dibble
    Aaron W. P. Maxwell
    Adib R. Karam
    Saurabh Agarwal
    Robert C. Ward
    Terrance T. Healey
    Grayson L. Baird
    [J]. European Radiology, 2023, 33 : 8263 - 8269
  • [4] Improving traumatic fracture detection on radiographs with artificial intelligence support: a multi-reader study
    Bachmann, Rikke
    Gunes, Gozde
    Hangaard, Stine
    Nexmann, Andreas
    Lisouski, Pavel
    Boesen, Mikael
    Lundemann, Michael
    Baginski, Scott G.
    [J]. BJR OPEN, 2024, 6 (01):
  • [5] Early Detection of Heart Failure with Autonomous AI-Based Model Using Chest Radiographs: A Multicenter Study
    Garza-Frias, Emiliano
    Kaviani, Parisa
    Karout, Lina
    Fahimi, Roshan
    Hosseini, Seyedehelaheh
    Putha, Preetham
    Tadepalli, Manoj
    Kiran, Sai
    Arora, Charu
    Robert, Dennis
    Bizzo, Bernardo
    Dreyer, Keith J.
    Kalra, Mannudeep K.
    Digumarthy, Subba R.
    [J]. DIAGNOSTICS, 2024, 14 (15)
  • [6] STRAMPN: Histopathological image dataset for ovarian cancer detection incorporating AI-based methods
    Samridhi Singh
    Malti Kumari Maurya
    Nagendra Pratap Singh
    [J]. Multimedia Tools and Applications, 2024, 83 : 28175 - 28196
  • [7] STRAMPN: Histopathological image dataset for ovarian cancer detection incorporating AI-based methods
    Singh, Samridhi
    Maurya, Malti Kumari
    Singh, Nagendra Pratap
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 28175 - 28196
  • [8] Coronary risk assessment at X-ray dose equivalent ungated chest CT: Results of a multi-reader study
    Messerli, Michael
    Hechelhammer, Lukas
    Leschka, Sebastian
    Warschkow, Rene
    Wildermuth, Simon
    Bauer, Ralf W.
    [J]. CLINICAL IMAGING, 2018, 49 : 73 - 79
  • [9] Confidence-Aware Severity Assessment of Lung Disease from Chest X-Rays Using Deep Neural Network on a Multi-Reader Dataset
    Zandehshahvar, Mohammadreza
    van Assen, Marly
    Kim, Eun
    Kiarashi, Yashar
    Keerthipati, Vikranth
    Tessarin, Giovanni
    Muscogiuri, Emanuele
    Stillman, Arthur E.
    Filev, Peter
    Davarpanah, Amir H.
    Berkowitz, Eugene A.
    Tigges, Stefan
    Lee, Scott J.
    Vey, Brianna L.
    De Cecco, Carlo
    Adibi, Ali
    [J]. JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024,
  • [10] Artificial intelligence-assisted diagnosis of congenital heart disease and associated pulmonary arterial hypertension from chest radiographs: A multi-reader multi-case study
    Han, Pei-Lun
    Jiang, Lei
    Cheng, Jun-Long
    Shi, Ke
    Huang, Shan
    Jiang, Yu
    Jiang, Li
    Xia, Qing
    Li, Yi-Yue
    Zhu, Min
    Li, Kang
    Yang, Zhi-Gang
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2024, 171