Independent evaluation of the accuracy of 5 artificial intelligence software for detecting lung nodules on chest X-rays

被引:0
|
作者
Arzamasov, Kirill [1 ,2 ]
Vasilev, Yuriy [1 ,3 ]
Zelenova, Maria [1 ]
Pestrenin, Lev [1 ]
Busygina, Yulia [1 ]
Bobrovskaya, Tatiana [1 ]
Chetverikov, Sergey [1 ]
Shikhmuradov, David [1 ]
Pankratov, Andrey [1 ]
Kirpichev, Yury [1 ]
Sinitsyn, Valentin [1 ,4 ]
Son, Irina [5 ]
Omelyanskaya, Olga [1 ]
机构
[1] Moscow Hlth Care Dept, State Budget Funded Hlth Care Inst City Moscow, Res & Pract Clin Ctr Diagnost & Telemed Technol, Petrovka str 24, Moscow 127051, Russia
[2] MIREA Russian Technol Univ, Vernadsky Ave 78, Moscow 119454, Russia
[3] Minist Hlth Russian Federat, Fed State Budgetary Inst, Natl Med & Surg Ctr, Moscow, Russia
[4] Lomonosov Moscow State Univ, Moscow, Russia
[5] Minist Healthcare Russian Federat, Fed State Budgetary Educ Inst Further Profess Educ, Russian Med Acad Continuous Profess Educ, Moscow, Russia
关键词
Chest X-ray (CXR); artificial intelligence (AI); lung nodules; radiology; computer vision;
D O I
10.21037/qims-24-160
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: The integration of artificial intelligence (AI) into medicine is growing, with some experts predicting its standalone use soon. However, skepticism remains due to limited positive outcomes from independent validations. This research evaluates AI software's effectiveness in analyzing chest X-rays (CXR) to identify lung nodules, a possible lung cancer indicator. Methods: This retrospective study analyzed 7,670,212 record pairs from radiological exams conducted between 2020 and 2022 during the Moscow Computer Vision Experiment, focusing on CXR and computed tomography (CT) scans. All images were acquired during clinical routine. The final dataset comprised 100 CXR images (50 with lung nodules, 50 without), selected consecutively and based on inclusion and exclusion criteria, to evaluate the performance of all five AI-based solutions, participating in the Moscow Computer Vision Experiment and analyzing CXR. The evaluation was performed in 3 stages. In the first stage, the probability of a nodule in the lung obtained from AI services was compared with the Ground Truth (1-there is a nodule, 0-there is no nodule). In the second stage, 3 radiologists evaluated the segmentation of nodules performed by the AI services (1-nodule correctly segmented, 0-nodule incorrectly segmented or not segmented at all). In the third stage, the same radiologists additionally evaluated the classification of the nodules (1-nodule correctly segmented and classified, 0-all other cases). The results obtained in stages 2 and 3 were compared with Ground Truth, which was common to all three stages. For each stage, diagnostic accuracy metrics were calculated for each AI service. Results: Three software solutions (Celsus, Lunit INSIGHT CXR, and qXR) demonstrated diagnostic metrics that matched or surpassed the vendor specifications, and achieved the highest area under the receiver operating characteristic curve (AUC) of 0.956 [95% confidence interval (CI): 0.918 to 0.994]. However, when evaluated by three radiologists for accurate nodule segmentation and classification, all solutions performed below the vendor-declared metrics, with the highest AUC reaching 0.812 (95% CI: 0.744 to 0.879). Meanwhile, all AI services demonstrated 100% specificity at stages 2 and 3 of the study. Conclusions: To ensure the reliability and applicability of AI-based software, it is crucial to validate performance metrics using high-quality datasets and engage radiologists in the evaluation process. Developers are recommended to improve the accuracy of the underlying models before allowing the standalone use of the software for lung nodule detection.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] ROLE OF SPIROMETRY AND CHEST X-RAYS IN THE EVALUATION OF PULMONARY OUTPATIENTS
    OWENS, MW
    KINASEWITZ, GT
    LAMBERT, RS
    MATTHEWS, WH
    PAYNE, DK
    [J]. CLINICAL RESEARCH, 1986, 34 (01): : A227 - A227
  • [42] Clinical evaluation of wavelet compression of digitized chest X-rays
    Erickson, BJ
    Manduca, A
    Persons, KR
    [J]. IMAGE DISPLAY - MEDICAL IMAGING 1997, 1997, 3031 : 312 - 317
  • [43] EVOLVING DEEP ENSEMBLES FOR DETECTING COVID-19 IN CHEST X-RAYS
    Bosowski, Piotr
    Bosowska, Joanna
    Nalepa, Jakub
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3772 - 3776
  • [44] Identifying patients at risk for osteoporosis using artificial intelligence on X-rays
    Dasilva, Chris
    Teitel, Jack
    Sani, Numair
    Mitten, David
    [J]. QUALITY OF LIFE RESEARCH, 2018, 27 : S10 - S10
  • [45] MRI Seen as Alternative to Chest X-Rays in Children With Lung Disease
    Nahlen, Dawn Drennan
    [J]. NEWBORN AND INFANT NURSING REVIEWS, 2008, 8 (02) : C27 - C28
  • [46] Negative chest X-rays in primary care patients with lung cancer
    Stapley, Sally
    Sharp, Deborah
    Hamilton, William
    [J]. BRITISH JOURNAL OF GENERAL PRACTICE, 2006, 56 (529): : 570 - 573
  • [47] Chest X-rays and Lung Ultrasound Are Not Interchangeable in Intensive Care Practice
    Schmidt, Stefan
    Behnke, Nico
    Dieks, Jana-Katharina
    [J]. DIAGNOSTICS, 2024, 14 (01)
  • [48] A STUDY OF VARIABILITY IN INTERPRETATION OF CHEST X-RAYS IN DETECTION OF LUNG CANCER
    LILIENFELD, AM
    KORDAN, B
    [J]. CANCER RESEARCH, 1966, 26 (10) : 2145 - +
  • [49] Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence-A Systematic Review and Meta-Analysis
    Kufel, Jakub
    Czogalik, Lukasz
    Bielowka, Michal
    Magiera, Mikolaj
    Mitrega, Adam
    Dudek, Piotr
    Bargiel-Laczek, Katarzyna
    Stencel, Magdalena
    Bartnikowska, Wiktoria
    Mielcarska, Sylwia
    Modlinska, Sandra
    Nawrat, Zbigniew
    Cebula, Maciej
    Gruszczynska, Katarzyna
    [J]. JOURNAL OF CLINICAL MEDICINE, 2024, 13 (16)
  • [50] Artificial Intelligence for Early Detection of Chest Nodules in X-ray Images
    Chiu, Hwa-Yen
    Peng, Rita Huan-Ting
    Lin, Yi-Chian
    Wang, Ting-Wei
    Yang, Ya-Xuan
    Chen, Ying-Ying
    Wu, Mei-Han
    Shiao, Tsu-Hui
    Chao, Heng-Sheng
    Chen, Yuh-Min
    Wu, Yu-Te
    [J]. BIOMEDICINES, 2022, 10 (11)