Independent evaluation of the accuracy of 5 artificial intelligence software for detecting lung nodules on chest X-rays

被引:0
|
作者
Arzamasov, Kirill [1 ,2 ]
Vasilev, Yuriy [1 ,3 ]
Zelenova, Maria [1 ]
Pestrenin, Lev [1 ]
Busygina, Yulia [1 ]
Bobrovskaya, Tatiana [1 ]
Chetverikov, Sergey [1 ]
Shikhmuradov, David [1 ]
Pankratov, Andrey [1 ]
Kirpichev, Yury [1 ]
Sinitsyn, Valentin [1 ,4 ]
Son, Irina [5 ]
Omelyanskaya, Olga [1 ]
机构
[1] Moscow Hlth Care Dept, State Budget Funded Hlth Care Inst City Moscow, Res & Pract Clin Ctr Diagnost & Telemed Technol, Petrovka str 24, Moscow 127051, Russia
[2] MIREA Russian Technol Univ, Vernadsky Ave 78, Moscow 119454, Russia
[3] Minist Hlth Russian Federat, Fed State Budgetary Inst, Natl Med & Surg Ctr, Moscow, Russia
[4] Lomonosov Moscow State Univ, Moscow, Russia
[5] Minist Healthcare Russian Federat, Fed State Budgetary Educ Inst Further Profess Educ, Russian Med Acad Continuous Profess Educ, Moscow, Russia
关键词
Chest X-ray (CXR); artificial intelligence (AI); lung nodules; radiology; computer vision;
D O I
10.21037/qims-24-160
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: The integration of artificial intelligence (AI) into medicine is growing, with some experts predicting its standalone use soon. However, skepticism remains due to limited positive outcomes from independent validations. This research evaluates AI software's effectiveness in analyzing chest X-rays (CXR) to identify lung nodules, a possible lung cancer indicator. Methods: This retrospective study analyzed 7,670,212 record pairs from radiological exams conducted between 2020 and 2022 during the Moscow Computer Vision Experiment, focusing on CXR and computed tomography (CT) scans. All images were acquired during clinical routine. The final dataset comprised 100 CXR images (50 with lung nodules, 50 without), selected consecutively and based on inclusion and exclusion criteria, to evaluate the performance of all five AI-based solutions, participating in the Moscow Computer Vision Experiment and analyzing CXR. The evaluation was performed in 3 stages. In the first stage, the probability of a nodule in the lung obtained from AI services was compared with the Ground Truth (1-there is a nodule, 0-there is no nodule). In the second stage, 3 radiologists evaluated the segmentation of nodules performed by the AI services (1-nodule correctly segmented, 0-nodule incorrectly segmented or not segmented at all). In the third stage, the same radiologists additionally evaluated the classification of the nodules (1-nodule correctly segmented and classified, 0-all other cases). The results obtained in stages 2 and 3 were compared with Ground Truth, which was common to all three stages. For each stage, diagnostic accuracy metrics were calculated for each AI service. Results: Three software solutions (Celsus, Lunit INSIGHT CXR, and qXR) demonstrated diagnostic metrics that matched or surpassed the vendor specifications, and achieved the highest area under the receiver operating characteristic curve (AUC) of 0.956 [95% confidence interval (CI): 0.918 to 0.994]. However, when evaluated by three radiologists for accurate nodule segmentation and classification, all solutions performed below the vendor-declared metrics, with the highest AUC reaching 0.812 (95% CI: 0.744 to 0.879). Meanwhile, all AI services demonstrated 100% specificity at stages 2 and 3 of the study. Conclusions: To ensure the reliability and applicability of AI-based software, it is crucial to validate performance metrics using high-quality datasets and engage radiologists in the evaluation process. Developers are recommended to improve the accuracy of the underlying models before allowing the standalone use of the software for lung nodule detection.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A review on lung boundary detection in chest X-rays
    Sema Candemir
    Sameer Antani
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 563 - 576
  • [22] Diagnostic Accuracy and Performance of Artificial Intelligence in Detecting Lung Nodules in Patients With Complex Lung Disease A Noninferiority Study
    Abadia, Andres F.
    Yacoub, Basel
    Stringer, Natalie
    Snoddy, Madalyn
    Kocher, Madison
    Schoepf, U. Joseph
    Aquino, Gilberto J.
    Kabakus, Ismail
    Dargis, Danielle
    Hoelzer, Philipp
    Sperl, Jonathan, I
    Sahbaee, Pooyan
    Vingiani, Vincenzo
    Mercer, Megan
    Burt, Jeremy R.
    [J]. JOURNAL OF THORACIC IMAGING, 2022, 37 (03) : 154 - 161
  • [23] Diagnostic Accuracy of Initial Chest X-Rays in Thorax Trauma
    Tataroglu, Ozlem
    Erdogan, Secil Telli
    Erdogan, Mehmet Ozgur
    Tayfur, Ismail
    Afacan, Mustafa Ahmet
    Yavuz, Burcu Genc
    Colak, Sahin
    [J]. JCPSP-JOURNAL OF THE COLLEGE OF PHYSICIANS AND SURGEONS PAKISTAN, 2018, 28 (07): : 546 - 548
  • [24] Application of artificial intelligence in diagnosing COVID-19 disease symptoms on chest X-rays: A review
    Kufel, Jakub
    Bargiel, Katarzyna
    Kozlik, Maciej
    Czogalik, Lukasz
    Dudek, Piotr
    Jaworski, Aleksander
    Cebula, Maciej
    Gruszczynska, Katarzyna
    [J]. INTERNATIONAL JOURNAL OF MEDICAL SCIENCES, 2022, 19 (12): : 1743 - 1752
  • [25] Automatic detection of COVID-19 pneumonia through artificial intelligence applied to chest X-rays
    Anorbe, E.
    Bereciartua, A.
    Aisa, P.
    Garcia-Tejedor, A.
    Valero, A.
    Picon, A.
    Palomares, T.
    Garrote, E.
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2022, 60
  • [26] Detecting Coronavirus from Chest X-rays Using Transfer Learning
    Badawi, Abeer
    Elgazzar, Khalid
    [J]. COVID, 2021, 1 (01): : 403 - 415
  • [27] DETECTING SHORTCUTS IN MEDICAL IMAGES - A CASE STUDY IN CHEST X-RAYS
    Jimenez-Sanchez, Amelia
    Juodelyte, Dovile
    Chamberlain, Bethany
    Cheplygina, Veronika
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [28] CHEST X-RAYS AND FULL LUNG TOMOGRAMS IN GYNECOLOGIC MALIGNANCY
    GORDON, RE
    METTLER, FA
    WICKS, JD
    BARTOW, SA
    [J]. CANCER, 1983, 52 (03) : 559 - 562
  • [29] Lung Ultrasound and Chest X-Rays: Together to Improve the Diagnosis
    Sperandeo, Marco
    Rea, Gaetano
    Filabozzi, Paola
    Carnevale, Vincenzo
    [J]. RESPIRATION, 2017, 93 (03) : 226 - 227
  • [30] Development and assessment of artificial intelligence detection of lung nodules on chest roentgenograms
    Higuchi, M.
    Nagata, T.
    Suzuki, J.
    Yabuki, T.
    Inomata, S.
    Suzuki, H.
    [J]. JOURNAL OF THORACIC ONCOLOGY, 2023, 18 (04) : S101 - S101