Independent evaluation of the accuracy of 5 artificial intelligence software for detecting lung nodules on chest X-rays

被引:0
|
作者
Arzamasov, Kirill [1 ,2 ]
Vasilev, Yuriy [1 ,3 ]
Zelenova, Maria [1 ]
Pestrenin, Lev [1 ]
Busygina, Yulia [1 ]
Bobrovskaya, Tatiana [1 ]
Chetverikov, Sergey [1 ]
Shikhmuradov, David [1 ]
Pankratov, Andrey [1 ]
Kirpichev, Yury [1 ]
Sinitsyn, Valentin [1 ,4 ]
Son, Irina [5 ]
Omelyanskaya, Olga [1 ]
机构
[1] Moscow Hlth Care Dept, State Budget Funded Hlth Care Inst City Moscow, Res & Pract Clin Ctr Diagnost & Telemed Technol, Petrovka str 24, Moscow 127051, Russia
[2] MIREA Russian Technol Univ, Vernadsky Ave 78, Moscow 119454, Russia
[3] Minist Hlth Russian Federat, Fed State Budgetary Inst, Natl Med & Surg Ctr, Moscow, Russia
[4] Lomonosov Moscow State Univ, Moscow, Russia
[5] Minist Healthcare Russian Federat, Fed State Budgetary Educ Inst Further Profess Educ, Russian Med Acad Continuous Profess Educ, Moscow, Russia
关键词
Chest X-ray (CXR); artificial intelligence (AI); lung nodules; radiology; computer vision;
D O I
10.21037/qims-24-160
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: The integration of artificial intelligence (AI) into medicine is growing, with some experts predicting its standalone use soon. However, skepticism remains due to limited positive outcomes from independent validations. This research evaluates AI software's effectiveness in analyzing chest X-rays (CXR) to identify lung nodules, a possible lung cancer indicator. Methods: This retrospective study analyzed 7,670,212 record pairs from radiological exams conducted between 2020 and 2022 during the Moscow Computer Vision Experiment, focusing on CXR and computed tomography (CT) scans. All images were acquired during clinical routine. The final dataset comprised 100 CXR images (50 with lung nodules, 50 without), selected consecutively and based on inclusion and exclusion criteria, to evaluate the performance of all five AI-based solutions, participating in the Moscow Computer Vision Experiment and analyzing CXR. The evaluation was performed in 3 stages. In the first stage, the probability of a nodule in the lung obtained from AI services was compared with the Ground Truth (1-there is a nodule, 0-there is no nodule). In the second stage, 3 radiologists evaluated the segmentation of nodules performed by the AI services (1-nodule correctly segmented, 0-nodule incorrectly segmented or not segmented at all). In the third stage, the same radiologists additionally evaluated the classification of the nodules (1-nodule correctly segmented and classified, 0-all other cases). The results obtained in stages 2 and 3 were compared with Ground Truth, which was common to all three stages. For each stage, diagnostic accuracy metrics were calculated for each AI service. Results: Three software solutions (Celsus, Lunit INSIGHT CXR, and qXR) demonstrated diagnostic metrics that matched or surpassed the vendor specifications, and achieved the highest area under the receiver operating characteristic curve (AUC) of 0.956 [95% confidence interval (CI): 0.918 to 0.994]. However, when evaluated by three radiologists for accurate nodule segmentation and classification, all solutions performed below the vendor-declared metrics, with the highest AUC reaching 0.812 (95% CI: 0.744 to 0.879). Meanwhile, all AI services demonstrated 100% specificity at stages 2 and 3 of the study. Conclusions: To ensure the reliability and applicability of AI-based software, it is crucial to validate performance metrics using high-quality datasets and engage radiologists in the evaluation process. Developers are recommended to improve the accuracy of the underlying models before allowing the standalone use of the software for lung nodule detection.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Potential of automated evaluation of chest X-rays by artificial intelligence for clinic and practice
    Wenzel, M.
    Chun, F. K. H.
    Hinz, O.
    Abdel-Karim, B.
    [J]. PNEUMOLOGE, 2020, 17 (01): : 59 - 64
  • [2] The accuracy of mobile teleradiology in the evaluation of chest X-rays
    Schwartz, Adam B.
    Siddiqui, Gina
    Barbieri, John S.
    Akhtar, Amana L.
    Kim, Woojin
    Littman-Quinn, Ryan
    Conant, Emily F.
    Gupta, Narainder K.
    Pukenas, Bryan A.
    Ramchandani, Parvati
    Lev-Toaff, Anna S.
    Tobey, Jennifer D.
    Torigian, Drew A.
    Praestgaard, Amy H.
    Kovarik, Carrie L.
    [J]. JOURNAL OF TELEMEDICINE AND TELECARE, 2014, 20 (08) : 460 - 463
  • [3] The added value of using artificial intelligence in adult chest X-rays for nodules and masses detection in daily radiology practice
    Farouk, Suzan
    Osman, Ahmed M.
    Awadallah, Shrouk M.
    Abdelrahman, Ahmed S.
    [J]. EGYPTIAN JOURNAL OF RADIOLOGY AND NUCLEAR MEDICINE, 2023, 54 (01):
  • [4] The added value of using artificial intelligence in adult chest X-rays for nodules and masses detection in daily radiology practice
    Suzan Farouk
    Ahmed M. Osman
    Shrouk M. Awadallah
    Ahmed S. Abdelrahman
    [J]. Egyptian Journal of Radiology and Nuclear Medicine, 54
  • [5] Do the nodules detected in chest X-rays always indicate a disease of lung parenchyma?
    Karapolat, Sami
    Turkyilmaz, Atila
    Buran, Alaaddin
    Tekinbas, Celal
    [J]. ANZ JOURNAL OF SURGERY, 2019, 89 (03) : E92 - E93
  • [6] Automated Localization of Lung Nodules from Chest X-rays With Deep Neural Networks
    Maksudov, B.
    Kiselev, S.
    Kholiavchenko, M.
    Mustafaev, T.
    Kuleev, R.
    Ibragimov, B.
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2020, 108 (03): : E294 - E294
  • [7] Interpretable artificial intelligence framework for COVID-19 screening on chest X-rays
    Tsiknakis, Nikos
    Trivizakis, Eleftherios
    Vassalou, Evangelia E.
    Papadakis, Georgios Z.
    Spandidos, Demetrios A.
    Tsatsakis, Aristidis
    Sanchez-Garcia, Jose
    Lopez-Gonzalez, Rafael
    Papanikolaou, Nikolaos
    Karantanas, Apostolos H.
    Marias, Kostas
    [J]. EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2020, 20 (02) : 727 - 735
  • [8] The rise of artificial intelligence reading of chest X-rays for enhanced TB diagnosis and elimination
    Geric, C.
    Qin, Z. Z.
    Denkinger, C. M.
    Kik, S. V.
    Marais, B.
    Anjos, A.
    David, P-M.
    Khan, F. Ahmad
    Trajman, A.
    [J]. INTERNATIONAL JOURNAL OF TUBERCULOSIS AND LUNG DISEASE, 2023, 27 (05) : 367 - 372
  • [9] Can Artificial Intelligence Substitute Right-Heart Catheterization With Chest X-Rays?
    Higaki, Akinori
    [J]. CANADIAN JOURNAL OF CARDIOLOGY, 2021, 37 (08) : 1298 - 1298
  • [10] Explainability Of Artificial Intelligence For Diagnosing COVID-19 From Chest X-Rays
    Goel, Abhishek
    Jogi, Sandeep Panwar
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 598 - 603