Gross failure rates and failure modes for a commercial AI-based auto-segmentation algorithm in head and neck cancer patients

被引：2

作者：

Temple, Simon W. P. ^{[1
]}

Rowbottom, Carl G. ^{[1
,2
]}

机构：

[1] Clatterbridge Canc Ctr NHS Fdn Trust, Med Phys Dept, Liverpool, England

[2] Univ Liverpool, Dept Phys, Liverpool, England

来源：

JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS | 2024年 / 25卷 / 06期

关键词：

auto-segmentation; deep learning; failure modes; INTEROBSERVER VARIABILITY; DELINEATION; ORGANS; RISK; IMPLEMENTATION; ONCOLOGY; QUALITY;

D O I：

10.1002/acm2.14273

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

PurposeArtificial intelligence (AI) based commercial software can be used to automatically delineate organs at risk (OAR), with potential for efficiency savings in the radiotherapy treatment planning pathway, and reduction of inter- and intra-observer variability. There has been little research investigating gross failure rates and failure modes of such systems.Method50 head and neck (H&N) patient data sets with "gold standard" contours were compared to AI-generated contours to produce expected mean and standard deviation values for the Dice Similarity Coefficient (DSC), for four common H&N OARs (brainstem, mandible, left and right parotid). An AI-based commercial system was applied to 500 H&N patients. AI-generated contours were compared to manual contours, outlined by an expert human, and a gross failure was set at three standard deviations below the expected mean DSC. Failures were inspected to assess reason for failure of the AI-based system with failures relating to suboptimal manual contouring censored. True failures were classified into 4 sub-types (setup position, anatomy, image artefacts and unknown).ResultsThere were 24 true failures of the AI-based commercial software, a gross failure rate of 1.2%. Fifteen failures were due to patient anatomy, four were due to dental image artefacts, three were due to patient position and two were unknown. True failure rates by OAR were 0.4% (brainstem), 2.2% (mandible), 1.4% (left parotid) and 0.8% (right parotid).ConclusionTrue failures of the AI-based system were predominantly associated with a non-standard element within the CT scan. It is likely that these non-standard elements were the reason for the gross failure, and suggests that patient datasets used to train the AI model did not contain sufficient heterogeneity of data. Regardless of the reasons for failure, the true failure rate for the AI-based system in the H&N region for the OARs investigated was low (similar to 1%).

引用

页数：10

共 50 条

[41] Cascaded deep learning-based auto-segmentation for head and neck cancer patients: Organs at risk on T2-weighted magnetic resonance imaging
Korte, James C.
Hardcastle, Nicholas
Ng, Sweet Ping
Clark, Brett
Kron, Tomas
Jackson, Price
MEDICAL PHYSICS, 2021, 48 (12) : 7757 - 7772
[42] Evaluation of three AI-based CT auto-contouring systems for head&neck, thorax and pelvis
Casati, M.
Loi, M.
Arilli, C.
Marrazzo, L.
Talamonti, C.
Zani, M.
Compagnucci, A.
Simontacchi, G.
Di Cataldo, V.
Desideri, I.
Bonomo, P.
Franza, N.
Raspanti, D.
Pellegrini, R.
Livi, L.
Pallotta, S.
RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S276 - S278
[43] Validation of a multi-atlas based auto-segmentation of the heart in breast cancer patients
Van Dijk-Peters, F. B. J.
Sijtsema, N. M.
Kierkels, R. G. J.
Vliegenthart, R.
Langendijk, J. A.
Maduro, J. H.
Crijns, A. P. G.
RADIOTHERAPY AND ONCOLOGY, 2015, 115 : S132 - S133
[44] Auto-segmentation of organs at risk for head and neck radiotherapy planning: From atlas-based to deep learning methods
Vrtovec, Tomaz
Mocnik, Domen
Strojan, Primoz
Pernus, Franjo
Ibragimov, Bulat
MEDICAL PHYSICS, 2020, 47 (09) : E929 - E950
[45] A Preliminary Experience of Implementing Deep-Learning Based Auto-Segmentation in Head and Neck Cancer: A Study on Real-World Clinical Cases
Zhong, Yang
Yang, Yanju
Fang, Yingtao
Wang, Jiazhou
Hu, Weigang
FRONTIERS IN ONCOLOGY, 2021, 11
[46] Atlas based auto-segmentation of CT images: Clinical evaluation of using auto-contouring in high-dose, high-precision radiotherapy of cancer in the head and neck
Levendag, P. C.
Hoogeman, M.
Teguh, D.
Wolf, T.
Hibbard, L.
Wijers, O.
Heijmen, B.
Nowak, P.
Vasquez-Osorio, E.
Han, X.
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2008, 72 (01): : S401 - S401
[47] Comparative Clinical Evaluation Of Deep-Learning-Based Algorithms In Auto-Segmentation Of Organs-At-Risk For Head And Neck Cancers
Liu, A.
Li, R.
Han, C.
Du, D.
Sampath, S.
Amini, A.
Glaser, S. M.
Wong, J. Y. C.
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2020, 108 (03): : E817 - E817
[48] Auto-segmentation of normal and target structures in head and neck CT images: A feature-driven model-based approach
Qazi, Arish A.
Pekar, Vladimir
Kim, John
Xie, Jason
Breen, Stephen L.
Jaffray, David A.
MEDICAL PHYSICS, 2011, 38 (11) : 6160 - 6170
[49] Geometric and dosimetric evaluation of atlas based auto-segmentation of cardiac structures in breast cancer patients
Kaderka, Robert
Gillespie, Erin F.
Mundt, Robert C.
Bryant, Alex K.
Sanudo-Thomas, Camila B.
Harrison, Anna L.
Wouters, Emilie L.
Moiseenko, Vitali
Moore, Kevin L.
Atwood, Todd F.
Murphy, James D.
RADIOTHERAPY AND ONCOLOGY, 2019, 131 : 215 - 220
[50] Accurate and robust auto-segmentation of head and neck organ-at-risks based on a novel CNN fine-tuning workflow
Luan, Shunyao
Wu, Kun
Wu, Yuan
Zhu, Benpeng
Wei, Wei
Xue, Xudong
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (01):

← 1 2 3 4 5 →