Gross failure rates and failure modes for a commercial AI-based auto-segmentation algorithm in head and neck cancer patients

被引:2
|
作者
Temple, Simon W. P. [1 ]
Rowbottom, Carl G. [1 ,2 ]
机构
[1] Clatterbridge Canc Ctr NHS Fdn Trust, Med Phys Dept, Liverpool, England
[2] Univ Liverpool, Dept Phys, Liverpool, England
来源
关键词
auto-segmentation; deep learning; failure modes; INTEROBSERVER VARIABILITY; DELINEATION; ORGANS; RISK; IMPLEMENTATION; ONCOLOGY; QUALITY;
D O I
10.1002/acm2.14273
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeArtificial intelligence (AI) based commercial software can be used to automatically delineate organs at risk (OAR), with potential for efficiency savings in the radiotherapy treatment planning pathway, and reduction of inter- and intra-observer variability. There has been little research investigating gross failure rates and failure modes of such systems.Method50 head and neck (H&N) patient data sets with "gold standard" contours were compared to AI-generated contours to produce expected mean and standard deviation values for the Dice Similarity Coefficient (DSC), for four common H&N OARs (brainstem, mandible, left and right parotid). An AI-based commercial system was applied to 500 H&N patients. AI-generated contours were compared to manual contours, outlined by an expert human, and a gross failure was set at three standard deviations below the expected mean DSC. Failures were inspected to assess reason for failure of the AI-based system with failures relating to suboptimal manual contouring censored. True failures were classified into 4 sub-types (setup position, anatomy, image artefacts and unknown).ResultsThere were 24 true failures of the AI-based commercial software, a gross failure rate of 1.2%. Fifteen failures were due to patient anatomy, four were due to dental image artefacts, three were due to patient position and two were unknown. True failure rates by OAR were 0.4% (brainstem), 2.2% (mandible), 1.4% (left parotid) and 0.8% (right parotid).ConclusionTrue failures of the AI-based system were predominantly associated with a non-standard element within the CT scan. It is likely that these non-standard elements were the reason for the gross failure, and suggests that patient datasets used to train the AI model did not contain sufficient heterogeneity of data. Regardless of the reasons for failure, the true failure rate for the AI-based system in the H&N region for the OARs investigated was low (similar to 1%).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Cascaded deep learning-based auto-segmentation for head and neck cancer patients: Organs at risk on T2-weighted magnetic resonance imaging
    Korte, James C.
    Hardcastle, Nicholas
    Ng, Sweet Ping
    Clark, Brett
    Kron, Tomas
    Jackson, Price
    MEDICAL PHYSICS, 2021, 48 (12) : 7757 - 7772
  • [42] Evaluation of three AI-based CT auto-contouring systems for head&neck, thorax and pelvis
    Casati, M.
    Loi, M.
    Arilli, C.
    Marrazzo, L.
    Talamonti, C.
    Zani, M.
    Compagnucci, A.
    Simontacchi, G.
    Di Cataldo, V.
    Desideri, I.
    Bonomo, P.
    Franza, N.
    Raspanti, D.
    Pellegrini, R.
    Livi, L.
    Pallotta, S.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S276 - S278
  • [43] Validation of a multi-atlas based auto-segmentation of the heart in breast cancer patients
    Van Dijk-Peters, F. B. J.
    Sijtsema, N. M.
    Kierkels, R. G. J.
    Vliegenthart, R.
    Langendijk, J. A.
    Maduro, J. H.
    Crijns, A. P. G.
    RADIOTHERAPY AND ONCOLOGY, 2015, 115 : S132 - S133
  • [44] Auto-segmentation of organs at risk for head and neck radiotherapy planning: From atlas-based to deep learning methods
    Vrtovec, Tomaz
    Mocnik, Domen
    Strojan, Primoz
    Pernus, Franjo
    Ibragimov, Bulat
    MEDICAL PHYSICS, 2020, 47 (09) : E929 - E950
  • [45] A Preliminary Experience of Implementing Deep-Learning Based Auto-Segmentation in Head and Neck Cancer: A Study on Real-World Clinical Cases
    Zhong, Yang
    Yang, Yanju
    Fang, Yingtao
    Wang, Jiazhou
    Hu, Weigang
    FRONTIERS IN ONCOLOGY, 2021, 11
  • [46] Atlas based auto-segmentation of CT images: Clinical evaluation of using auto-contouring in high-dose, high-precision radiotherapy of cancer in the head and neck
    Levendag, P. C.
    Hoogeman, M.
    Teguh, D.
    Wolf, T.
    Hibbard, L.
    Wijers, O.
    Heijmen, B.
    Nowak, P.
    Vasquez-Osorio, E.
    Han, X.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2008, 72 (01): : S401 - S401
  • [47] Comparative Clinical Evaluation Of Deep-Learning-Based Algorithms In Auto-Segmentation Of Organs-At-Risk For Head And Neck Cancers
    Liu, A.
    Li, R.
    Han, C.
    Du, D.
    Sampath, S.
    Amini, A.
    Glaser, S. M.
    Wong, J. Y. C.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2020, 108 (03): : E817 - E817
  • [48] Auto-segmentation of normal and target structures in head and neck CT images: A feature-driven model-based approach
    Qazi, Arish A.
    Pekar, Vladimir
    Kim, John
    Xie, Jason
    Breen, Stephen L.
    Jaffray, David A.
    MEDICAL PHYSICS, 2011, 38 (11) : 6160 - 6170
  • [49] Geometric and dosimetric evaluation of atlas based auto-segmentation of cardiac structures in breast cancer patients
    Kaderka, Robert
    Gillespie, Erin F.
    Mundt, Robert C.
    Bryant, Alex K.
    Sanudo-Thomas, Camila B.
    Harrison, Anna L.
    Wouters, Emilie L.
    Moiseenko, Vitali
    Moore, Kevin L.
    Atwood, Todd F.
    Murphy, James D.
    RADIOTHERAPY AND ONCOLOGY, 2019, 131 : 215 - 220
  • [50] Accurate and robust auto-segmentation of head and neck organ-at-risks based on a novel CNN fine-tuning workflow
    Luan, Shunyao
    Wu, Kun
    Wu, Yuan
    Zhu, Benpeng
    Wei, Wei
    Xue, Xudong
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (01):