Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

被引:1
|
作者
Ren, Jintao [1 ,2 ,3 ]
Teuwen, Jonas [4 ]
Nijkamp, Jasper [1 ,3 ]
Rasmussen, Mathis [1 ,2 ,3 ]
Gouw, Zeno [4 ]
Eriksen, Jesper Grau [2 ,3 ]
Sonke, Jan-Jakob [4 ]
Korreman, Stine [1 ,2 ,3 ]
机构
[1] Aarhus Univ Hosp, Danish Ctr Particle Therapy, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[2] Aarhus Univ Hosp, Dept Oncol, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[3] Aarhus Univ, Dept Clin Med, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[4] Netherlands Canc Inst, Dept Radiat Oncol, Plesmanlaan 121, NL-1066 CX Amsterdam, Netherlands
来源
PHYSICS IN MEDICINE AND BIOLOGY | 2024年 / 69卷 / 16期
关键词
uncertainty estimation; deep learning; radiotherapy; gross tumour volume; head and neck cancer; tumour segmentation; uncertainty quantification; QUANTIFICATION; OROPHARYNGEAL; DELINEATION; DAHANCA;
D O I
10.1088/1361-6560/ad682d
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. Deep learning shows promise in autosegmentation of head and neck cancer (HNC) primary tumours (GTV-T) and nodal metastases (GTV-N). However, errors such as including non-tumour regions or missing nodal metastases still occur. Conventional methods often make overconfident predictions, compromising reliability. Incorporating uncertainty estimation, which provides calibrated confidence intervals can address this issue. Our aim was to investigate the efficacy of various uncertainty estimation methods in improving segmentation reliability. We evaluated their confidence levels in voxel predictions and ability to reveal potential segmentation errors. Approach. We retrospectively collected data from 567 HNC patients with diverse cancer sites and multi-modality images (CT, PET, T1-, and T2-weighted MRI) along with their clinical GTV-T/N delineations. Using the nnUNet 3D segmentation pipeline, we compared seven uncertainty estimation methods, evaluating them based on segmentation accuracy (Dice similarity coefficient, DSC), confidence calibration (Expected Calibration Error, ECE), and their ability to reveal segmentation errors (Uncertainty-Error overlap using DSC, UE-DSC). Main results. Evaluated on the hold-out test dataset (n = 97), the median DSC scores for GTV-T and GTV-N segmentation across all uncertainty estimation methods had a narrow range, from 0.73 to 0.76 and 0.78 to 0.80, respectively. In contrast, the median ECE exhibited a wider range, from 0.30 to 0.12 for GTV-T and 0.25 to 0.09 for GTV-N. Similarly, the median UE-DSC also ranged broadly, from 0.21 to 0.38 for GTV-T and 0.22 to 0.36 for GTV-N. A probabilistic network-PhiSeg method consistently demonstrated the best performance in terms of ECE and UE-DSC. Significance. Our study highlights the importance of uncertainty estimation in enhancing the reliability of deep learning for autosegmentation of HNC GTV. The results show that while segmentation accuracy can be similar across methods, their reliability, measured by calibration error and uncertainty-error overlap, varies significantly. Used with visualisation maps, these methods may effectively pinpoint uncertainties and potential errors at the voxel level.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Enhancing non-small cell lung cancer radiotherapy planning: A deep learning-based multi-modal fusion approach for accurate GTV segmentation
    Atiya, Shaik Ummay
    Ramesh, N. V. K.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [32] An Attention-Based Residual U-Net for Tumour Segmentation Using Multi-Modal MRI Brain Images
    Naqvi, Najme Zehra
    Seeja, K. R.
    IEEE ACCESS, 2025, 13 : 10240 - 10251
  • [33] Multi-Modal Physiological Data Fusion for Affect Estimation Using Deep Learning
    Hssayeni, Murtadha D.
    Ghoraani, Behnaz
    IEEE ACCESS, 2021, 9 : 21642 - 21652
  • [34] Wavelet based texture segmentation of multi-modal tomographic images
    Busch, C
    COMPUTERS & GRAPHICS-UK, 1997, 21 (03): : 347 - 358
  • [35] Deterministic Uncertainty Estimation for Multi-Modal Regression With Deep Neural Networks
    Cho, Jaehak
    Kim, Jae Myung
    Han, Seungyub
    Lee, Jungwoo
    IEEE ACCESS, 2025, 13 : 45281 - 45289
  • [36] Contour propagation and uncertainty estimation using deep learning in head and neck treatments
    Rivetti, L.
    Studen, A.
    Sharma, M.
    Chan, J.
    Jeraj, R.
    RADIOTHERAPY AND ONCOLOGY, 2023, 182 : S1691 - S1692
  • [37] Deep learning-based multi-modal approach using RGB and skeleton sequences for human activity recognition
    Pratishtha Verma
    Animesh Sah
    Rajeev Srivastava
    Multimedia Systems, 2020, 26 : 671 - 685
  • [38] Deep learning-based multi-modal data integration enhancing breast cancer disease-free survival prediction
    Wang, Zehua
    Lin, Ruichong
    Li, Yanchun
    Zeng, Jin
    Chen, Yongjian
    Ouyang, Wenhao
    Li, Han
    Jia, Xueyan
    Lai, Zijia
    Yu, Yunfang
    Yao, Herui
    Su, Weifeng
    PRECISION CLINICAL MEDICINE, 2024, 7 (02)
  • [39] Deep learning-based multi-modal approach using RGB and skeleton sequences for human activity recognition
    Verma, Pratishtha
    Sah, Animesh
    Srivastava, Rajeev
    MULTIMEDIA SYSTEMS, 2020, 26 (06) : 671 - 685
  • [40] Deep learning-based auto segmentation using generative adversarial network on magnetic resonance images obtained for head and neck cancer patients
    Kawahara, Daisuke
    Tsuneda, Masato
    Ozawa, Shuichi
    Okamoto, Hiroyuki
    Nakamura, Mitsuhiro
    Nishio, Teiji
    Nagata, Yasushi
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2022, 23 (05):