Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

Cited by: 1
Authors
Ren, Jintao [1, 2, 3]
Teuwen, Jonas [4]
Nijkamp, Jasper [1, 3]
Rasmussen, Mathis [1, 2, 3]
Gouw, Zeno [4]
Eriksen, Jesper Grau [2, 3]
Sonke, Jan-Jakob [4]
Korreman, Stine [1, 2, 3]
Affiliations
[1] Aarhus Univ Hosp, Danish Ctr Particle Therapy, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[2] Aarhus Univ Hosp, Dept Oncol, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[3] Aarhus Univ, Dept Clin Med, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[4] Netherlands Canc Inst, Dept Radiat Oncol, Plesmanlaan 121, NL-1066 CX Amsterdam, Netherlands
Source
PHYSICS IN MEDICINE AND BIOLOGY | 2024 / Vol. 69 / Issue 16
Keywords
uncertainty estimation; deep learning; radiotherapy; gross tumour volume; head and neck cancer; tumour segmentation; uncertainty quantification; QUANTIFICATION; OROPHARYNGEAL; DELINEATION; DAHANCA
DOI
10.1088/1361-6560/ad682d
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective. Deep learning shows promise in autosegmentation of head and neck cancer (HNC) primary tumours (GTV-T) and nodal metastases (GTV-N). However, errors such as including non-tumour regions or missing nodal metastases still occur. Conventional methods often make overconfident predictions, compromising reliability. Incorporating uncertainty estimation, which provides calibrated confidence intervals, can address this issue. Our aim was to investigate the efficacy of various uncertainty estimation methods in improving segmentation reliability. We evaluated their confidence levels in voxel predictions and their ability to reveal potential segmentation errors. Approach. We retrospectively collected data from 567 HNC patients with diverse cancer sites and multi-modality images (CT, PET, T1-, and T2-weighted MRI) along with their clinical GTV-T/N delineations. Using the nnUNet 3D segmentation pipeline, we compared seven uncertainty estimation methods, evaluating them based on segmentation accuracy (Dice similarity coefficient, DSC), confidence calibration (Expected Calibration Error, ECE), and their ability to reveal segmentation errors (Uncertainty-Error overlap using DSC, UE-DSC). Main results. Evaluated on the hold-out test dataset (n = 97), the median DSC scores for GTV-T and GTV-N segmentation across all uncertainty estimation methods had a narrow range, from 0.73 to 0.76 and 0.78 to 0.80, respectively. In contrast, the median ECE exhibited a wider range, from 0.30 to 0.12 for GTV-T and 0.25 to 0.09 for GTV-N. Similarly, the median UE-DSC also ranged broadly, from 0.21 to 0.38 for GTV-T and 0.22 to 0.36 for GTV-N. The probabilistic network method PhiSeg consistently demonstrated the best performance in terms of ECE and UE-DSC. Significance. Our study highlights the importance of uncertainty estimation in enhancing the reliability of deep learning for autosegmentation of HNC GTV. The results show that while segmentation accuracy can be similar across methods, their reliability, measured by calibration error and uncertainty-error overlap, varies significantly. Used with visualisation maps, these methods may effectively pinpoint uncertainties and potential errors at the voxel level.
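The three metrics named in the abstract (DSC, ECE, UE-DSC) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, voxels are passed as flat Python lists, and the 10-bin ECE, the binary-entropy uncertainty measure, and the 0.5 uncertainty threshold are all assumptions chosen for illustration only.

```python
import math


def dice(a, b):
    """Dice similarity coefficient between two flat binary (0/1) masks."""
    intersection = sum(x * y for x, y in zip(a, b))
    total = sum(a) + sum(b)
    # Two empty masks are treated as a perfect match (a common convention).
    return 1.0 if total == 0 else 2.0 * intersection / total


def expected_calibration_error(probs, truth, n_bins=10):
    """ECE over voxels; probs are foreground probabilities in [0, 1].

    Assumption: confidence is max(p, 1 - p) and voxels are grouped into
    n_bins equal-width confidence bins.
    """
    bins = [[] for _ in range(n_bins)]
    for p, t in zip(probs, truth):
        predicted = 1 if p >= 0.5 else 0
        confidence = max(p, 1.0 - p)
        index = min(int(confidence * n_bins), n_bins - 1)
        bins[index].append((confidence, predicted == t))
    n = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / n) * abs(accuracy - mean_conf)
    return ece


def binary_entropy(p):
    """Entropy in bits of a Bernoulli(p) prediction; 0 at p = 0 or p = 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def uncertainty_error_dice(probs, truth, threshold=0.5):
    """Dice overlap between thresholded uncertainty and the error mask.

    Assumption: a voxel counts as uncertain when its entropy exceeds the
    threshold; the error mask marks voxels where prediction != truth.
    """
    predicted = [1 if p >= 0.5 else 0 for p in probs]
    errors = [1 if y != t else 0 for y, t in zip(predicted, truth)]
    uncertain = [1 if binary_entropy(p) > threshold else 0 for p in probs]
    return dice(uncertain, errors)


if __name__ == "__main__":
    probs = [0.9, 0.9, 0.6, 0.6]
    truth = [1, 0, 1, 0]
    print(dice([1, 1, 0, 0], [1, 0, 0, 0]))
    print(expected_calibration_error(probs, truth))
    print(uncertainty_error_dice(probs, truth))
```

In this toy input every voxel is predicted as foreground, so half the predictions are wrong; the high-confidence error at 0.9 is what drives the calibration error up, while only the low-confidence error at 0.6 is flagged by the uncertainty map. That mirrors the abstract's point that methods with similar DSC can differ in how well their uncertainty reveals errors.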
Pages: 14