Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy

被引:79
|
作者
Yeung, Michael [1 ,2 ]
Sala, Evis [1 ,3 ]
Schonlieb, Carola-Bibiane [4 ]
Rundo, Leonardo [1 ,3 ]
机构
[1] Univ Cambridge, Dept Radiol, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[2] Univ Cambridge, Sch Clin Med, Cambridge CB2 0SP, England
[3] Univ Cambridge, Canc Res UK Cambridge Ctr, Cambridge CB2 0RE, England
[4] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB3 0WA, England
基金
英国工程与自然科学研究理事会; 英国惠康基金; 欧盟地平线“2020”; 英国科学技术设施理事会;
关键词
Polyp segmentation; Colorectal cancer; Colonoscopy; Computer-aided diagnosis; Focus U-Net; Attention mechanisms; Loss function; COLORECTAL-CANCER; MISS RATE; NETWORKS; RISK;
D O I
10.1016/j.compbiomed.2021.104815
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Colonoscopy remains the gold-standard screening for colorectal cancer. However, significant miss rates for polyps have been reported, particularly when there are multiple small adenomas. This presents an opportunity to leverage computer-aided systems to support clinicians and reduce the number of polyps missed. Method: In this work we introduce the Focus U-Net, a novel dual attention-gated deep neural network, which combines efficient spatial and channel-based attention into a single Focus Gate module to encourage selective learning of polyp features. The Focus U-Net incorporates several further architectural modifications, including the addition of short-range skip connections and deep supervision. Furthermore, we introduce the Hybrid Focal loss, a new compound loss function based on the Focal loss and Focal Tversky loss, designed to handle classimbalanced image segmentation. For our experiments, we selected five public datasets containing images of polyps obtained during optical colonoscopy: CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB, ETIS-Larib PolypDB and EndoScene test set. We first perform a series of ablation studies and then evaluate the Focus U-Net on the CVCClinicDB and Kvasir-SEG datasets separately, and on a combined dataset of all five public datasets. To evaluate model performance, we use the Dice similarity coefficient (DSC) and Intersection over Union (IoU) metrics. Results: Our model achieves state-of-the-art results for both CVC-ClinicDB and Kvasir-SEG, with a mean DSC of 0.941 and 0.910, respectively. When evaluated on a combination of five public polyp datasets, our model similarly achieves state-of-the-art results with a mean DSC of 0.878 and mean IoU of 0.809, a 14% and 15% improvement over the previous state-of-the-art results of 0.768 and 0.702, respectively. Conclusions: This study shows the potential for deep learning to provide fast and accurate polyp segmentation results for use during colonoscopy. The Focus U-Net may be adapted for future use in newer non-invasive colorectal cancer screening and more broadly to other biomedical image segmentation tasks similarly involving class imbalance and requiring efficiency.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Segmentation of the Thoracic Aorta using an Attention-Gated U-Net
    Zhong, Jiayang
    Bian, Zhangxing
    Hatt, Charles R.
    Burris, Nicholas S.
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [2] S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation
    Montalbo, Francis Jesmar P.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [3] Multimodal attention-gated cascaded U-Net model for automatic brain tumor detection and segmentation
    Chinnam, Siva Koteswara Rao
    Sistla, Venkatramaphanikumar
    Kolli, Venkata Krishna Kishore
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [4] Attention-gated U-Net networks for simultaneous axial/sagittal planes segmentation of injured spinal cords
    Masse-Gignac, Nicolas
    Florez-Jimenez, Salomon
    Mac-Thiong, Jean-Marc
    Duong, Luc
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (10):
  • [5] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
    Yiting Zhong
    Yongyi Chen
    Dan Zhang
    Yanghui Xu
    Hamid Reza Karimi
    Signal, Image and Video Processing, 2023, 17 : 4143 - 4151
  • [6] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
    Zhong, Yiting
    Chen, Yongyi
    Zhang, Dan
    Xu, Yanghui
    Karimi, Hamid Reza
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4143 - 4151
  • [7] Dual Encoder Attention U-net for nuclei segmentation
    Vahadane, Abhishek
    Atheeth, B.
    Majumdar, Shantanu
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3205 - 3208
  • [8] Attention-Gated U-Net Implementation in Total Marrow Irradiation Plan Dose Prediction
    Du, D.
    Qing, K.
    Watkins, W.
    Han, C.
    Ketcherside, T.
    Wong, J.
    Williams, T.
    Liu, A.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [9] Polyp Segmentation in Colonoscopy Images using U-Net and Cyclic Learning Rate
    Bulut, Betul
    Butun, Ertan
    Kaya, Mehmet
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 1149 - 1152
  • [10] Automatic cortical surface parcellation in the fetal brain using attention-gated spherical U-net
    You, Sungmin
    Barba, Anette De Leon
    Tamayo, Valeria Cruz
    Yun, Hyuk Jin
    Yang, Edward
    Grant, P. Ellen
    Im, Kiho
    FRONTIERS IN NEUROSCIENCE, 2024, 18