Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy

被引:79
|
作者
Yeung, Michael [1 ,2 ]
Sala, Evis [1 ,3 ]
Schonlieb, Carola-Bibiane [4 ]
Rundo, Leonardo [1 ,3 ]
机构
[1] Univ Cambridge, Dept Radiol, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[2] Univ Cambridge, Sch Clin Med, Cambridge CB2 0SP, England
[3] Univ Cambridge, Canc Res UK Cambridge Ctr, Cambridge CB2 0RE, England
[4] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB3 0WA, England
基金
英国工程与自然科学研究理事会; 英国惠康基金; 欧盟地平线“2020”; 英国科学技术设施理事会;
关键词
Polyp segmentation; Colorectal cancer; Colonoscopy; Computer-aided diagnosis; Focus U-Net; Attention mechanisms; Loss function; COLORECTAL-CANCER; MISS RATE; NETWORKS; RISK;
D O I
10.1016/j.compbiomed.2021.104815
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Colonoscopy remains the gold-standard screening for colorectal cancer. However, significant miss rates for polyps have been reported, particularly when there are multiple small adenomas. This presents an opportunity to leverage computer-aided systems to support clinicians and reduce the number of polyps missed. Method: In this work we introduce the Focus U-Net, a novel dual attention-gated deep neural network, which combines efficient spatial and channel-based attention into a single Focus Gate module to encourage selective learning of polyp features. The Focus U-Net incorporates several further architectural modifications, including the addition of short-range skip connections and deep supervision. Furthermore, we introduce the Hybrid Focal loss, a new compound loss function based on the Focal loss and Focal Tversky loss, designed to handle classimbalanced image segmentation. For our experiments, we selected five public datasets containing images of polyps obtained during optical colonoscopy: CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB, ETIS-Larib PolypDB and EndoScene test set. We first perform a series of ablation studies and then evaluate the Focus U-Net on the CVCClinicDB and Kvasir-SEG datasets separately, and on a combined dataset of all five public datasets. To evaluate model performance, we use the Dice similarity coefficient (DSC) and Intersection over Union (IoU) metrics. Results: Our model achieves state-of-the-art results for both CVC-ClinicDB and Kvasir-SEG, with a mean DSC of 0.941 and 0.910, respectively. When evaluated on a combination of five public polyp datasets, our model similarly achieves state-of-the-art results with a mean DSC of 0.878 and mean IoU of 0.809, a 14% and 15% improvement over the previous state-of-the-art results of 0.768 and 0.702, respectively. Conclusions: This study shows the potential for deep learning to provide fast and accurate polyp segmentation results for use during colonoscopy. The Focus U-Net may be adapted for future use in newer non-invasive colorectal cancer screening and more broadly to other biomedical image segmentation tasks similarly involving class imbalance and requiring efficiency.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] A Segmentation Method Based on Dual Attention Mechanism and U-Net for Corrosion Images
    Chen F.
    Cheng M.
    Yang Y.
    Chen B.
    Xiao W.
    Xiao N.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2021, 55 (12): : 119 - 128
  • [22] Polyp Segmentation Using Stair-structured U-Net
    SHI Yonggang
    LI Yi
    ZHOU Zhiguo
    ZHANG Yue
    XIA Zhuoyan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (01) : 39 - 47
  • [23] DARU-Net: A dual attention residual U-Net for uterine fibroids segmentation on MRI
    Zhang, Jian
    Liu, Yang
    Chen, Liping
    Ma, Si
    Zhong, Yuqing
    He, Zhimin
    Li, Chengwei
    Xiao, Zhibo
    Zheng, Yineng
    Lv, Fajin
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (06):
  • [24] Enhanced U-Net: A Feature Enhancement Network for Polyp Segmentation
    Patel, Krushi
    Bur, Andres M.
    Wang, Guanghui
    2021 18TH CONFERENCE ON ROBOTS AND VISION (CRV 2021), 2021, : 181 - 188
  • [25] Attention guided U-Net for accurate iris segmentation
    Lian, Sheng
    Luo, Zhiming
    Zhong, Zhun
    Lin, Xiang
    Su, Songzhi
    Li, Shaozi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 296 - 304
  • [26] Multiscale Attention U-Net for Skin Lesion Segmentation
    Alahmadi, Mohammad D.
    IEEE ACCESS, 2022, 10 : 59145 - 59154
  • [27] U-Net with Attention Mechanism for Retinal Vessel Segmentation
    Si, Ze
    Fu, Dongmei
    Li, Jiahao
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 668 - 677
  • [28] Synergistic attention U-Net for sublingual vein segmentation
    Tingxiao Yang
    Yuichiro Yoshimura
    Akira Morita
    Takao Namiki
    Toshiya Nakaguchi
    Artificial Life and Robotics, 2019, 24 : 550 - 559
  • [29] Synergistic attention U-Net for sublingual vein segmentation
    Yang, Tingxiao
    Yoshimura, Yuichiro
    Morita, Akira
    Namiki, Takao
    Nakaguchi, Toshiya
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (04) : 550 - 559
  • [30] Epileptic Seizure Detection in EEG via Fusion of Multi-View Attention-Gated U-Net Deep Neural Networks
    Chatzichristos, C.
    Dan, J.
    Narayanan, A. Mundanad
    Seeuws, N.
    Vandecasteele, K.
    De Vos, M.
    Bertrand, A.
    Van Huffel, S.
    2020 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM, 2020,