CGMA-Net: Cross-Level Guidance and Multi-Scale Aggregation Network for Polyp Segmentation

被引:0
|
作者
Zheng, Jianwei [1 ]
Yan, Yidong [1 ]
Zhao, Liang [2 ,3 ]
Pan, Xiang [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Stomatol Hosp, Xiamen Med Coll, Xiamen 361000, Peoples R China
[3] Xiamen Key Lab Stomatol Dis Diag & Treatment, Xiamen 361000, Peoples R China
关键词
Image segmentation; Semantics; Feature extraction; Transformers; Decoding; Data mining; Bioinformatics; Polyp segmentation; cross-level feature guidance (CFG); multi-scale feature aggregation; cross interaction; details refinement (DR); VALIDATION; ATTENTION;
D O I
10.1109/JBHI.2023.3345479
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Colonoscopy is considered the best prevention and control method for colorectal cancer, which suffers extremely high rates of mortality and morbidity. Automated polyp segmentation of colonoscopy images is of great importance since manual polyp segmentation requires a considerable time of experienced specialists. However, due to the high similarity between polyps and mucosa, accompanied by the complex morphological features of colonic polyps, the performance of automatic polyp segmentation is still unsatisfactory. Accordingly, we propose a network, namely Cross-level Guidance and Multi-scale Aggregation (CGMA-Net), to earn a performance promotion. Specifically, three modules, including Cross-level Feature Guidance (CFG), Multi-scale Aggregation Decoder (MAD), and Details Refinement (DR), are individually proposed and synergistically assembled. With CFG, we generate spatial attention maps from the higher-level features and then multiply them with the lower-level features, highlighting the region of interest and suppressing the background information. In MAD, we parallelly use multiple dilated convolutions of different sizes to capture long-range dependencies between features. For DR, an asynchronous convolution is used along with the attention mechanism to enhance both the local details and the global information. The proposed CGMA-Net is evaluated on two benchmark datasets, i.e., CVC-ClinicDB and Kvasir-SEG, whose results demonstrate that our method not only presents state-of-the-art performance but also holds relatively fewer parameters. Concretely, we achieve the Dice Similarity Coefficient (DSC) of 91.85% and 95.73% on Kvasir-SEG and CVC-ClinicDB, respectively. The assessment of model generalization is also conducted, resulting in DSC scores of 86.25% and 86.97% on the two datasets respectively.
引用
收藏
页码:1424 / 1435
页数:12
相关论文
共 50 条
  • [1] Cross-level Feature Aggregation Network for Polyp Segmentation
    Zhou, Tao
    Zhou, Yi
    He, Kelei
    Gong, Chen
    Yang, Jian
    Fu, Huazhu
    Shen, Dinggang
    [J]. PATTERN RECOGNITION, 2023, 140
  • [2] CIFG-Net: Cross-level information fusion and guidance network for Polyp Segmentation
    Li, Weisheng
    Huang, Zhaopeng
    Li, Feiyan
    Zhao, Yinghui
    Zhang, Hongchuan
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [3] A multi-scale perceptual polyp segmentation network based on boundary guidance
    Lu, Lu
    Chen, Shuhan
    Tang, Haonan
    Zhang, Xinfeng
    Hu, Xuelong
    [J]. IMAGE AND VISION COMPUTING, 2023, 138
  • [4] EMS-Net: Enhanced Multi-Scale Network for Polyp Segmentation
    Wang, Miao
    An, Xingwei
    Li, Yuhao
    Li, Ning
    Hang, Wei
    Liu, Gang
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2936 - 2939
  • [5] CafeNet : A Novel Multi-Scale Context Aggregation and Multi-Level Foreground Enhancement Network for Polyp Segmentation
    Ji, Zhanlin
    Li, Xiaoyu
    Wang, Zhiwu
    Zhang, Haiyang
    Yuan, Na
    Zhang, Xueji
    Ganchev, Ivan
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
  • [6] 2MGAS-Net: multi-level multi-scale gated attentional squeezed network for polyp segmentation
    Bakkouri, Ibtissam
    Bakkouri, Siham
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (6-7) : 5377 - 5386
  • [7] Cross-Level Context Fusion Network for Polyp Segmentation in Colonoscopy Images
    Cai, Duanfang
    Zhan, Kongcai
    Tan, Youguo
    Chen, Xiaoyan
    Luo, Heng
    Li, Guangyu
    [J]. IEEE ACCESS, 2024, 12 : 35366 - 35377
  • [8] DEMF-Net: A dual encoder multi-scale feature fusion network for polyp segmentation
    Cao, Xiaorui
    Yu, He
    Yan, Kang
    Cui, Rong
    Guo, Jinming
    Li, Xuan
    Xing, Xiaoxue
    Huang, Tao
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96
  • [9] MC-Net: Multi-Scale Feature Fusion and Cross-Level Information Interaction Network for Traffic Sign Detection
    Yu, Zhongyi
    Cheng, Debo
    Zhang, Wenzhen
    Chen, Jing
    Zhang, Shichao
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 841 - 848
  • [10] CrossFormer: Multi-scale cross-attention for polyp segmentation
    Chen, Lifang
    Ge, Hongze
    Li, Jiawei
    [J]. IET IMAGE PROCESSING, 2023, 17 (12) : 3441 - 3452