Attention combined pyramid vision transformer for polyp segmentation

被引:4
|
作者
Liu, Xiaogang [1 ]
Song, Shuang [1 ]
机构
[1] Nanjing Tech Univ, Coll Comp & Informat Engn, Nanjing 211800, Peoples R China
关键词
Polyp segmentation; Convolutional neural network; Pyramid vision transformer; Endoscope; COLORECTAL-CANCER; NETWORK; IMAGES; ENDOSCOPY; COLON;
D O I
10.1016/j.bspc.2023.105792
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Colorectal cancer (CRC) has become one of the most frequent cancers in the world. To prevent CRC, proper polyp localization in endoscopy images plays a vital role for detecting and removing colorectal polyps. Most polyp segmentation methods use convolutional neural networks (CNN) as their backbone, and have achieved promising results to effectively assist clinicians in their diagnosis. However, those CNN-based approaches have limitations in modeling accurate location and shape of polyps, due to the intrinsic locality property of convolutional operations. To address these limitations, this study proposes a novel network, namely AttPVT, that combines CNN and Pyramid Vision Transformer (PVT) together for poly segmentation. The main challenge lies in maintaining long-range semantic information without sacrificing low-level features. Att-PVT applies multidimensional information extraction (MIE) to generate refined feature maps extracted from PVT for better feature representation. Cascaded context integration (CCI) is designed to adaptively aggregating the three highest layers of refined polyp features for learning semantic and location information. To accurately segment colorectal polyps, Att-PVT introduces multilevel feature fusion (MFF) module that explores the boundary information in the shallower layer based on the global map. The proposed workflow has undergone comparative experiments on three public datasets, namely Kvasir, ColonDB, and ETIS. The results show that the proposed approach achieves impressive mDice scores of 0.926, 0.817, and 0.794 for polyp segmentation tasks on these datasets, surpassing other state-of-the-art methods. This indicates the superior generalization and scalability of the proposed approach.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Colorectal Polyp Segmentation Combining Pyramid Vision Transformer and Axial Attention
    Zhou, Xue
    Bai, Zhengyao
    Lu, Qianjie
    Fan, Shenglan
    [J]. Computer Engineering and Applications, 2023, 59 (11) : 222 - 230
  • [2] Three-stage polyp segmentation network based on reverse attention feature purification with Pyramid Vision transformer
    Meng, Lingbing
    Li, Yuting
    Duan, Weiwei
    [J]. Computers in Biology and Medicine, 2024, 179
  • [3] Polyp2Seg: Improved Polyp Segmentation with Vision Transformer
    Mandujano-Cornejo, Vittorino
    Montoya-Zegarra, Javier A.
    [J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 519 - 534
  • [4] PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation
    Shang, Xiao
    Wu, Siqi
    Liu, Yuhao
    Zhao, Zhenfeng
    Wang, Shenwen
    [J]. Applied Intelligence, 2025, 55 (01)
  • [5] SAEFormer: stepwise attention emphasis transformer for polyp segmentation
    Tan, Yicai
    Chen, Lei
    Zheng, Chudong
    Ling, Hui
    Lai, Xinshan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74833 - 74853
  • [6] Improved dual-aggregation polyp segmentation network combining a pyramid vision transformer with a fully convolutional network
    Li, Feng
    Huang, Zetao
    Zhou, Lu
    Chen, Yuyang
    Tang, Shiqing
    Ding, Pengchao
    Peng, Haixia
    Chu, Yimin
    [J]. BIOMEDICAL OPTICS EXPRESS, 2024, 15 (04): : 2590 - 2621
  • [7] PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion
    Lijin, P.
    Ullah, Mohib
    Vats, Anuja
    Cheikh, Faouzi Alaya
    Kumar, G. Santhosh
    Nair, Madhu S.
    [J]. BIOMEDICAL ENGINEERING LETTERS, 2024, : 1421 - 1431
  • [8] Shared Hybrid Attention Transformer network for colon polyp segmentation
    Ji, Zexuan
    Qian, Hao
    Ma, Xiao
    [J]. Neurocomputing, 2025, 616
  • [9] PYRAMID TRANSFORMER DRIVEN MULTIBRANCH FUSION FOR POLYP SEGMENTATION IN COLONOSCOPIC VIDEO IMAGES
    Wang, Ao
    Wu, Ming
    Qi, Hao
    Shi, Hong
    Chen, Jianhua
    Chen, Yinran
    Luo, Xiongbiao
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2350 - 2354
  • [10] Attention-Guided Pyramid Context Network for Polyp Segmentation in Colonoscopy Images
    Yue, Guanghui
    Li, Siying
    Cong, Runmin
    Zhou, Tianwei
    Lei, Baiying
    Wang, Tianfu
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72