Polyp-LVT: Polyp segmentation with lightweight vision transformers

Cited by: 0
Authors
Lin L. [1]
Lv G. [1]
Wang B. [2]
Xu C. [1]
Liu J. [3]
Affiliations
[1] School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu
[2] School of Information Engineering, Nanjing University of Finance & Economics, Nanjing
[3] School of Computing, Ulster University, Belfast, Northern Ireland
Keywords
Colorectal cancer; Lightweight vision transformer; Polyp segmentation; Pooling layer
DOI
10.1016/j.knosys.2024.112181
Chinese Library Classification (CLC) number
R96 [Pharmacology]; R3 [Basic Medicine]; R4 [Clinical Medicine]
Discipline classification code
1001; 1002; 100602; 100706
Abstract
Automatic segmentation of polyps in endoscopic images is crucial for the early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble the surrounding mucosal tissue in texture, have indistinct borders, and vary in size, appearance, and location, all of which pose a great challenge to polyp segmentation. Although recent attempts to apply the Vision Transformer (ViT) to polyp segmentation have achieved promising performance, their application in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, which replaces the attention layer in the encoder with a global max pooling layer; this significantly reduces the model's parameter count and computational cost without degrading performance. Furthermore, we introduce a network block, named the Inter-block Feature Fusion Module (IFFM), into the decoder to provide streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that, compared with the baseline models, our Polyp-LVT network achieves a nearly 44% reduction in model parameters while maintaining comparable segmentation performance. © 2024 Elsevier B.V.
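To illustrate why swapping an attention layer for a pooling layer shrinks a model, the following is a minimal sketch and not the authors' implementation. It assumes a generic transformer block made of a token mixer plus a two-layer MLP with expansion ratio 4, and it ignores normalization layers. All function names are hypothetical. The abstract does not say how the pooled feature is fed back to the tokens, so the mixer here only computes the per-channel maximum. The ratio it prints applies to this toy block only and should not be read as the 44% whole-model figure reported above.

```python
def attention_params(dim):
    """Weights and biases of the Q, K, V and output projections (assumed standard layout)."""
    return 4 * (dim * dim + dim)


def mlp_params(dim, ratio=4):
    """Two linear layers: dim -> dim*ratio -> dim, with biases."""
    hidden = dim * ratio
    return (dim * hidden + hidden) + (hidden * dim + dim)


def block_params(dim, use_attention):
    """Parameter count of one toy block; a max pooling mixer has no learnable weights."""
    mixer = attention_params(dim) if use_attention else 0
    return mixer + mlp_params(dim)


def global_max_pool_mixer(tokens):
    """Per-channel maximum over all tokens.

    tokens: list of N token vectors, each a list of C channel values.
    Returns a single vector of length C.
    """
    return [max(channel) for channel in zip(*tokens)]


if __name__ == "__main__":
    dim = 64  # hypothetical embedding width
    with_attention = block_params(dim, use_attention=True)
    with_pooling = block_params(dim, use_attention=False)
    saved = 1 - with_pooling / with_attention
    print(f"attention block: {with_attention} parameters")
    print(f"pooling block:   {with_pooling} parameters")
    print(f"reduction in this toy block: {saved:.1%}")
    print(global_max_pool_mixer([[1, 5], [3, 2]]))
```

In this sketch the saving comes entirely from dropping the four projection matrices, since max pooling itself carries no weights; the actual reduction in any real network depends on its full architecture.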