Polyp-LVT: Polyp segmentation with lightweight vision transformers

被引：0

作者：

Lin L. ^{[1
]}

Lv G. ^{[1
]}

Wang B. ^{[2
]}

Xu C. ^{[1
]}

Liu J. ^{[3
]}

机构：

[1] School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu

[2] School of Information Engineering, Nanjing University of Finance & Economics, Nanjing

[3] School of Computing, Ulster University, Belfast, Northern Ireland

来源：

Knowledge-Based Systems | 2024年 / 300卷

关键词：

Colorectal cancer; Lightweight vision transformer; Polyp segmentation; Pooling layer;

D O I：

10.1016/j.knosys.2024.112181

中图分类号：

R96 [药理学]; R3 [基础医学]; R4 [临床医学];

学科分类号：

1001 ; 1002 ; 100602 ; 100706 ;

摘要：

Automatic segmentation of polyps in endoscopic images is crucial for early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble surrounding mucosal tissue in both texture and indistinct borders and vary in size, appearance, and location which possess great challenge to polyp segmentation. Although some recent attempts have been made to apply Vision Transformer (ViT) to polyp segmentation and achieved promising performance, their application in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, strategically replacing the attention layer in the encoder with a global max pooling layer, which significantly reduces the model's parameter count and computational cost while keeping the performance undegraded. Furthermore, we introduce a network block, named Inter-block Feature Fusion Module (IFFM), into the decoder, aiming to offer a streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that compared with the baseline models, our Polyp-LVT network achieves a nearly 44% reduction in model parameters while gaining comparable segmentation performance. © 2024 Elsevier B.V.

引用

共 50 条

[1] Polyp2Seg: Improved Polyp Segmentation with Vision Transformer
Mandujano-Cornejo, Vittorino
Montoya-Zegarra, Javier A.
MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 519 - 534
[2] Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
Nanni, Loris
Fantozzi, Carlo
Loreggia, Andrea
Lumini, Alessandra
SENSORS, 2023, 23 (10)
[3] PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation
Shang, Xiao
Wu, Siqi
Liu, Yuhao
Zhao, Zhenfeng
Wang, Shenwen
Applied Intelligence, 2025, 55 (01)
[4] Attention combined pyramid vision transformer for polyp segmentation
Liu, Xiaogang
Song, Shuang
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
[5] Polyp-SAM: Transfer SAM for Polyp Segmentation
Li, Yuheng
Hu, Mingzhe
Yang, Xiaofeng
COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024, 2024, 12927
[6] Semantic Polyp Generation for Improving Polyp Segmentation Performance
Song, Hun
Shin, Younghak
JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2024, 44 (02) : 280 - 292
[7] Meta-Polyp: a baseline for efficient Polyp segmentation
Trinh, Quoc-Huy
2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 742 - 747
[8] Polyp segmentation network based on lightweight model and reverse attention mechanisms
Long, Jianwu
Yang, Chengxin
Song, Xinlei
Zeng, Ziqin
Ren, Yan
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (03)
[9] Tiny polyp detection from endoscopic video frames using vision transformers
Liu, Entong
He, Bishi
Zhu, Darong
Chen, Yuanjiao
Xu, Zhe
PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
[10] Endoscopy-assisted lightweight diagnosis system based on transformers for colon polyp detection
Weiming Fan
Jiahui Yu
Zhaojie Ju
Ju, Zhaojie (zhaojie.ju@port.ac.uk), 2025, 21 (01) : 57 - 64

← 1 2 3 4 5 →