TACT: Text attention based CNN-Transformer network for polyp segmentation

被引：0

作者：

Zhao, Yiyang ^{[1
]}

Li, Jinjiang ^{[1
,3
]}

Hua, Zhen ^{[2
]}

机构：

[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China

[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China

[3] Shandong Technol & Business Univ, Yantai 264005, Peoples R China

来源：

INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY | 2024年 / 34卷 / 02期

基金：

中国国家自然科学基金;

关键词：

CNN-Transformer; colonoscopy; medical image segmentation; polyp segmentation;

D O I：

10.1002/ima.22997

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Colorectal cancer (CRC) has been one of the top three disease in the world in terms of incidence for many years. Therefore, how to prevent and treat CRC has become a topic of concern for an increasing number of people, and colonoscopy is the most effective detection method in polyp examination. According to studies, 90% of CRC is caused by adenomatous polyps of the large intestine. In clinical practice, the diversity of polyps' size, number, and shape and the unclear boundary between polyps and colon folds can reduce the operator's accuracy of polyps segmentation and lead to a higher rate of missed diagnosis. To better address the inaccurate segmentation or high miss rate due to the above factors, we propose a text attention-based CNN-Transformer network for polyp segmentation (TACT) network to process the images in a way that minimizes operator subjectivity and miss rate. The network is based on the CNN-Transformer structure, and on this basis, a fully attention progressive sampling module is added to more accurately divide the polyp boundary. Moreover, an auxiliary text classification task was added to focus on polyp size and number features in the form of text attention, which more effectively copes with the segmentation tasks of different sizes and different numbers of polyps. After comparing with multiple state-of-the-art segmentation methods in four challenging datasets, our proposed TACT improves segmentation accuracy for polyps of different sizes in different datasets.

引用

页数：16

共 50 条

[31] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
Yu, Zhihong
Lee, Feifei
Chen, Qiu
APPLIED INTELLIGENCE, 2023, 53 (17) : 19990 - 20006
[32] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
Zhihong Yu
Feifei Lee
Qiu Chen
Applied Intelligence, 2023, 53 : 19990 - 20006
[33] A Hybrid CNN-Transformer Architecture for Semantic Segmentation of Radar Sounder data
Ghosh, Raktim
Bovolo, Francesca
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1320 - 1323
[34] A CNN-transformer hybrid approach for decoding visual neural activity into text
Zhang, Jiang
Li, Chen
Liu, Ganwanming
Min, Min
Wang, Chong
Li, Jiyi
Wang, Yuting
Yan, Hongmei
Zuo, Zhentao
Huang, Wei
Chen, Huafu
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 214
[35] STA-Former: enhancing medical image segmentation with Shrinkage Triplet Attention in a hybrid CNN-Transformer model
Yuzhao Liu
Liming Han
Bin Yao
Qing Li
Signal, Image and Video Processing, 2024, 18 : 1901 - 1910
[36] SWFormer: A scale-wise hybrid CNN-Transformer network for multi-classes weed segmentation
Jiang, Hongkui
Chen, Qiupu
Wang, Rujing
Du, Jianming
Chen, Tianjiao
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
[37] Shallow Attention Network for Polyp Segmentation
Wei, Jun
Hu, Yiwen
Zhang, Ruimao
Li, Zhen
Zhou, S. Kevin
Cui, Shuguang
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 699 - 708
[38] ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines
Liu, Yunling
Liu, Yaxiong
Li, Jingsong
Chen, Yaoxing
Xu, Fengjuan
Xu, Yifa
Cao, Jing
Ma, Yuntao
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (10) : 18650 - 18669
[39] MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation
Xie, Shiao
Huang, Huimin
Niu, Ziwei
Lin, Lanfen
Chen, Yen-Wei
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1913 - 1918
[40] SAEFormer: stepwise attention emphasis transformer for polyp segmentation
Tan, Yicai
Chen, Lei
Zheng, Chudong
Ling, Hui
Lai, Xinshan
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74833 - 74853

← 1 2 3 4 5 →