TACT: Text attention based CNN-Transformer network for polyp segmentation

被引:0
|
作者
Zhao, Yiyang [1 ]
Li, Jinjiang [1 ,3 ]
Hua, Zhen [2 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
[3] Shandong Technol & Business Univ, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
CNN-Transformer; colonoscopy; medical image segmentation; polyp segmentation;
D O I
10.1002/ima.22997
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Colorectal cancer (CRC) has been one of the top three disease in the world in terms of incidence for many years. Therefore, how to prevent and treat CRC has become a topic of concern for an increasing number of people, and colonoscopy is the most effective detection method in polyp examination. According to studies, 90% of CRC is caused by adenomatous polyps of the large intestine. In clinical practice, the diversity of polyps' size, number, and shape and the unclear boundary between polyps and colon folds can reduce the operator's accuracy of polyps segmentation and lead to a higher rate of missed diagnosis. To better address the inaccurate segmentation or high miss rate due to the above factors, we propose a text attention-based CNN-Transformer network for polyp segmentation (TACT) network to process the images in a way that minimizes operator subjectivity and miss rate. The network is based on the CNN-Transformer structure, and on this basis, a fully attention progressive sampling module is added to more accurately divide the polyp boundary. Moreover, an auxiliary text classification task was added to focus on polyp size and number features in the form of text attention, which more effectively copes with the segmentation tasks of different sizes and different numbers of polyps. After comparing with multiple state-of-the-art segmentation methods in four challenging datasets, our proposed TACT improves segmentation accuracy for polyps of different sizes in different datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
    Yu, Zhihong
    Lee, Feifei
    Chen, Qiu
    APPLIED INTELLIGENCE, 2023, 53 (17) : 19990 - 20006
  • [32] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
    Zhihong Yu
    Feifei Lee
    Qiu Chen
    Applied Intelligence, 2023, 53 : 19990 - 20006
  • [33] A Hybrid CNN-Transformer Architecture for Semantic Segmentation of Radar Sounder data
    Ghosh, Raktim
    Bovolo, Francesca
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1320 - 1323
  • [34] A CNN-transformer hybrid approach for decoding visual neural activity into text
    Zhang, Jiang
    Li, Chen
    Liu, Ganwanming
    Min, Min
    Wang, Chong
    Li, Jiyi
    Wang, Yuting
    Yan, Hongmei
    Zuo, Zhentao
    Huang, Wei
    Chen, Huafu
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 214
  • [35] STA-Former: enhancing medical image segmentation with Shrinkage Triplet Attention in a hybrid CNN-Transformer model
    Yuzhao Liu
    Liming Han
    Bin Yao
    Qing Li
    Signal, Image and Video Processing, 2024, 18 : 1901 - 1910
  • [36] SWFormer: A scale-wise hybrid CNN-Transformer network for multi-classes weed segmentation
    Jiang, Hongkui
    Chen, Qiupu
    Wang, Rujing
    Du, Jianming
    Chen, Tianjiao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [37] Shallow Attention Network for Polyp Segmentation
    Wei, Jun
    Hu, Yiwen
    Zhang, Ruimao
    Li, Zhen
    Zhou, S. Kevin
    Cui, Shuguang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 699 - 708
  • [38] ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines
    Liu, Yunling
    Liu, Yaxiong
    Li, Jingsong
    Chen, Yaoxing
    Xu, Fengjuan
    Xu, Yifa
    Cao, Jing
    Ma, Yuntao
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (10) : 18650 - 18669
  • [39] MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation
    Xie, Shiao
    Huang, Huimin
    Niu, Ziwei
    Lin, Lanfen
    Chen, Yen-Wei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1913 - 1918
  • [40] SAEFormer: stepwise attention emphasis transformer for polyp segmentation
    Tan, Yicai
    Chen, Lei
    Zheng, Chudong
    Ling, Hui
    Lai, Xinshan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74833 - 74853