TACT: Text attention based CNN-Transformer network for polyp segmentation

被引:0
|
作者
Zhao, Yiyang [1 ]
Li, Jinjiang [1 ,3 ]
Hua, Zhen [2 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
[3] Shandong Technol & Business Univ, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
CNN-Transformer; colonoscopy; medical image segmentation; polyp segmentation;
D O I
10.1002/ima.22997
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Colorectal cancer (CRC) has been one of the top three disease in the world in terms of incidence for many years. Therefore, how to prevent and treat CRC has become a topic of concern for an increasing number of people, and colonoscopy is the most effective detection method in polyp examination. According to studies, 90% of CRC is caused by adenomatous polyps of the large intestine. In clinical practice, the diversity of polyps' size, number, and shape and the unclear boundary between polyps and colon folds can reduce the operator's accuracy of polyps segmentation and lead to a higher rate of missed diagnosis. To better address the inaccurate segmentation or high miss rate due to the above factors, we propose a text attention-based CNN-Transformer network for polyp segmentation (TACT) network to process the images in a way that minimizes operator subjectivity and miss rate. The network is based on the CNN-Transformer structure, and on this basis, a fully attention progressive sampling module is added to more accurately divide the polyp boundary. Moreover, an auxiliary text classification task was added to focus on polyp size and number features in the form of text attention, which more effectively copes with the segmentation tasks of different sizes and different numbers of polyps. After comparing with multiple state-of-the-art segmentation methods in four challenging datasets, our proposed TACT improves segmentation accuracy for polyps of different sizes in different datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] An FFT-based CNN-Transformer Encoder for Semantic Segmentation of Radar Sounder Signal
    Ghosh, Raktim
    Bovolo, Francesca
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
  • [22] A hierarchical CNN-Transformer model for network intrusion detection
    Luo, Sijie
    Zhao, Zhiheng
    Hu, Qiyuan
    Liu, Yang
    2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING, AND INTELLIGENT COMPUTING (CAMMIC 2022), 2022, 12259
  • [23] A CNN-transformer hybrid network with selective fusion and dual attention for image super-resolution
    Chun Zhang
    Jin Wang
    Yunhui Shi
    Baocai Yin
    Nam Ling
    Multimedia Systems, 2025, 31 (2)
  • [24] Dual-branch feature extraction network combined with Transformer and CNN for polyp segmentation
    Liu, Qiaohong
    Lin, Yuanjie
    Han, Xiaoxiang
    Chen, Keyan
    Zhang, Weikun
    Yang, Hui
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
  • [25] DHAFormer: Dual-channel hybrid attention network with transformer for polyp segmentation
    Huang, Xuejie
    Wang, Liejun
    Jiang, Shaochen
    Xu, Lianghui
    PLOS ONE, 2024, 19 (07):
  • [26] MFH-Net: A Hybrid CNN-Transformer Network Based Multi-Scale Fusion for Medical Image Segmentation
    Wang, Ying
    Zhang, Meng
    Liang, Jian'an
    Liang, Meiyan
    International Journal of Imaging Systems and Technology, 2024, 34 (06)
  • [27] EEG classification algorithm of motor imagery based on CNN-Transformer fusion network
    Liu, Haofeng
    Liu, Yuefeng
    Wang, Yue
    Liu, Bo
    Bao, Xiang
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1302 - 1309
  • [28] Semantic segmentation of terrace image regions based on lightweight CNN-Transformer hybrid networks
    Liu X.
    Yi S.
    Li L.
    Cheng X.
    Wang C.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (13): : 171 - 181
  • [29] A Parkinson's disease-related nuclei segmentation network based on CNN-Transformer interleaved encoder with feature fusion
    Chen, Hongyi
    Fu, Junyan
    Liu, Xiao
    Zheng, Zhiji
    Luo, Xiao
    Zhou, Kun
    Xu, Zhijian
    Geng, Daoying
    Computerized Medical Imaging and Graphics, 2024, 118
  • [30] STA-Former: enhancing medical image segmentation with Shrinkage Triplet Attention in a hybrid CNN-Transformer model
    Liu, Yuzhao
    Han, Liming
    Yao, Bin
    Li, Qing
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1901 - 1910