Parallel Transformer-CNN Model for Medical Image Segmentation

被引:1
|
作者
Zhou, Mingkun [1 ]
Nie, Xueyun [1 ]
Liu, Yuhang [1 ]
Li, Doudou [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Sch Artificial Intelligence, Chongqing, Peoples R China
[2] Shanghai Int Studies Univ, Sch Business & Management, Shanghai, Peoples R China
关键词
component; medical image segmentation; parallel encoder; hierarchical feature fusion(HFF) block; lightweight decoder;
D O I
10.1109/ICCEA62105.2024.10603650
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, methods based on deep learning have made significant progress in the field of medical image segmentation, especially the method combining Transformer with Convolutional Neural Network (CNN). However, the existing hybrid models usually stack the Transformer module after convolutional layers, forming a serial structure. In this structure, CNN is first responsible for extracting the local features of the image, and then these features are passed into Transformer for extracting global information. Although this design combines the advantages of both to some extent, there is a problem in this serial structure that Transformer may not fully utilize the local features extracted by CNN when integrating global information, resulting in the loss of information in the transmission process, affecting the accuracy of segmentation. In addition, medical equipment often has certain limitations, and the number of model parameters cannot be too large. Therefore, in response to the mentioned problems, we present a novel model that combines Transformer and CNN in a parallel manner; a Hierarchical Feature Fusion (HFF) block is specially designed to effectively merge feature information from the two branches; A lightweight decoder is proposed, this decoder not only integrates the different scale features from the encoder, but also reduces the number of model parameters and improves training efficiency.
引用
收藏
页码:1048 / 1051
页数:4
相关论文
共 50 条
  • [1] TCI-UNet: transformer-CNN interactive module for medical image segmentation
    Bian, Xuan
    Wang, Guanglei
    Li, Yan
    Wang, Hongrui
    BIOMEDICAL OPTICS EXPRESS, 2023, 14 (11) : 5904 - 5920
  • [2] Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation
    Li, Shijie
    Gong, Yu
    Xiang, Qingyuan
    Li, Zheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 133 - 147
  • [3] A transformer-CNN parallel network for image guided depth completion
    Li, Tao
    Dong, Xiucheng
    Lin, Jie
    Peng, Yonghong
    PATTERN RECOGNITION, 2024, 150
  • [4] CSegNet: a hybrid transformer-CNN network for road crack image segmentation
    Dong, Hao
    Du, Yinlai
    Feng, Dong
    Hu, Qingyuan
    Zhou, Mingzhu
    Xing, Jun
    Zhang, Long
    Wang, Shu
    Liu, Yong
    INSIGHT, 2024, 66 (12) : 737 - 746
  • [5] HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation
    Jiang, Qingxin
    Fan, Ying
    Li, Menghan
    Fang, Sheng
    Zhu, Weifang
    Xiang, Dehui
    Peng, Tao
    Chen, Xinjian
    Xu, Xun
    Shi, Fei
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (11): : 6156 - 6170
  • [6] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    Jianfei He
    Canhui Xu
    Applied Intelligence, 2023, 53 : 28542 - 28554
  • [7] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    He, Jianfei
    Xu, Canhui
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554
  • [8] SmokeSeger: A Transformer-CNN coupled model for urban scene smoke segmentation
    Jing, Tao
    Meng, Qing-Hao
    Hou, Hui-Rang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 1385 - 1396
  • [9] Hybrid transformer-CNN and LSTM model for lung disease segmentation and classification
    Shafi, Syed Mohammed
    Chinnappan, Sathiya Kumar
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [10] A transformer-CNN for deep image inpainting forensics
    Zhu, Xinshan
    Lu, Junyan
    Ren, Honghao
    Wang, Hongquan
    Sun, Biao
    VISUAL COMPUTER, 2023, 39 (10): : 4721 - 4735