HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation

被引:4
|
作者
Fu, Bangkang [1 ,2 ]
Peng, Yunsong [2 ]
He, Junjie [2 ]
Tian, Chong [2 ]
Sun, Xinhuan [2 ]
Wang, Rongpin [2 ,3 ]
机构
[1] Guizhou Univ, Med Coll, Guiyang 550000, Guizhou, Peoples R China
[2] Guizhou Prov Peoples Hosp, Dept Radiol, Key Lab Intelligent Med Imaging Anal & Accurate D, Int Exemplary Cooperat Base Precis Imaging Diag &, Guiyang 550002, Peoples R China
[3] Guizhou Prov Peoples Hosp, Dept Med Imaging, Int Exemplary Cooperat Base Precis Imaging Diag &, 83 Zhongshan East Rd, Guiyang 550002, Guizhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -scale features; Transformer; U; -net; Medical image segmentation; Convolution neural network;
D O I
10.1016/j.compbiomed.2024.108013
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate medical image segmentation is of great significance for subsequent diagnosis and analysis. The acquisition of multi-scale information plays an important role in segmenting regions of interest of different sizes. With the emergence of Transformers, numerous networks adopted hybrid structures incorporating Transformers and CNNs to learn multi-scale information. However, the majority of research has focused on the design and composition of CNN and Transformer structures, neglecting the inconsistencies in feature learning between Transformer and CNN. This oversight has resulted in the hybrid network's performance not being fully realized. In this work, we proposed a novel hybrid multi-scale segmentation network named HmsU-Net, which effectively fused multi-scale features. Specifically, HmsU-Net employed a parallel design incorporating both CNN and Transformer architectures. To address the inconsistency in feature learning between CNN and Transformer within the same stage, we proposed the multi-scale feature fusion module. For feature fusion across different stages, we introduced the cross-attention module. Comprehensive experiments conducted on various datasets demonstrate that our approach surpasses current state-of-the-art methods.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [21] WTransU-Net: Wiener deconvolution meets multi-scale transformer-based U-net for image deblurring
    Shixin Zhao
    Yuanxiu Xing
    Hongyang Xu
    Signal, Image and Video Processing, 2023, 17 : 4265 - 4273
  • [22] WTransU-Net: Wiener deconvolution meets multi-scale transformer-based U-net for image deblurring
    Zhao, Shixin
    Xing, Yuanxiu
    Xu, Hongyang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4265 - 4273
  • [23] U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
    Jia, Xi
    Bartlett, Joseph
    Zhang, Tianyang
    Lu, Wenqi
    Qiu, Zhaowen
    Duan, Jinming
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 151 - 160
  • [24] Medical Image Segmentation based on U-Net: A Review
    Du, Getao
    Cao, Xu
    Liang, Jimin
    Chen, Xueli
    Zhan, Yonghua
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2020, 64 (02)
  • [25] An Automatic Nuclei Image Segmentation Based on Multi-Scale Split-Attention U-Net
    Xu, Qing
    Duan, Wenting
    MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 236 - 245
  • [26] Multi-Scale Semantic Segmentation for Fire Smoke Image Based on Global Information and U-Net
    Zheng, Yuanpan
    Wang, Zhenyu
    Xu, Boyang
    Niu, Yiqing
    ELECTRONICS, 2022, 11 (17)
  • [27] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [28] A multi-scale large kernel attention with U-Net for medical image registration
    Chen, Yilin
    Hu, Xin
    Lu, Tao
    Zou, Lu
    Liao, Xiangyun
    Journal of Supercomputing, 2025, 81 (01):
  • [29] Multi-Scale Fusion U-Net for the Segmentation of Breast Lesions
    Li, Jingyao
    Cheng, Lianglun
    Xia, Tingjian
    Ni, Haomin
    Li, Jiao
    IEEE ACCESS, 2021, 9 : 137125 - 137139
  • [30] EU-net: An automated CNN based ebola U-net model for efficient medical image segmentation
    Rayachoti, Eswaraiah
    Vedantham, Ramachandran
    Gundabatini, Sanjay Gandhi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74323 - 74347