EPT-Net: Edge Perception Transformer for 3D Medical Image Segmentation

被引:13
|
作者
Yang, Jingyi [1 ]
Jiao, Licheng [1 ]
Shang, Ronghua [1 ]
Liu, Xu [1 ]
Li, Ruiyang [2 ]
Xu, Longchang [2 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; convolu-tional neural networks; transformer; attention mechanism; ARCHITECTURE; ATTENTION;
D O I
10.1109/TMI.2023.3278461
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The convolutional neural network has achieved remarkable results in most medical image seg- mentation applications. However, the intrinsic locality of convolution operation has limitations in modeling the long-range dependency. Although the Transformer designed for sequence-to-sequence global prediction was born to solve this problem, it may lead to limited positioning capability due to insufficient low-level detail features. Moreover, low-level features have rich fine-grained information, which greatly impacts edge segmentation decisions of different organs. However, a simple CNN module is difficult to capture the edge information in fine-grained features, and the computational power and memory consumed in processing high-resolution 3D features are costly. This paper proposes an encoder-decoder network that effectively combines edge perception and Transformer structure to segment medical images accurately, called EPT-Net. Under this framework, this paper proposes a Dual Position Transformer to enhance the 3D spatial positioning ability effectively. In addition, as low-level features contain detailed information, we conduct an Edge Weight Guidance module to extract edge information by minimizing the edge information function without adding network parameters. Furthermore, we verified the effectiveness of the proposed method on three datasets, including SegTHOR 2019, Multi-Atlas Labeling Beyond the Cranial Vault and the re-labeled KiTS19 dataset called KiTS19-M by us. The experimental results show that EPT-Net has significantly improved compared with the state-of-the-art medical image segmentation method.
引用
收藏
页码:3229 / 3243
页数:15
相关论文
共 50 条
  • [21] Abstract: 3D Medical Image Segmentation with Transformer-based Scaling of ConvNets MedNeXt
    Roy, Saikat
    Koehler, Gregor
    Baumgartner, Michael
    Ulrich, Constantin
    Isensee, Fabian
    Jaeger, Paul F.
    Maier-Hein, Klaus
    BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 79 - 79
  • [22] Diffusion Transformer U-Net for Medical Image Segmentation
    Chowdary, G. Jignesh
    Yin, Zhaozheng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 622 - 631
  • [23] MIXED TRANSFORMER U-NET FOR MEDICAL IMAGE SEGMENTATION
    Wang, Hongyi
    Xie, Shiao
    Lin, Lanfen
    Iwamoto, Yutaro
    Han, Xian-Hua
    Chen, Yen-Wei
    Tong, Ruofeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2390 - 2394
  • [24] HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation
    Yang, Fan
    Wang, Fan
    Dong, Pengwei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [25] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387
  • [26] 3D Swin Transformer for Partial Medical Auto Segmentation
    Rangnekar, Aneesh
    Jiang, Jue
    Veeraraghavan, Harini
    FAST, LOW-RESOURCE, AND ACCURATE ORGAN AND PAN-CANCER SEGMENTATION IN ABDOMEN CT, FLARE 2023, 2024, 14544 : 222 - 235
  • [27] Dynamic Linear Transformer for 3D Biomedical Image Segmentation
    Zhang, Zheyuan
    Bagci, Ulas
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 171 - 180
  • [28] SEAformer: Selective Edge Aggregation transformer for 2D medical image segmentation
    Li, Jingwen
    Chen, Jilong
    Jiang, Lei
    Li, Ruoyu
    Han, Peilun
    Cheng, Junlong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 102
  • [29] EFFICIENT 3D TRANSFORMER WITH CLUSTER-BASED DOMAIN-ADVERSARIAL LEARNING FOR 3D MEDICAL IMAGE SEGMENTATION
    Zhang, Haoran
    Chen, Hao
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [30] nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation
    Isensee, Fabian
    Wald, Tassilo
    Ulrich, Constantin
    Baumgartner, Michael
    Roy, Saikat
    Maier-Hein, Klaus
    Jaeger, Paul F.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 488 - 498