EPT-Net: Edge Perception Transformer for 3D Medical Image Segmentation

被引:13
|
作者
Yang, Jingyi [1 ]
Jiao, Licheng [1 ]
Shang, Ronghua [1 ]
Liu, Xu [1 ]
Li, Ruiyang [2 ]
Xu, Longchang [2 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; convolu-tional neural networks; transformer; attention mechanism; ARCHITECTURE; ATTENTION;
D O I
10.1109/TMI.2023.3278461
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The convolutional neural network has achieved remarkable results in most medical image seg- mentation applications. However, the intrinsic locality of convolution operation has limitations in modeling the long-range dependency. Although the Transformer designed for sequence-to-sequence global prediction was born to solve this problem, it may lead to limited positioning capability due to insufficient low-level detail features. Moreover, low-level features have rich fine-grained information, which greatly impacts edge segmentation decisions of different organs. However, a simple CNN module is difficult to capture the edge information in fine-grained features, and the computational power and memory consumed in processing high-resolution 3D features are costly. This paper proposes an encoder-decoder network that effectively combines edge perception and Transformer structure to segment medical images accurately, called EPT-Net. Under this framework, this paper proposes a Dual Position Transformer to enhance the 3D spatial positioning ability effectively. In addition, as low-level features contain detailed information, we conduct an Edge Weight Guidance module to extract edge information by minimizing the edge information function without adding network parameters. Furthermore, we verified the effectiveness of the proposed method on three datasets, including SegTHOR 2019, Multi-Atlas Labeling Beyond the Cranial Vault and the re-labeled KiTS19 dataset called KiTS19-M by us. The experimental results show that EPT-Net has significantly improved compared with the state-of-the-art medical image segmentation method.
引用
收藏
页码:3229 / 3243
页数:15
相关论文
共 50 条
  • [1] TT-Net: Tensorized Transformer Network for 3D medical image segmentation
    Wang, Jing
    Qu, Aixi
    Wang, Qing
    Zhao, Qibin
    Liu, Ju
    Wu, Qiang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107
  • [2] Efficient combined algorithm of Transformer and U-Net for 3D medical image segmentation
    Zhang, Mingyan
    Wang, Aixia
    Yang, Gang
    Li, Jingjiao
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4377 - 4382
  • [3] 3D bi-directional transformer U-Net for medical image segmentation
    Fu, Xiyao
    Sun, Zhexian
    Tang, Haoteng
    Zou, Eric M.
    Huang, Heng
    Wang, Yong
    Zhan, Liang
    FRONTIERS IN BIG DATA, 2023, 5
  • [4] FATUnetr:fully attention Transformer for 3D medical image segmentation
    Li, QingFeng
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1415 - 1419
  • [5] nnFormer: Volumetric Medical Image Segmentation via a 3D Transformer
    Zhou, Hong-Yu
    Guo, Jiansen
    Zhang, Yinghao
    Han, Xiaoguang
    Yu, Lequan
    Wang, Liansheng
    Yu, Yizhou
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4036 - 4045
  • [6] Medical Image Segmentation Based on 3D U-net
    Chen, Silu
    Hu, Guanghao
    Sun, Jun
    2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, : 130 - 133
  • [7] A 3D Medical Image Segmentation Framework Fusing Convolution and Transformer Features
    Zhu, Fazhan
    Lv, Jiaxing
    Lu, Kun
    Wang, Wenyan
    Cong, Hongshou
    Zhang, Jun
    Chen, Peng
    Zhao, Yuan
    Wu, Ziheng
    INTELLIGENT COMPUTING THEORIES AND APPLICATION (ICIC 2022), PT I, 2022, 13393 : 772 - 786
  • [8] MCRformer: Morphological constraint reticular transformer for 3D medical image segmentation
    Li, Jun
    Chen, Nan
    Zhou, Han
    Lai, Taotao
    Dong, Heng
    Feng, Chunhui
    Chen, Riqing
    Yang, Changcai
    Cai, Fanggang
    Wei, Lifang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [9] DAST: Differentiable Architecture Search with Transformer for 3D Medical Image Segmentation
    Yang, Dong
    Xu, Ziyue
    He, Yufan
    Nath, Vishwesh
    Li, Wenqi
    Myronenko, Andriy
    Hatamizadeh, Ali
    Zhao, Can
    Roth, Holger R.
    Xu, Daguang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 747 - 756
  • [10] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861