3D CVT-GAN: A 3D Convolutional Vision Transformer-GAN for PET Reconstruction

被引:13
|
作者
Zeng, Pinxian [1 ]
Zhou, Luping [2 ]
Zu, Chen [3 ]
Zeng, Xinyi [1 ]
Jiao, Zhengyang [1 ]
Wu, Xi [4 ]
Zhou, Jiliu [1 ,4 ]
Shen, Dinggang [5 ,6 ]
Wang, Yan [1 ]
机构
[1] Sichuan Univ, Sch Comp Sci, Chengdu, Peoples R China
[2] Univ Sydney, Sch Elect & Informat Engn, Camperdown, Australia
[3] JD COM, Dept Risk Controlling Res, Beijing, Peoples R China
[4] Chengdu Univ Informat Technol, Sch Comp Sci, Chengdu, Peoples R China
[5] ShanghaiTech Univ, Sch Biomed Engn, Shanghai, Peoples R China
[6] Shanghai United Imaging Intelligence Co Ltd, Dept Res & Dev, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative adversarial network (GAN); Vision transformer; Positron emission tomography (PET); PET reconstruction;
D O I
10.1007/978-3-031-16446-0_49
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
To obtain high-quality positron emission tomography (PET) scans while reducing potential radiation hazards brought to patients, various generative adversarial network (GAN)-based methods have been developed to reconstruct high-quality standard-dose PET (SPET) images from low-dose PET (LPET) images. However, due to the intrinsic locality of convolution operator, these methods have failed to explore global contexts of the entire 3D PET image. In this paper, we propose a novel 3D convolutional vision transformer GAN framework, named 3D CVT-GAN, for SPET reconstruction using LPET images. Specifically, we innovatively design a generator with a hierarchical structure that uses multiple 3D CVT blocks as the encoder for feature extraction and also multiple 3D transposed CVT (TCVT) blocks as the decoder for SPET restoration, capturing both local spatial features and global contexts from different network layers. Different from the vanilla 2D vision transformer that uses linear embedding and projection, our 3D CVT and TCVT blocks employ 3D convolutional embedding and projection instead, allowing the model to overcome semantic ambiguity problem caused by the attention mechanism and further preserve spatial details. In addition, residual learning and a patch-based discriminator embedded with 3D CVT blocks are added inside and after the generator, facilitating the training process while mining more discriminative feature representations. Validation on the clinical PET dataset shows that our proposed 3D CVT-GAN outperforms the state-of-the-art methods qualitatively and quantitatively with minimal parameters.
引用
收藏
页码:516 / 526
页数:11
相关论文
共 50 条
  • [1] 3D Transformer-GAN for High-Quality PET Reconstruction
    Luo, Yanmei
    Wang, Yan
    Zu, Chen
    Zhan, Bo
    Wu, Xi
    Zhou, Jiliu
    Shen, Dinggang
    Zhou, Luping
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VI, 2021, 12906 : 276 - 285
  • [2] 3D multi-modality Transformer-GAN for high-quality PET reconstruction
    Wang, Yan
    Luo, Yanmei
    Zu, Chen
    Zhan, Bo
    Jiao, Zhengyang
    Wu, Xi
    Zhou, Jiliu
    Shen, Dinggang
    Zhou, Luping
    [J]. MEDICAL IMAGE ANALYSIS, 2024, 91
  • [3] 3D convolutional GAN for fast simulation
    Vallecorsa, Sofia
    Carminati, Federico
    Khattak, Gulrukh
    [J]. 23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [4] Monocular 3D Object Reconstruction with GAN Inversion
    Zhang, Junzhe
    Ren, Daxuan
    Cai, Zhongang
    Yeo, Chai Kiat
    Dai, Bo
    Loy, Chen Change
    [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 673 - 689
  • [5] 3D-porous-GAN: a high-performance 3D GAN for digital core reconstruction from a single 3D image
    Xiangchao Shi
    Dandan Li
    Junhai Chen
    Yan Chen
    [J]. Journal of Petroleum Exploration and Production Technology, 2023, 13 : 2329 - 2345
  • [6] 3D-porous-GAN: a high-performance 3D GAN for digital core reconstruction from a single 3D image
    Shi, Xiangchao
    Li, Dandan
    Chen, Junhai
    Chen, Yan
    [J]. JOURNAL OF PETROLEUM EXPLORATION AND PRODUCTION TECHNOLOGY, 2023, 13 (12) : 2329 - 2345
  • [7] A Lip Reading Method Based on 3D Convolutional Vision Transformer
    Wang, Huijuan
    Pu, Gangqiang
    Chen, Tingyu
    [J]. IEEE ACCESS, 2022, 10 : 77205 - 77212
  • [8] 3D Structural Convolutional Sparse Coding for PET Image Reconstruction
    Xie, Nuobei
    Gong, Kuang
    Guo, Ning
    Qin, Zhixin
    Wu, Zhifang
    Liu, Huafeng
    Li, Quanzheng
    [J]. JOURNAL OF NUCLEAR MEDICINE, 2020, 61
  • [9] Multi-view convolutional vision transformer for 3D object recognition
    Li, Jie
    Liu, Zhao
    Li, Li
    Lin, Junqin
    Yao, Jian
    Tu, Jingmin
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [10] 3D GAN LEDS - TECHNOLOGIES AND ANALYTICS
    Ledig, J.
    Fuendling, S.
    Popp, M.
    Steib, F.
    Hartmann, J.
    Wehmann, H. -H
    Sperling, A.
    Waag, A.
    [J]. PROCEEDINGS OF CIE EXPERT SYMPOSIUM ON THE CIE S 025 LED LAMPS, LED LUMINAIRES AND LED MODULES TEST STANDARD, 2016, : 71 - 79