Advancing 3D medical image analysis with variable dimension transform based supervised 3D pre-training

被引:2
|
作者
Zhang, Shu [1 ]
Li, Zihao [1 ]
Zhou, Hong-Yu [2 ]
Ma, Jiechao [1 ]
Yu, Yizhou [1 ,2 ]
机构
[1] Deepwise Artificial Intelligence Lab, 8 Haidian Ave, Beijing, Peoples R China
[2] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
3D medical image; Transfer learning; Variable dimension transform; Supervised pre-training; CT; NETWORK;
D O I
10.1016/j.neucom.2023.01.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The difficulties in both data acquisition and annotation substantially restrict the sample sizes of training datasets for 3D medical imaging applications. Therefore, it is non-trivial to build well-performing 3D con-volutional neural networks from scratch. Previous efforts on 3D pre-training have frequently relied on self-supervised approaches, which use either predictive or contrastive learning on unlabeled data to build invariant 3D representations. However, because of the unavailability of large-scale supervision informa-tion, obtaining semantically invariant and discriminative representations from these learning frame-works remains problematic. In this paper, we revisit an innovative yet simple fully-supervised 3D network pre-training framework to take advantage of semantic supervision from large-scale 2D natural image datasets. With a redesigned 3D network architecture, reformulated natural images are used to address the problem of data scarcity and develop powerful 3D representations. Comprehensive experi-ments on five benchmark datasets demonstrate that the proposed pre-trained models can effectively accelerate convergence while also improving accuracy for a variety of 3D medical imaging tasks such as classification, segmentation, and detection. In addition, as compared to training from scratch, it can save up to 60% of annotation efforts. On the NIH DeepLesion dataset, it also achieves state-of-the-art detection performance, outperforming earlier self-supervised and fully-supervised pre-training approaches, as well as methods that do training from scratch. To facilitate further development of 3D medical models, our code and pre-trained model weights are publicly available at https://github.com/u rmagicsmine/CSPR. (c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:11 / 22
页数:12
相关论文
共 50 条
  • [21] Masked Autoencoder for Pre-Training on 3D Point Cloud Object Detection
    Xie, Guangda
    Li, Yang
    Qu, Hongquan
    Sun, Zaiming
    MATHEMATICS, 2022, 10 (19)
  • [22] A PRE-TRAINING METHOD FOR 3D BUILDING POINT CLOUD SEMANTIC SEGMENTATION
    Cao, Yuwei
    Scaioni, Marco
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 219 - 226
  • [23] ECO-3D: Equivariant Contrastive Learning for Pre-training on Perturbed 3D Point Cloud
    Wang, Ruibin
    Ying, Xianghua
    Xing, Bowei
    Yang, Jinfa
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2626 - 2634
  • [24] Analysis and display of 3D medical image data
    Luo, L.
    Xie, X.
    Chinese Journal of Biomedical Engineering, 1995, 14 (02): : 113 - 115
  • [25] Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
    Hou, Ji
    Dai, Xiaoliang
    He, Zijian
    Dai, Angela
    Niessner, Matthias
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13510 - 13519
  • [26] PointVST: Self-Supervised Pre-Training for 3D Point Clouds via View-Specific Point-to-Image Translation
    Zhang, Qijian
    Hou, Junhui
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (10) : 6900 - 6912
  • [27] 3D Model Retrieval Based on 3D Discrete Cosine Transform
    Lmaati, Elmustapha Ait
    El Oirrak, Ahmed
    Kaddioui, Mohamaed Najib
    Ouahman, Abdellah Ait
    Sadgal, Mohammed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2010, 7 (03) : 264 - 270
  • [28] 3D model search and retrieval based on the 3D Radon Transform
    Zarpalas, D
    Daras, P
    Tzovaras, D
    Strintzis, MG
    2004 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-7, 2004, : 1375 - 1379
  • [29] PRE-PROCESSING OF HOLOSCOPIC 3D IMAGE FOR AUTOSTEREOSCOPIC 3D DISPLAYS
    Swash, M. R.
    Aggoun, A.
    Abdulfatah, O.
    Li, B.
    Fernandez, J. C.
    Alazawi, E.
    Tsekleves, E.
    2013 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2013,
  • [30] Dual stream fusion module integrating 2D and 3D for 3D semi-supervised medical image segmentation
    Zhiquan He
    Yating Ouyang
    Yang Wen
    Signal, Image and Video Processing, 2025, 19 (4)