vox2vec: A Framework for Self-supervised Contrastive Learning of Voxel-Level Representations in Medical Images

被引:5
|
作者
Goncharov, Mikhail [1 ]
Soboleva, Vera [2 ]
Kurmukov, Anvar [3 ]
Pisov, Maxim [4 ]
Belyaev, Mikhail [1 ,3 ]
机构
[1] Skolkovo Inst Sci & Technol, Moscow, Russia
[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia
[3] Inst Informat Transmiss Problems, Moscow, Russia
[4] IRA Labs, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
Contrastive Self-Supervised Representation Learning; Medical Image Segmentation;
D O I
10.1007/978-3-031-43907-0_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces vox2vec - a contrastive method for self-supervised learning (SSL) of voxel-level representations. vox2vec representations are modeled by a Feature Pyramid Network (FPN): a voxel representation is a concatenation of the corresponding feature vectors from different pyramid levels. The FPN is pre-trained to produce similar representations for the same voxel in different augmented contexts and distinctive representations for different voxels. This results in unified multi-scale representations that capture both global semantics (e.g., body part) and local semantics (e.g., different small organs or healthy versus tumor tissue). We use vox2vec to pre-train a FPN on more than 6500 publicly available computed tomography images. We evaluate the pre-trained representations by attaching simple heads on top of them and training the resulting models for 22 segmentation tasks. We show that vox2vec outperforms existing medical imaging SSL techniques in three evaluation setups: linear and non-linear probing and end-to-end fine-tuning. Moreover, a non-linear head trained on top of the frozen vox2vec representations achieves competitive performance with the FPN trained from scratch while having 50 times fewer trainable parameters. The code is available at https://github.com/mishgon/vox2vec.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 50 条
  • [31] TimeCLR: A self-supervised contrastive learning framework for univariate time series representation
    Yang, Xinyu
    Zhang, Zhenguo
    Cui, Rongyi
    KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [32] Self-Supervised Learning of Remote Sensing Scene Representations Using Contrastive Multiview Coding
    Stojnic, Vladan
    Risojevic, Vladimir
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1182 - 1191
  • [33] Self-Supervised Spectral-Level Contrastive Learning for Hyperspectral Target Detection
    Wang, Yulei
    Chen, Xi
    Zhao, Enyu
    Song, Meiping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [34] Nuc2Vec: Learning Representations of Nuclei in Histopathology Images with Contrastive Loss
    Feng, Chao
    Vanderbilt, Chad
    Fuchs, Thomas J.
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 143, 2021, 143 : 179 - 189
  • [35] Semantic segmentation algorithm for foggy cityscapes images by fusing self-supervised contrastive learning
    Liu, Liwei
    Wang, Rui
    Meng, Xutao
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (07) : 990 - 1000
  • [36] Self-Supervised Contrastive Learning for Automated Segmentation of Brain Tumor MRI Images in Schizophrenia
    Meng, Lingmiao
    Zhao, Liwei
    Yi, Xin
    Yu, Qingming
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [37] Generating Views Using Atmospheric Correction for Contrastive Self-Supervised Learning of Multispectral Images
    Patnala, Ankit
    Stadtler, Scarlet
    Schultz, Martin G. G.
    Gall, Juergen
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] Point2Vec for Self-supervised Representation Learning on Point Clouds
    Abou Zeid, Karim
    Schult, Jonas
    Hermans, Alexander
    Leibe, Bastian
    PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 131 - 146
  • [39] A MULTI-TASK SELF-SUPERVISED LEARNING FRAMEWORK FOR SCOPY IMAGES
    Li, Yuexiang
    Chen, Jiawei
    Zheng, Yefeng
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 2005 - 2009
  • [40] Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework
    Tao, Li
    Wang, Xueting
    Yamasaki, Toshihiko
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2193 - 2201