vox2vec: A Framework for Self-supervised Contrastive Learning of Voxel-Level Representations in Medical Images

被引:5
|
作者
Goncharov, Mikhail [1 ]
Soboleva, Vera [2 ]
Kurmukov, Anvar [3 ]
Pisov, Maxim [4 ]
Belyaev, Mikhail [1 ,3 ]
机构
[1] Skolkovo Inst Sci & Technol, Moscow, Russia
[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia
[3] Inst Informat Transmiss Problems, Moscow, Russia
[4] IRA Labs, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
Contrastive Self-Supervised Representation Learning; Medical Image Segmentation;
D O I
10.1007/978-3-031-43907-0_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces vox2vec - a contrastive method for self-supervised learning (SSL) of voxel-level representations. vox2vec representations are modeled by a Feature Pyramid Network (FPN): a voxel representation is a concatenation of the corresponding feature vectors from different pyramid levels. The FPN is pre-trained to produce similar representations for the same voxel in different augmented contexts and distinctive representations for different voxels. This results in unified multi-scale representations that capture both global semantics (e.g., body part) and local semantics (e.g., different small organs or healthy versus tumor tissue). We use vox2vec to pre-train a FPN on more than 6500 publicly available computed tomography images. We evaluate the pre-trained representations by attaching simple heads on top of them and training the resulting models for 22 segmentation tasks. We show that vox2vec outperforms existing medical imaging SSL techniques in three evaluation setups: linear and non-linear probing and end-to-end fine-tuning. Moreover, a non-linear head trained on top of the frozen vox2vec representations achieves competitive performance with the FPN trained from scratch while having 50 times fewer trainable parameters. The code is available at https://github.com/mishgon/vox2vec.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 50 条
  • [21] Self-Supervised Contrastive Learning for Medical Time Series: A Systematic Review
    Liu, Ziyu
    Alavi, Azadeh
    Li, Minyi
    Zhang, Xiang
    SENSORS, 2023, 23 (09)
  • [22] ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with Genetics
    Taleb, Aiham
    Kirchler, Matthias
    Monti, Remo
    Lippert, Christoph
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20876 - 20889
  • [23] Self-supervised Graph-level Representation Learning with Adversarial Contrastive Learning
    Luo, Xiao
    Ju, Wei
    Gu, Yiyang
    Mao, Zhengyang
    Liu, Luchen
    Yuan, Yuhui
    Zhang, Ming
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (02)
  • [24] Multi-level Contrastive Learning for Self-Supervised Vision Transformers
    Mo, Shentong
    Sun, Zhun
    Li, Chao
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2777 - 2786
  • [25] data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
    Baevski, Alexei
    Hsu, Wei-Ning
    Xu, Qiantong
    Babu, Arun
    Gu, Jiatao
    Auli, Michael
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [26] SpatialScene2Vec: A self-supervised contrastive representation learning method for spatial scene similarity evaluation
    Guo, Danhuai
    Yu, Yingxue
    Ge, Shiyin
    Gao, Song
    Mai, Gengchen
    Chen, Huixuan
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 128
  • [27] Classification of Ground-Based Cloud Images by Contrastive Self-Supervised Learning
    Lv, Qi
    Li, Qian
    Chen, Kai
    Lu, Yao
    Wang, Liwen
    REMOTE SENSING, 2022, 14 (22)
  • [28] Contrastive self-supervised representation learning framework for metal surface defect detection
    Mahe Zabin
    Anika Nahian Binte Kabir
    Muhammad Khubayeeb Kabir
    Ho-Jin Choi
    Jia Uddin
    Journal of Big Data, 10
  • [29] Contrastive self-supervised representation learning framework for metal surface defect detection
    Zabin, Mahe
    Kabir, Anika Nahian Binte
    Kabir, Muhammad Khubayeeb
    Choi, Ho-Jin
    Uddin, Jia
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [30] An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
    Fang, Hongchao
    Xie, Pengtao
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1324 - 1340