vox2vec: A Framework for Self-supervised Contrastive Learning of Voxel-Level Representations in Medical Images

被引：5

作者：

Goncharov, Mikhail ^{[1
]}

Soboleva, Vera ^{[2
]}

Kurmukov, Anvar ^{[3
]}

Pisov, Maxim ^{[4
]}

Belyaev, Mikhail ^{[1
,3
]}

机构：

[1] Skolkovo Inst Sci & Technol, Moscow, Russia

[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia

[3] Inst Informat Transmiss Problems, Moscow, Russia

[4] IRA Labs, Moscow, Russia

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I | 2023年 / 14220卷

基金：

俄罗斯科学基金会;

关键词：

Contrastive Self-Supervised Representation Learning; Medical Image Segmentation;

D O I：

10.1007/978-3-031-43907-0_58

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper introduces vox2vec - a contrastive method for self-supervised learning (SSL) of voxel-level representations. vox2vec representations are modeled by a Feature Pyramid Network (FPN): a voxel representation is a concatenation of the corresponding feature vectors from different pyramid levels. The FPN is pre-trained to produce similar representations for the same voxel in different augmented contexts and distinctive representations for different voxels. This results in unified multi-scale representations that capture both global semantics (e.g., body part) and local semantics (e.g., different small organs or healthy versus tumor tissue). We use vox2vec to pre-train a FPN on more than 6500 publicly available computed tomography images. We evaluate the pre-trained representations by attaching simple heads on top of them and training the resulting models for 22 segmentation tasks. We show that vox2vec outperforms existing medical imaging SSL techniques in three evaluation setups: linear and non-linear probing and end-to-end fine-tuning. Moreover, a non-linear head trained on top of the frozen vox2vec representations achieves competitive performance with the FPN trained from scratch while having 50 times fewer trainable parameters. The code is available at https://github.com/mishgon/vox2vec.

引用

页码：605 / 614

页数：10

共 50 条

[21] Self-Supervised Contrastive Learning for Medical Time Series: A Systematic Review
Liu, Ziyu
Alavi, Azadeh
Li, Minyi
Zhang, Xiang
SENSORS, 2023, 23 (09)
[22] ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with Genetics
Taleb, Aiham
Kirchler, Matthias
Monti, Remo
Lippert, Christoph
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20876 - 20889
[23] Self-supervised Graph-level Representation Learning with Adversarial Contrastive Learning
Luo, Xiao
Ju, Wei
Gu, Yiyang
Mao, Zhengyang
Liu, Luchen
Yuan, Yuhui
Zhang, Ming
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (02)
[24] Multi-level Contrastive Learning for Self-Supervised Vision Transformers
Mo, Shentong
Sun, Zhun
Li, Chao
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2777 - 2786
[25] data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Baevski, Alexei
Hsu, Wei-Ning
Xu, Qiantong
Babu, Arun
Gu, Jiatao
Auli, Michael
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[26] SpatialScene2Vec: A self-supervised contrastive representation learning method for spatial scene similarity evaluation
Guo, Danhuai
Yu, Yingxue
Ge, Shiyin
Gao, Song
Mai, Gengchen
Chen, Huixuan
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 128
[27] Classification of Ground-Based Cloud Images by Contrastive Self-Supervised Learning
Lv, Qi
Li, Qian
Chen, Kai
Lu, Yao
Wang, Liwen
REMOTE SENSING, 2022, 14 (22)
[28] Contrastive self-supervised representation learning framework for metal surface defect detection
Mahe Zabin
Anika Nahian Binte Kabir
Muhammad Khubayeeb Kabir
Ho-Jin Choi
Jia Uddin
Journal of Big Data, 10
[29] Contrastive self-supervised representation learning framework for metal surface defect detection
Zabin, Mahe
Kabir, Anika Nahian Binte
Kabir, Muhammad Khubayeeb
Choi, Ho-Jin
Uddin, Jia
JOURNAL OF BIG DATA, 2023, 10 (01)
[30] An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
Fang, Hongchao
Xie, Pengtao
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1324 - 1340

← 1 2 3 4 5 →