MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics

被引:0
|
作者
Yang, Qiushi [1 ]
Li, Wuyang [1 ]
Li, Baopu
Yuan, Yixuan [1 ,2 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
关键词
D O I
10.1109/ICCV51070.2023.01961
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern deep learning techniques on automatic multi-modal medical diagnosis rely on massive expert annotations, which is time-consuming and prohibitive. Recent masked image modeling (MIM)-based pre-training methods have witnessed impressive advances for learning meaningful representations from unlabeled data and transferring to downstream tasks. However, these methods focus on natural images and ignore the specific properties of medical data, yielding unsatisfying generalization performance on downstream medical diagnosis. In this paper, we aim to leverage genetics to boost image pre-training and present a masked relation modeling (MRM) framework. Instead of explicitly masking input data in previous MIM methods leading to loss of disease-related semantics, we design relation masking to mask out token-wise feature relation in both self- and cross-modality levels, which preserves intact semantics within the input and allows the model to learn rich disease-related information. Moreover, to enhance semantic relation modeling, we propose relation matching to align the sample-wise relation between the intact and masked features. The relation matching exploits inter-sample relation by encouraging global constraints in the feature space to render sufficient semantic relation for feature representation. Extensive experiments demonstrate that the proposed framework is simple yet powerful, achieving state-of-the-art transfer performance on various downstream diagnosis tasks. Codes are available at https://github. com/ CityU-AIM-Group/MRM.
引用
收藏
页码:21395 / 21405
页数:11
相关论文
共 50 条
  • [1] MimCo: Masked Image Modeling Pre-training with Contrastive Teacher
    Zhou, Qiang
    Yu, Chaohui
    Luo, Hao
    Wang, Zhibin
    Li, Hao
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4487 - 4495
  • [2] Hybrid Pre-training Based on Masked Autoencoders for Medical Image Segmentation
    Han, Yufei
    Chen, Haoyuan
    Xu, Pin
    Li, Yanyi
    Li, Kuan
    Yin, Jianping
    [J]. THEORETICAL COMPUTER SCIENCE, NCTCS 2022, 2022, 1693 : 175 - 182
  • [3] SELF PRE-TRAINING WITH MASKED AUTOENCODERS FOR MEDICAL IMAGE CLASSIFICATION AND SEGMENTATION
    Zhou, Lei
    Liu, Huidong
    Bae, Joseph
    He, Junjun
    Samaras, Dimitris
    Prasanna, Prateek
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [4] Masked Channel Modeling for Bootstrapping Visual Pre-training
    Liu, Yang
    Wang, Xinlong
    Zhu, Muzhi
    Cao, Yue
    Huang, Tiejun
    Shen, Chunhua
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [5] GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling
    Qi, Liangce
    Jiang, Zhengang
    Shi, Weili
    Qu, Feng
    Feng, Guanyuan
    [J]. Computers in Biology and Medicine, 2024, 176
  • [6] On Masked Pre-training and the Marginal Likelihood
    Moreno-Munoz, Pablo
    Recasens, Pol G.
    Hauberg, Soren
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework
    Niizumi, Daisuke
    Takeuchi, Daiki
    Ohishi, Yasunori
    Harada, Noboru
    Kashino, Kunio
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2391 - 2406
  • [8] CXRMIM: MASKED IMAGE MODELING PRE-TRAINING PARADIGM FOR CHEST X-RAY IMAGES ANALYSIS
    Wang, Zhendong
    Ma, Haowen
    Niu, Jianwei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2250 - 2254
  • [9] SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
    Dong, Jiaxiang
    Wu, Haixu
    Zhang, Haoran
    Zhang, Li
    Wang, Jianmin
    Long, Mingsheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Pre-training on Grayscale ImageNet Improves Medical Image Classification
    Xie, Yiting
    Richmond, David
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 : 476 - 484