Dense Semantic Contrast for Self-Supervised Visual Representation Learning

被引:18
|
作者
Li, Xiaoni [1 ,2 ]
Zhou, Yu [1 ,2 ]
Zhang, Yifei [1 ,2 ]
Zhang, Aoting [1 ]
Wang, Wei [1 ,2 ]
Jiang, Ning [3 ]
Wu, Haiying [3 ]
Wang, Weiping [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Mashang Consumer Finance Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Self-Supervised Learning; Representation Learning; Contrastive; Learning; Dense Representation; Semantics Discovery;
D O I
10.1145/3474085.3475551
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised representation learning for visual pre-training has achieved remarkable success with sample (instance or pixel) discrimination and semantics discovery of instance, whereas there still exists a non-negligible gap between pre-trained model and downstream dense prediction tasks. Concretely, these downstream tasks require more accurate representation, in other words, the pixels from the same object must belong to a shared semantic category, which is lacking in the previous methods. In this work, we present Dense Semantic Contrast (DSC) for modeling semantic category decision boundaries at a dense level to meet the requirement of these tasks. Furthermore, we propose a dense cross-image semantic contrastive learning framework for multi-granularity representation learning. Specially, we explicitly explore the semantic structure of the dataset by mining relations among pixels from different perspectives. For intra-image relation modeling, we discover pixel neighbors from multiple views. And for inter-image relations, we enforce pixel representation from the same semantic class to be more similar than the representation from different classes in one mini-batch. Experimental results show that our DSC model outperforms state-of-the-art methods when transferring to downstream dense prediction tasks, including object detection, semantic segmentation, and instance segmentation. Code will be made available.
引用
收藏
页码:1368 / 1376
页数:9
相关论文
共 50 条
  • [1] Self-Supervised Visual Representation Learning with Semantic Grouping
    Wen, Xin
    Zhao, Bingchen
    Zheng, Anlin
    Zhang, Xiangyu
    Qi, Xiaojuan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Dense lead contrast for self-supervised representation learning of multilead electrocardiograms
    Liu, Wenhan
    Li, Zhoutong
    Zhang, Huaicheng
    Chang, Sheng
    Wang, Hao
    He, Jin
    Huang, Qijun
    [J]. INFORMATION SCIENCES, 2023, 634 (189-205) : 189 - 205
  • [3] Can Semantic Labels Assist Self-Supervised Visual Representation Learning?
    Wei, Longhui
    Xie, Lingxi
    He, Jianzhong
    Zhang, Xiaopeng
    Tian, Qi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2642 - 2650
  • [4] Revisiting Self-Supervised Visual Representation Learning
    Kolesnikov, Alexander
    Zhai, Xiaohua
    Beyer, Lucas
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1920 - 1929
  • [5] Self-supervised contrastive representation learning for semantic segmentation
    Liu, Bochong
    Cai, Huaiyu
    Wang, Yi
    Chen, Xiaodong
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (01): : 125 - 134
  • [6] Self-Supervised Visual Descriptor Learning for Dense Correspondence
    Schmidt, Tanner
    Newcombe, Richard
    Fox, Dieter
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02): : 420 - 427
  • [7] Mixed Autoencoder for Self-supervised Visual Representation Learning
    Chen, Kai
    Liu, Zhili
    Hong, Lanqing
    Xu, Hang
    Li, Zhenguo
    Yeung, Dit-Yan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22742 - 22751
  • [8] Scaling and Benchmarking Self-Supervised Visual Representation Learning
    Goyal, Priya
    Mahajan, Dhruv
    Gupta, Abhinav
    Misra, Ishan
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6400 - 6409
  • [9] Self-supervised representation learning by predicting visual permutations
    Zhao, Qilu
    Dong, Junyu
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 210
  • [10] Self-supervised Visual Representation Learning for Histopathological Images
    Yang, Pengshuai
    Hong, Zhiwei
    Yin, Xiaoxu
    Zhu, Chengzhan
    Jiang, Rui
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 47 - 57