Learning What and Where to Learn: A New Perspective on Self-supervised Learning

被引:1
|
作者
Zhao W. [1 ]
Yang L. [1 ]
Zhang W. [2 ]
Tian Y. [2 ]
Jia W. [1 ]
Li W. [1 ]
Yang M. [3 ]
Pan X. [4 ]
Yang H. [1 ]
机构
[1] School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing
[2] School of Information Engineering, Henan Institute of Science and Technology, Xinxiang
[3] Techmach (Beijing) Industrial Technology Co. Ltd, Beijing
[4] School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin
关键词
Computational modeling; Efficient framework; Feature extraction; Learning what; Learning where; Optimization; Positional information; Self-supervised learning; Semantics; Task analysis; Training;
D O I
10.1109/TCSVT.2023.3298937
中图分类号
学科分类号
摘要
Self-supervised learning (SSL) has demonstrated its power in generalized model acquisition by leveraging the discriminative semantic and explicit positional information of unlabeled datasets. Unfortunately, mainstream contrastive learning-based methods excessive focus on semantic information and ignore the position is also the carrier of image content, resulting in inadequate data utilization and extensive computational consumption. To address these issues, we present an efficient SSL framework, learning What and Where to learn (W2SSL), to aggregate semantic and position features. Concretely, we devise a spatially-coupled sampling manner to process images through pre-defined rules, which integrates the advantage of semantic (What) and positional (Where) features into framework to enrich the diversity of feature representation capabilities and improve data utilization. Besides, a spectrum of latent vectors is obtained by mapping the positional features, which implicitly explores the relationship between these vectors. Whereafter, the corresponding discriminative and contrastive optimization objectives are seamlessly embedded in the framework via a cascade paradigm to explore semantic and positional features. The proposed W2SSL is verified on different types of datasets, which demonstrates that it still outperforms state-of-the-art SSL methods even with half the computational consumption. Code will be available at https://github.com/WilyZhao8/W2SSL. IEEE
引用
收藏
页码:1 / 1
相关论文
共 50 条
  • [1] Learning Where to Learn in Cross-View Self-Supervised Learning
    Huang, Lang
    You, Shan
    Zheng, Mingkai
    Wang, Fei
    Qian, Chen
    Yamasaki, Toshihiko
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14431 - 14440
  • [2] A New Self-supervised Method for Supervised Learning
    Yang, Yuhang
    Ding, Zilin
    Cheng, Xuan
    Wang, Xiaomin
    Liu, Ming
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
  • [3] A comprehensive perspective of contrastive self-supervised learning
    Songcan CHEN
    Chuanxing GENG
    [J]. Frontiers of Computer Science, 2021, (04) : 102 - 104
  • [4] A comprehensive perspective of contrastive self-supervised learning
    Songcan Chen
    Chuanxing Geng
    [J]. Frontiers of Computer Science, 2021, 15
  • [5] A comprehensive perspective of contrastive self-supervised learning
    Chen, Songcan
    Geng, Chuanxing
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2021, 15 (04)
  • [6] What Should Be Equivariant In Self-Supervised Learning
    Xie, Yuyang
    Wen, Jianhong
    Lau, Kin Wai
    Rehman, Yasar Abbas Ur
    Shen, Jiajun
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4110 - 4119
  • [7] How and What to Learn: Taxonomizing Self-Supervised Learning for 3D Action Recognition
    Ben Tanfous, Amor
    Zerroug, Aimen
    Linsley, Drew
    Serre, Thomas
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2888 - 2897
  • [8] Inpaint2Learn: A Self-Supervised Framework for Affordance Learning
    Zhang, Lingzhi
    Du, Weiyu
    Zhou, Shenghao
    Wang, Jiancong
    Shi, Jianbo
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3778 - 3787
  • [9] Repeat and learn: Self-supervised visual representations learning by Scene Localization
    Altabrawee, Hussein
    Noor, Mohd Halim Mohd
    [J]. PATTERN RECOGNITION, 2024, 156
  • [10] Self-Supervised Dialogue Learning
    Wu, Jiawei
    Wang, Xin
    Wang, William Yang
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3857 - 3867