Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond

被引:2
|
作者
Hsieh, Cheng-Yen [1 ]
Chang, Chih-Jung [1 ]
Yang, Fu-En [1 ]
Wang, Yu-Chiang Frank [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
关键词
D O I
10.1109/WACV56688.2023.00272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While self-supervised learning has been shown to benefit a number of vision tasks, existing techniques mainly focus on image-level manipulation, which may not generalize well to downstream tasks at patch or pixel levels. Moreover, existing SSL methods might not sufficiently describe and associate the above representations within and across image scales. In this paper, we propose a Self-Supervised Pyramid Representation Learning (SS-PRL) framework. The proposed SS-PRL is designed to derive pyramid representations at patch levels via learning proper prototypes, with additional learners to observe and relate inherent semantic information within an image. In particular, we present a cross-scale patch-level correlation learning in SS-PRL, which allows the model to aggregate and associate information learned across patch scales. We show that, with our proposed SS-PRL for model pre-training, one can easily adapt and fine-tune the models for a variety of applications including multi-label classification, object detection, and instance segmentation.
引用
收藏
页码:2695 / 2704
页数:10
相关论文
共 50 条
  • [1] Self-Supervised Multi-Label Transformation Prediction for Video Representation Learning
    Assefa, Maregu
    Jiang, Wei
    Yilma, Getinet
    Kumeda, Bulbula
    Ayalew, Melese
    Seid, Mohammed
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (09)
  • [2] Multi-Label Self-Supervised Learning with Scene Images
    Zhu, Ke
    Fu, Minghao
    Wu, Jianxin
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6671 - 6680
  • [3] OBJECT-AWARE SELF-SUPERVISED MULTI-LABEL LEARNING
    Xu Kaixin
    Liu Liyang
    Zhao Ziyuan
    Zeng, Zeng
    Veeravalli, Bharadwaj
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 361 - 365
  • [4] Self-supervised Label-Visual Correlation Hashing for Multi-label Image Retrieval
    Liu, Yu
    Xie, Yanzhao
    Song, Jingkuan
    Wei, Rukai
    Zhou, Ke
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 129 - 143
  • [5] Supervised representation learning for multi-label classification
    Ming Huang
    Fuzhen Zhuang
    Xiao Zhang
    Xiang Ao
    Zhengyu Niu
    Min-Ling Zhang
    Qing He
    [J]. Machine Learning, 2019, 108 : 747 - 763
  • [6] Supervised representation learning for multi-label classification
    Huang, Ming
    Zhuang, Fuzhen
    Zhang, Xiao
    Ao, Xiang
    Niu, Zhengyu
    Zhang, Min-Ling
    He, Qing
    [J]. MACHINE LEARNING, 2019, 108 (05) : 747 - 763
  • [7] Revisiting Self-Supervised Visual Representation Learning
    Kolesnikov, Alexander
    Zhai, Xiaohua
    Beyer, Lucas
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1920 - 1929
  • [8] MULTI-AUGMENTATION FOR EFFICIENT SELF-SUPERVISED VISUAL REPRESENTATION LEARNING
    Tran, Van Nhiem
    Huang, Chi-En
    Liu, Shen-Hsuan
    Yang, Kai-Lin
    Ko, Timothy
    Li, Yung-Hui
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
  • [9] Multi-Label Classification of Fundus Images With Graph Convolutional Network and Self-Supervised Learning
    Lin, Jinke
    Cai, Qingling
    Lin, Manying
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 454 - 458
  • [10] Multi-view and multi-augmentation for self-supervised visual representation learning
    Tran, Van Nhiem
    Huang, Chi-En
    Liu, Shen-Hsuan
    Aslam, Muhammad Saqlain
    Yang, Kai-Lin
    Li, Yung-Hui
    Wang, Jia-Ching
    [J]. APPLIED INTELLIGENCE, 2024, 54 (01) : 629 - 656