Mixup Feature: A Pretext Task Self-Supervised Learning Method for Enhanced Visual Feature Learning

被引:0
|
作者
Xu, Jiashu [1 ]
Stirenko, Sergii [1 ]
机构
[1] Natl Tech Univ Ukraine, Comp Engn Dept, Igor Sikorsky Kyiv Polytech Inst, UA-03056 Kiev, Ukraine
关键词
Computer vision; mixup feature; self-supervised learning; masked autoencoder;
D O I
10.1109/ACCESS.2023.3301561
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised learning has emerged as an increasingly popular research topic within the field of computer vision. In this study, we propose a novel self-supervised learning approach based on Mixup features as pretext tasks. The proposed method aims to learn visual representations by predicting the MixupFeature of a masked image, which serves as a proxy for higher-level semantic information. Specifically, we investigate the efficacy of Mixup features as the prediction target for self-supervised learning. By setting the hyperparameter ? through Mixup operations, pairwise combinations of Sobel edge feature maps, HOG feature maps, and LBP feature maps are created. We employ the vision transformer as the backbone network, drawing inspiration from masked autoencoders (MAE). We evaluate the proposed method on three benchmark datasets, namely Cifar-10, Cifar-100, and STL-10, and compare it with other state-ofthe-art self-supervised learning approaches. The experiments demonstrate that mixed HOG-Sobel feature maps after Mixup achieve the best results in fine-tuning experiments on Cifar-10 and STL-10. Furthermore, compared to contrastive learning-based self-supervised learning methods, our approach proves to be more efficient, with shorter training durations and no reliance on data augmentation. When compared to generative self-supervised learning approaches based on MAE, the average performance improvement is 0.4%. Overall, the proposed self-supervised learning method based on Mixup features offers a promising direction for future research in the computer vision domain and has the potential to enhance performance across various downstream tasks. Our code will be published in GitHub.
引用
收藏
页码:82400 / 82409
页数:10
相关论文
共 50 条
  • [1] Self-Supervised Feature Enhancement: Applying Internal Pretext Task to Supervised Learning
    Xie, Tianshu
    Yang, Yuhang
    Ding, Zilin
    Cheng, Xuan
    Wang, Xiaomin
    Gong, Haigang
    Liu, Ming
    [J]. IEEE ACCESS, 2023, 11 : 1708 - 1717
  • [2] On Feature Decorrelation in Self-Supervised Learning
    Hua, Tianyu
    Wang, Wenxiao
    Xue, Zihui
    Ren, Sucheng
    Wang, Yue
    Zhao, Hang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9578 - 9588
  • [3] Self-supervised Visual Feature Learning and Classification Framework: Based on Contrastive Learning
    Wang, Zhibo
    Yan, Shen
    Zhang, Xiaoyu
    Lobo, Niels Da Vitoria
    [J]. 16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 719 - 725
  • [4] Self-Supervised Feature Learning by Learning to Spot Artifacts
    Jenni, Simon
    Favaro, Paolo
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2733 - 2742
  • [5] Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey
    Jing, Longlong
    Tian, Yingli
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 4037 - 4058
  • [6] Digging Into Self-Supervised Learning of Feature Descriptors
    Melekhov, Iaroslav
    Laskar, Zakaria
    Li, Xiaotian
    Wang, Shuzhe
    Kannala, Juho
    [J]. 2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 1144 - 1155
  • [7] Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task
    Nava, Mirko
    Carlotti, Nicholas
    Crupi, Luca
    Palossi, Daniele
    Giusti, Alessandro
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3363 - 3370
  • [8] An Emotion Recognition Method Based On Feature Fusion and Self-Supervised Learning
    Cao, Xuanmeng
    Sun, Ming
    [J]. 2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 216 - 221
  • [9] Self-Supervised Visual Terrain Classification From Unsupervised Acoustic Feature Learning
    Zurn, Jannik
    Burgard, Wolfram
    Valada, Abhinav
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (02) : 466 - 481
  • [10] Self-Supervised Feature Learning for Long-Term Metric Visual Localization
    Chen, Yuxuan
    Barfoot, Timothy D.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 472 - 479