Learning representational invariances for data-efficient action recognition

被引:6
|
作者
Zou, Yuliang [1 ]
Choi, Jinwoo [2 ]
Wang, Qitong [3 ]
Huang, Jia-Bin [4 ]
机构
[1] Virginia Tech, Dept Elect & Comp Engn, Blacksburg, VA USA
[2] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin, South Korea
[3] Univ Delaware, Dept Comp & Informat Sci, Newark, DE USA
[4] Univ Maryland Coll Pk, Dept Comp Sci, College Pk, MD USA
基金
美国国家科学基金会;
关键词
3D human pose and shape estimation; Self-supervised learning; Occlusion handling;
D O I
10.1016/j.cviu.2022.103597
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation is a ubiquitous technique for improving image classification when labeled data is scarce. Constraining the model predictions to be invariant to diverse data augmentations effectively injects the desired representational invariances to the model (e.g., invariance to photometric variations) and helps improve accuracy. Compared to image data, the appearance variations in videos are far more complex due to the additional temporal dimension. Yet, data augmentation methods for videos remain under-explored. This paper investigates various data augmentation strategies that capture different video invariances, including photometric, geometric, temporal, and actor/scene augmentations. When integrated with existing semi -supervised learning frameworks, we show that our data augmentation strategy leads to promising performance on the Kinetics-100/400, Mini-Something-v2, UCF-101, and HMDB-51 datasets in the low-label regime. We also validate our data augmentation strategy in the fully supervised setting and demonstrate improved performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Data-Efficient Graph Learning
    Ding, Kaize
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22663 - 22663
  • [2] Learning Data-Efficient Hierarchical Features for Robotic Graspable Object Recognition
    Wang, Zhichao
    Wang, Bin
    Guo, Chuangqiang
    Li, Zhiqi
    Liu, Yang
    Liu, Hong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2017, : 590 - 595
  • [3] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
    Li, Qiankun
    Huang, Xiaolong
    Wan, Zhifan
    Hu, Lanqing
    Wu, Shuzhe
    Zhang, Jie
    Shan, Shiguang
    Wang, Zengfu
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
  • [4] Data-Efficient Hierarchical Reinforcement Learning
    Nachum, Ofir
    Gu, Shixiang
    Lee, Honglak
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [5] Uniform Priors for Data-Efficient Learning
    Sinha, Samarth
    Roth, Karsten
    Goyal, Anirudh
    Ghassemi, Marzyeh
    Akata, Zeynep
    Larochelle, Hugo
    Garg, Animesh
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037
  • [6] Data-efficient Learning of Morphology and Controller for a Microrobot
    Liao, Thomas
    Wang, Grant
    Yang, Brian
    Lee, Rene
    Pister, Kristofer
    Levine, Sergey
    Calandra, Roberto
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2488 - 2494
  • [7] Data-Efficient Reinforcement Learning for Malaria Control
    Zou, Lixin
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
  • [8] Data-efficient performance learning for configurable systems
    Jianmei Guo
    Dingyu Yang
    Norbert Siegmund
    Sven Apel
    Atrisha Sarkar
    Pavel Valov
    Krzysztof Czarnecki
    Andrzej Wasowski
    Huiqun Yu
    [J]. Empirical Software Engineering, 2018, 23 : 1826 - 1867
  • [9] Pretraining Representations for Data-Efficient Reinforcement Learning
    Schwarzer, Max
    Rajkumar, Nitarshan
    Noukhovitch, Michael
    Anand, Ankesh
    Charlin, Laurent
    Hjelm, Devon
    Bachman, Philip
    Courville, Aaron
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
    McAllister, Rowan Thomas
    Rasmussen, Carl Edward
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30