Learning representational invariances for data-efficient action recognition

被引：6

作者：

Zou, Yuliang ^{[1
]}

Choi, Jinwoo ^{[2
]}

Wang, Qitong ^{[3
]}

Huang, Jia-Bin ^{[4
]}

机构：

[1] Virginia Tech, Dept Elect & Comp Engn, Blacksburg, VA USA

[2] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin, South Korea

[3] Univ Delaware, Dept Comp & Informat Sci, Newark, DE USA

[4] Univ Maryland Coll Pk, Dept Comp Sci, College Pk, MD USA

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023年 / 227卷

基金：

美国国家科学基金会;

关键词：

3D human pose and shape estimation; Self-supervised learning; Occlusion handling;

D O I：

10.1016/j.cviu.2022.103597

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data augmentation is a ubiquitous technique for improving image classification when labeled data is scarce. Constraining the model predictions to be invariant to diverse data augmentations effectively injects the desired representational invariances to the model (e.g., invariance to photometric variations) and helps improve accuracy. Compared to image data, the appearance variations in videos are far more complex due to the additional temporal dimension. Yet, data augmentation methods for videos remain under-explored. This paper investigates various data augmentation strategies that capture different video invariances, including photometric, geometric, temporal, and actor/scene augmentations. When integrated with existing semi -supervised learning frameworks, we show that our data augmentation strategy leads to promising performance on the Kinetics-100/400, Mini-Something-v2, UCF-101, and HMDB-51 datasets in the low-label regime. We also validate our data augmentation strategy in the fully supervised setting and demonstrate improved performance.

引用

页数：13

共 50 条

[1] Data-Efficient Graph Learning
Ding, Kaize
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22663 - 22663
[2] Learning Data-Efficient Hierarchical Features for Robotic Graspable Object Recognition
Wang, Zhichao
Wang, Bin
Guo, Chuangqiang
Li, Zhiqi
Liu, Yang
Liu, Hong
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2017, : 590 - 595
[3] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
Li, Qiankun
Huang, Xiaolong
Wan, Zhifan
Hu, Lanqing
Wu, Shuzhe
Zhang, Jie
Shan, Shiguang
Wang, Zengfu
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
[4] Data-Efficient Hierarchical Reinforcement Learning
Nachum, Ofir
Gu, Shixiang
Lee, Honglak
Levine, Sergey
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[5] Uniform Priors for Data-Efficient Learning
Sinha, Samarth
Roth, Karsten
Goyal, Anirudh
Ghassemi, Marzyeh
Akata, Zeynep
Larochelle, Hugo
Garg, Animesh
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037
[6] Data-efficient Learning of Morphology and Controller for a Microrobot
Liao, Thomas
Wang, Grant
Yang, Brian
Lee, Rene
Pister, Kristofer
Levine, Sergey
Calandra, Roberto
[J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2488 - 2494
[7] Data-Efficient Reinforcement Learning for Malaria Control
Zou, Lixin
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
[8] Data-efficient performance learning for configurable systems
Jianmei Guo
Dingyu Yang
Norbert Siegmund
Sven Apel
Atrisha Sarkar
Pavel Valov
Krzysztof Czarnecki
Andrzej Wasowski
Huiqun Yu
[J]. Empirical Software Engineering, 2018, 23 : 1826 - 1867
[9] Pretraining Representations for Data-Efficient Reinforcement Learning
Schwarzer, Max
Rajkumar, Nitarshan
Noukhovitch, Michael
Anand, Ankesh
Charlin, Laurent
Hjelm, Devon
Bachman, Philip
Courville, Aaron
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[10] Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
McAllister, Rowan Thomas
Rasmussen, Carl Edward
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30

← 1 2 3 4 5 →