Semi-Supervised Action Recognition with Temporal Contrastive Learning

Cited by: 43
Authors
Singh, Ankit [1]
Chakraborty, Omprakash [2]
Varshney, Ashutosh [2]
Panda, Rameswar [3]
Feris, Rogerio [3]
Saenko, Kate [3, 4]
Das, Abir [2]
Affiliations
[1] IIT Madras, Chennai, Tamil Nadu, India
[2] IIT Kharagpur, Kharagpur, W Bengal, India
[3] MIT IBM Watson AI Lab, Cambridge, MA USA
[4] Boston Univ, Boston, MA 02215 USA
DOI
10.1109/CVPR46437.2021.01025
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning to recognize actions from only a handful of labeled videos is a challenging problem due to the scarcity of tediously collected activity labels. We approach this problem by learning a two-pathway temporal contrastive model using unlabeled videos at two different speeds, leveraging the fact that changing video speed does not change an action. Specifically, we propose to maximize the similarity between encoded representations of the same video at two different speeds, as well as to minimize the similarity between different videos played at different speeds. This way we use the rich supervisory information in terms of 'time' that is present in an otherwise unsupervised pool of videos. With this simple yet effective strategy of manipulating video playback rates, we considerably outperform video extensions of sophisticated state-of-the-art semi-supervised image recognition methods across multiple diverse benchmark datasets and network architectures. Interestingly, our proposed approach benefits from out-of-domain unlabeled videos, showing generalization and robustness. We also perform rigorous ablations and analysis to validate our approach.
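The objective described in the abstract can be illustrated with a minimal sketch. It assumes an InfoNCE-style loss over cosine similarities with a temperature; the abstract does not state the exact loss form, so the function names (`temporal_contrastive_loss`, `cosine_sim_matrix`), the `temperature` value, and the one-directional formulation here are illustrative assumptions rather than the authors' implementation. Embeddings are plain Python lists standing in for encoder outputs.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def cosine_sim_matrix(a, b):
    """Pairwise cosine similarity between every row of a and every row of b."""
    a = [l2_normalize(v) for v in a]
    b = [l2_normalize(v) for v in b]
    return [[sum(x * y for x, y in zip(u, w)) for w in b] for u in a]


def temporal_contrastive_loss(fast, slow, temperature=0.5):
    """Hypothetical InfoNCE-style loss over two playback speeds.

    fast[i] and slow[i] are embeddings of the same video i at two speeds
    (the positive pair); slow[j] for j != i acts as a negative for fast[i].
    Assumption: the loss form and temperature are not given in the abstract.
    """
    sim = cosine_sim_matrix(fast, slow)
    total = 0.0
    for i, row in enumerate(sim):
        logits = [s / temperature for s in row]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]  # -log softmax of the positive pair
    return total / len(sim)


# Toy embeddings: video 0 points along x, video 1 along y, at both speeds.
fast = [[1.0, 0.0], [0.0, 1.0]]
slow = [[2.0, 0.0], [0.0, 3.0]]
aligned_loss = temporal_contrastive_loss(fast, slow)
# Swapping the slow pathway pairs each video with a different video.
mismatched_loss = temporal_contrastive_loss(fast, slow[::-1])
```

In this toy case the loss is lower when each video is paired with its own other-speed embedding than when the pairs are swapped, which is the behavior the abstract describes: same-video similarity is pushed up and cross-video similarity is pushed down.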
Pages: 10384-10394
Page count: 11
Related papers
50 in total
  • [1] Actor-Aware Contrastive Learning for Semi-Supervised Action Recognition
    Assefa, Maregu
    Jiang, Wei
    Gedamu, Kumie
    Yilma, Getinet
    Ayalew, Melese
    Seid, Mohammed
    [J]. 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 660 - 665
  • [2] Learning from Temporal Gradient for Semi-supervised Action Recognition
    Xiao, Junfei
    Jing, Longlong
    Zhang, Lin
    He, Ju
    She, Qi
    Zhou, Zongwei
    Yuille, Alan
    Li, Yingwei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3242 - 3252
  • [3] Neighbor-Guided Consistent and Contrastive Learning for Semi-Supervised Action Recognition
    Wu, Jianlong
    Sun, Wei
    Gan, Tian
    Ding, Ning
    Jiang, Feijun
    Shen, Jialie
    Nie, Liqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2215 - 2227
  • [4] Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition
    Assefa, Maregu
    Jiang, Wei
    Zhan, Jinyu
    Gedamu, Kumie
    Yilma, Getinet
    Ayalew, Melese
    Adhikari, Deepak
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3491 - 3504
  • [5] Ego-Vehicle Action Recognition based on Semi-Supervised Contrastive Learning
    Noguchi, Chihiro
    Tanizawa, Toshihiro
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5977 - 5987
  • [6] Semi-Supervised Contrastive Learning for Human Activity Recognition
    Liu, Dongxin
    Abdelzaher, Tarek
    [J]. 17TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2021), 2021, : 45 - 53
  • [7] Semi-Supervised Group Emotion Recognition Based on Contrastive Learning
    Zhang, Jiayi
    Wang, Xingzhi
    Zhang, Dong
    Lee, Dah-Jye
    [J]. ELECTRONICS, 2022, 11 (23)
  • [8] Semi-Supervised Action Recognition From Temporal Augmentation Using Curriculum Learning
    Tong, Anyang
    Tang, Chao
    Wang, Wenjian
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1305 - 1319
  • [9] CONTRASTIVE SEMI-SUPERVISED LEARNING FOR ASR
    Xiao, Alex
    Fuegen, Christian
    Mohamed, Abdelrahman
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3870 - 3874
  • [10] Contrastive Regularization for Semi-Supervised Learning
    Lee, Doyup
    Kim, Sungwoong
    Kim, Ildoo
    Cheon, Yeongjae
    Cho, Minsu
    Han, Wook-Shin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3910 - 3919