Multi-View Action Recognition using Contrastive Learning

被引:16
|
作者
Shah, Ketul [1 ]
Shah, Anshul [1 ]
Lau, Chun Pong [1 ]
de Melo, Celso M. [2 ]
Chellappa, Rama [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] DEVCOM Army Res Lab, Adelphi, MD USA
关键词
D O I
10.1109/WACV56688.2023.00338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present a method for RGB-based action recognition using multi-view videos. We present a supervised contrastive learning framework to learn a feature embedding robust to changes in viewpoint, by effectively leveraging multi-view data. We use an improved supervised contrastive loss and augment the positives with those coming from synchronized viewpoints. We also propose a new approach to use classifier probabilities to guide the selection of hard negatives in the contrastive loss, to learn a more discriminative representation. Negative samples from confusing classes based on posterior are weighted higher. We also show that our method leads to better domain generalization compared to the standard supervised training based on synthetic multi-view data. Extensive experiments on real (NTU-60, NTU-120, NUMA) and synthetic (RoCoG) data demonstrate the effectiveness of our approach.
引用
下载
收藏
页码:3370 / 3380
页数:11
相关论文
共 50 条
  • [1] Multi-view representation learning for multi-view action recognition
    Hao, Tong
    Wu, Dan
    Wang, Qian
    Sun, Jin-Sheng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
  • [2] Contrastive Multi-View Kernel Learning
    Liu, Jiyuan
    Liu, Xinwang
    Yang, Yuexiang
    Liao, Qing
    Xia, Yuanqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9552 - 9566
  • [3] Multi-view dreaming: multi-view world model with contrastive learning
    Kinose A.
    Okumura R.
    Okada M.
    Taniguchi T.
    Advanced Robotics, 2023, 37 (19) : 1212 - 1220
  • [4] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Shuai Bi
    Zhengping Hu
    Mengyao Zhao
    Hehao Zhang
    Jirui Di
    Zhe Sun
    Signal, Image and Video Processing, 2023, 17 : 3775 - 3782
  • [5] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Bi, Shuai
    Hu, Zhengping
    Zhao, Mengyao
    Zhang, Hehao
    Di, Jirui
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
  • [6] Heterogeneous Graph Contrastive Multi-view Learning
    Wang, Zehong
    Li, Qi
    Yu, Donghua
    Han, Xiaolong
    Gao, Xiao-Zhi
    Shen, Shigen
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 136 - 144
  • [7] Multi-view Contrastive Learning Network for Recommendation
    Bu, Xiya
    Ma, Ruixin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 319 - 330
  • [8] Dual contrastive learning for multi-view clustering
    Bao, Yichen
    Zhao, Wenhui
    Zhao, Qin
    Gao, Quanxue
    Yang, Ming
    NEUROCOMPUTING, 2024, 599
  • [9] Multi-View Contrastive Learning from Demonstrations
    Correia, Andre
    Alexandre, Luis A.
    2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC, 2022, : 338 - 344
  • [10] Contrastive Multi-View Representation Learning on Graphs
    Hassani, Kaveh
    Khasahmadi, Amir Hosein
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119