Multi-View Action Recognition using Contrastive Learning

被引：16

作者：

Shah, Ketul ^{[1
]}

Shah, Anshul ^{[1
]}

Lau, Chun Pong ^{[1
]}

de Melo, Celso M. ^{[2
]}

Chellappa, Rama ^{[1
]}

机构：

[1] Johns Hopkins Univ, Baltimore, MD 21218 USA

[2] DEVCOM Army Res Lab, Adelphi, MD USA

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00338

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we present a method for RGB-based action recognition using multi-view videos. We present a supervised contrastive learning framework to learn a feature embedding robust to changes in viewpoint, by effectively leveraging multi-view data. We use an improved supervised contrastive loss and augment the positives with those coming from synchronized viewpoints. We also propose a new approach to use classifier probabilities to guide the selection of hard negatives in the contrastive loss, to learn a more discriminative representation. Negative samples from confusing classes based on posterior are weighted higher. We also show that our method leads to better domain generalization compared to the standard supervised training based on synthetic multi-view data. Extensive experiments on real (NTU-60, NTU-120, NUMA) and synthetic (RoCoG) data demonstrate the effectiveness of our approach.

引用

下载

页码：3370 / 3380

页数：11

共 50 条

[1] Multi-view representation learning for multi-view action recognition
Hao, Tong
Wu, Dan
Wang, Qian
Sun, Jin-Sheng
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
[2] Contrastive Multi-View Kernel Learning
Liu, Jiyuan
Liu, Xinwang
Yang, Yuexiang
Liao, Qing
Xia, Yuanqing
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9552 - 9566
[3] Multi-view dreaming: multi-view world model with contrastive learning
Kinose A.
Okumura R.
Okada M.
Taniguchi T.
Advanced Robotics, 2023, 37 (19) : 1212 - 1220
[4] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
Shuai Bi
Zhengping Hu
Mengyao Zhao
Hehao Zhang
Jirui Di
Zhe Sun
Signal, Image and Video Processing, 2023, 17 : 3775 - 3782
[5] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
Bi, Shuai
Hu, Zhengping
Zhao, Mengyao
Zhang, Hehao
Di, Jirui
Sun, Zhe
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
[6] Heterogeneous Graph Contrastive Multi-view Learning
Wang, Zehong
Li, Qi
Yu, Donghua
Han, Xiaolong
Gao, Xiao-Zhi
Shen, Shigen
PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 136 - 144
[7] Multi-view Contrastive Learning Network for Recommendation
Bu, Xiya
Ma, Ruixin
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 319 - 330
[8] Dual contrastive learning for multi-view clustering
Bao, Yichen
Zhao, Wenhui
Zhao, Qin
Gao, Quanxue
Yang, Ming
NEUROCOMPUTING, 2024, 599
[9] Multi-View Contrastive Learning from Demonstrations
Correia, Andre
Alexandre, Luis A.
2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC, 2022, : 338 - 344
[10] Contrastive Multi-View Representation Learning on Graphs
Hassani, Kaveh
Khasahmadi, Amir Hosein
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119

← 1 2 3 4 5 →