Global-local contrastive multiview representation learning for skeleton-based action

Cited by: 3

Authors:

Bian, Cunling [1]

Feng, Wei [1]

Meng, Fanbo [2]

Wang, Song [3]

Affiliations:

[1] Tianjin Univ, Coll Intelligence & Comp, Sch Comp Sci & Technol, Tianjin 300350, Peoples R China

[2] Tianjin Univ, Inst Int Engn, Tianjin 300350, Peoples R China

[3] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA

Source:

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023 / Vol. 229

Funding:

National Natural Science Foundation of China;

Keywords:

Skeleton-based action recognition; Contrastive representation learning; Multiview; Graph convolutional network; DEEPER;

DOI:

10.1016/j.cviu.2023.103655

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Skeleton-based human action recognition has been drawing more interest recently due to its low sensitivity to appearance changes and the accessibility of more skeleton data. However, the skeletons captured in practice are sensitive to the view of an actor, given the occlusion of different human-body joints and the errors in human joint localization. Each view is noisy and incomplete, but important factors, such as motion and semantics, should be shared between all views in action representation learning. We support the classic hypothesis that a powerful representation is one that models view-invariant factors, and so does unsupervised learning. Therefore, we study this hypothesis under the framework of contrastive multiview learning, where we learn a representation for action recognition that aims to maximize the mutual information between different views of the same action sequence. Apart from that, a global-local contrastive loss is proposed to model the multi-scale co-occurrence relationships in both spatial and temporal domains. Extensive experimental results show that the proposed method significantly boosts the performance of unsupervised skeleton-based human action recognition methods on three challenging benchmarks: PKU-MMD, NTU RGB+D 60, and NTU RGB+D 120.

Pages: 10

50 items in total

[21] Adaptive Spatiotemporal Representation Learning for Skeleton-Based Human Action Recognition
Yu, Jiahui
Gao, Hongwei
Chen, Yongquan
Zhou, Dalin
Liu, Jinguo
Ju, Zhaojie
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (04) : 1654 - 1665
[22] Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition
Shu, Xiangbo
Xu, Binqian
Zhang, Liyan
Tang, Jinhui
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7559 - 7576
[23] Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition
Wang, Mingdao
Li, Xueming
Chen, Siqi
Zhang, Xianlin
Ma, Lei
Zhang, Yue
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3207 - 3220
[24] Reconstruction-driven contrastive learning for unsupervised skeleton-based human action recognition
Xing Liu
Bo Gao
The Journal of Supercomputing, 2025, 81 (1)
[25] Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semisupervised Skeleton-Based Action Recognition
Xu, Binqian
Shu, Xiangbo
Zhang, Jiachao
Dai, Guangzhao
Song, Yan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11035 - 11048
[26] A GLOBAL-LOCAL CONTRASTIVE LEARNING FRAMEWORK FOR VIDEO CAPTIONING
Huang, Qunyue
Fang, Bin
Ai, Xi
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2410 - 2414
[27] Unsupervised skeleton-based action representation learning via relation consistency pursuit
Wenjing Zhang
Yonghong Hou
Haoyuan Zhang
Neural Computing and Applications, 2022, 34 : 20327 - 20339
[28] Global Co-Occurrence Feature and Local Spatial Feature Learning for Skeleton-Based Action Recognition
Xie, Jun
Xin, Wentian
Liu, Ruyi
Miao, Qiguang
Sheng, Lijie
Zhang, Liang
Gao, Xuesong
ENTROPY, 2020, 22 (10) : 1 - 16
[29] Unsupervised skeleton-based action representation learning via relation consistency pursuit
Zhang, Wenjing
Hou, Yonghong
Zhang, Haoyuan
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22) : 20327 - 20339
[30] Skeleton MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition
Xin, Wentian
Miao, Qiguang
Liu, Yi
Liu, Ruyi
Pun, Chi-Man
Shi, Cheng
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2211 - 2220