Global-local contrastive multiview representation learning for skeleton-based action

被引:3
|
作者
Bian, Cunling [1 ]
Feng, Wei [1 ]
Meng, Fanbo [2 ]
Wang, Song [3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Sch Comp Sci & Technol, Tianjin 300350, Peoples R China
[2] Tianjin Univ, Inst Int Engn, Tianjin 300350, Peoples R China
[3] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
基金
中国国家自然科学基金;
关键词
Skeleton-based action recognition; Contrastive representation learning; Multiview; Graph convolutional network; DEEPER;
D O I
10.1016/j.cviu.2023.103655
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based human action recognition has been drawing more interest recently due to its low sensitivity to appearance changes and the accessibility of more skeleton data. However, the skeletons captured in practice are sensitive to the view of an actor, given the occlusion of different human-body joints and the errors in human joint localization. Each view is noisy and incomplete, but important factors, such as motion and semantics, should be shared between all views in action representation learning. We support the classic hypothesis that a powerful representation is one that models view-invariant factors, and so does unsupervised learning. Therefore, we study this hypothesis under the framework of contrastive multiview learning, where we learn a representation for action recognition that aims to maximize the mutual information between different views of the same action sequence. Apart from that, a global-local contrastive loss is proposed to model the multi-scale co-occurrence relationships in both spatial and temporal domains. Extensive experimental results show that the proposed method significantly boosts the performance of unsupervised skeleton-based human action methods on three challenging benchmarks of PKUMMD, NTU RGB+D 60, and NTU RGB+D 120.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning
    Kim, Boeun
    Chang, Hyung Jin
    Kim, Jungho
    Choi, Jin Young
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 209 - 225
  • [2] Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning
    Kim, Boeun
    Chang, Hyung Jin
    Kim, Jungho
    Choi, Jin Young
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13664 LNCS : 209 - 225
  • [3] Global and Local Contrastive Learning for Self-supervised Skeleton-Based Action Recognition
    Hu J.
    Hou Y.
    Guo Z.
    Gao J.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 1 - 1
  • [4] EnsCLR: Unsupervised skeleton-based action recognition via ensemble contrastive learning of representation
    Wang, Kun
    Cao, Jiuxin
    Cao, Biwei
    Liu, Bo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
  • [5] Lightweight skeleton-based action recognition model based on global-local feature extraction and fusion
    Deng, Zhe
    Wang, Yulin
    Wei, Xing
    Yang, Fan
    Zhao, Chong
    Lu, Yang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 1477 - 1488
  • [6] JointContrast: Skeleton-Based Mutual Action Recognition with Contrastive Learning
    Jia, Xiangze
    Zhang, Ji
    Wang, Zhen
    Luo, Yonglong
    Chen, Fulong
    Xiao, Jing
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 478 - 489
  • [7] Multi-stream Global-Local Motion Fusion Network for skeleton-based action recognition
    Qi, Yanpeng
    Pang, Chen
    Liu, Yiliang
    Lyu, Lei
    APPLIED SOFT COMPUTING, 2023, 145
  • [8] Bootstrapped Representation Learning for Skeleton-Based Action Recognition
    Moliner, Olivier
    Huang, Sangxia
    Astrom, Kalle
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4153 - 4163
  • [9] JointContrast: Skeleton-Based Interaction Recognition with New Representation and Contrastive Learning
    Zhang, Ji
    Jia, Xiangze
    Wang, Zhen
    Luo, Yonglong
    Chen, Fulong
    Yang, Gaoming
    Zhao, Lihui
    ALGORITHMS, 2023, 16 (04)
  • [10] InfoGCN: Representation Learning for Human Skeleton-based Action Recognition
    Chi, Hyung-gun
    Ha, Myoung Hoon
    Chi, Seunggeun
    Lee, Sang Wan
    Huang, Qixing
    Ramani, Karthik
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20154 - 20164