Mutual Information Driven Equivariant Contrastive Learning for 3D Action Representation Learning

被引：0

作者：

Lin, Lilang ^{[1
]}

Zhang, Jiahang ^{[1
]}

Liu, Jiaying ^{[1
]}

机构：

[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100080, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

基金：

中国国家自然科学基金;

关键词：

Self-supervised learning; Skeleton; Task analysis; Representation learning; Data models; Three-dimensional displays; Convolutional neural networks; skeleton-based action recognition; contrastive learning; LSTM;

D O I：

10.1109/TIP.2024.3372451

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-supervised contrastive learning has proven to be successful for skeleton-based action recognition. For contrastive learning, data transformations are found to fundamentally affect the learned representation quality. However, traditional invariant contrastive learning is detrimental to the performance on the downstream task if the transformation carries important information for the task. In this sense, it limits the application of many data transformations in the current contrastive learning pipeline. To address these issues, we propose to utilize equivariant contrastive learning, which extends invariant contrastive learning and preserves important information. By integrating equivariant and invariant contrastive learning into a hybrid approach, the model can better leverage the motion patterns exposed by data transformations and obtain a more discriminative representation space. Specifically, a self-distillation loss is first proposed for transformed data of different intensities to fully utilize invariant transformations, especially strong invariant transformations. For equivariant transformations, we explore the potential of skeleton mixing and temporal shuffling for equivariant contrastive learning. Meanwhile, we analyze the impacts of different data transformations on the feature space in terms of two novel metrics proposed in this paper, namely, consistency and diversity. In particular, we demonstrate that equivariant learning boosts performance by alleviating the dimensional collapse problem. Experimental results on several benchmarks indicate that our method outperforms existing state-of-the-art methods.

引用

下载

页码：1883 / 1897

页数：15

共 50 条

[11] ECO-3D: Equivariant Contrastive Learning for Pre-training on Perturbed 3D Point Cloud
Wang, Ruibin
Ying, Xianghua
Xing, Bowei
Yang, Jinfa
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2626 - 2634
[12] CMD: Self-supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
Mao, Yunyao
Zhou, Wengang
Lu, Zhenbo
Deng, Jiajun
Li, Houqiang
COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 734 - 752
[13] Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion Mask Data Augmentation
Zeng, Qinyang
Liu, Chengju
Liu, Ming
Chen, Qijun
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 (1564-1574) : 1564 - 1574
[14] 3D seismic Fault Detection via Contrastive-Reconstruction Representation Learning
Dou, Yimin
Li, Kewen
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[15] Transformation-Equivariant Representation Learning with Barber-Agakov and InfoNCE Mutual Information Estimation
Sinaga, Marshal Arijona
Basarrudin, T.
Krisnadhi, Adila Alfa
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 99 - 109
[16] Towards a rigorous analysis of mutual information in contrastive learning
Lee, Kyungeun
Kim, Jaeill
Kang, Suhyun
Rhee, Wonjong
NEURAL NETWORKS, 2024, 179
[17] Learning 3D Skeletal Representation From Transformer for Action Recognition
Cha, Junuk
Saqlain, Muhammad
Kim, Donguk
Lee, Seungeun
Lee, Seongyeong
Baek, Seungryul
IEEE ACCESS, 2022, 10 : 67541 - 67550
[18] Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
Yang, Siyuan
Liu, Jun
Lu, Shijian
Er, Meng Hwa
Kot, Alex C.
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13403 - 13413
[19] Mutual Information Driven Federated Learning
Uddin, Md Palash
Xiang, Yong
Lu, Xuequan
Yearwood, John
Gao, Longxiang
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (07) : 1526 - 1538
[20] Equivariant Contrastive Learning for Sequential Recommendation
Zhou, Peilin
Gao, Jingqi
Xie, Yueqi
Ye, Qichen
Hua, Yining
Kim, Jaeboum
Wang, Shoujin
Kim, Sunghun
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 129 - 140

← 1 2 3 4 5 →