Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition

Cited by: 16

Authors:

Memmesheimer, Raphael [1]

Haering, Simon [1]

Theisen, Nick [1]

Paulus, Dietrich [1]

Affiliation:

[1] Univ Koblenz Landau, Act Vis Grp, Mainz, Germany

Source:

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) | 2022

Keywords:

DOI:

10.1109/WACV51458.2022.00091

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

One-shot action recognition allows the recognition of human-performed actions with only a single training example. This can influence human-robot interaction positively by enabling the robot to react to previously unseen behaviour. We formulate the one-shot action recognition problem as a deep metric learning problem and propose a novel image-based skeleton representation that performs well in a metric learning setting. To this end, we train a model that projects the image representations into an embedding space. In the embedding space, similar actions have a low Euclidean distance while dissimilar actions have a higher distance. The one-shot action recognition problem becomes a nearest-neighbor search in a set of activity reference samples. We evaluate the performance of our proposed representation against a variety of other skeleton-based image representations. In addition, we present an ablation study that shows the influence of different embedding vector sizes, losses and augmentation. Our approach improves on the state of the art by 3.3% for the one-shot action recognition protocol on the NTU RGB+D 120 dataset under a comparable training setup. With additional augmentation, our result improves by over 7.7%.
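
The nearest-neighbor step described in the abstract can be illustrated with a minimal sketch. It assumes the embeddings have already been produced by some trained model; the function name `classify_one_shot`, the dictionary layout, and the toy 3-dimensional vectors are hypothetical and are not taken from the paper.

```python
import math


def classify_one_shot(query, references):
    """Return the label of the reference embedding closest to the query.

    `references` maps each action label to the embedding of its single
    reference sample (a hypothetical data layout, not from the paper).
    """
    if not references:
        raise ValueError("at least one reference sample is required")
    # Euclidean distance in embedding space; the smallest distance wins.
    return min(references, key=lambda label: math.dist(query, references[label]))


# Toy embeddings invented for illustration only.
references = {
    "wave": [0.9, 0.1, 0.0],
    "sit_down": [0.0, 1.0, 0.2],
    "jump": [0.1, 0.0, 1.0],
}
query = [0.8, 0.2, 0.1]
predicted = classify_one_shot(query, references)
print(predicted)
```

With one reference sample per class, adding a new action class only requires adding one more entry to `references`; no retraining is implied by this sketch.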


Pages: 837-845

Page count: 9
