Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition

被引:16
|
作者
Memmesheimer, Raphael [1 ]
Haering, Simon [1 ]
Theisen, Nick [1 ]
Paulus, Dietrich [1 ]
机构
[1] Univ Koblenz Landau, Act Vis Grp, Mainz, Germany
关键词
D O I
10.1109/WACV51458.2022.00091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-shot action recognition allows the recognition of human-performed actions with only a single training example. This can influence human-robot-interaction positively by enabling the robot to react to previously unseen behaviour. We formulate the one-shot action recognition problem as a deep metric learning problem and propose a novel image-based skeleton representation that performs well in a metric learning setting. Therefore, we train a model that projects the image representations into an embedding space. In embedding space similar actions have a low euclidean distance while dissimilar actions have a higher distance. The one-shot action recognition problem becomes a nearest-neighbor search in a set of activity reference samples. We evaluate the performance of our proposed representation against a variety of other skeleton-based image representations. In addition we present an ablation study that shows the influence of different embedding vector sizes, losses and augmentation. Our approach lifts the state-of-the-art by 3.3% for the one-shot action recognition protocol on the NTU RGB+D 120 dataset under a comparable training setup. With additional augmentation our result improved over 7.7%.
引用
收藏
页码:837 / 845
页数:9
相关论文
共 50 条
  • [41] Adaptive Spatiotemporal Representation Learning for Skeleton-Based Human Action Recognition
    Yu, Jiahui
    Gao, Hongwei
    Chen, Yongquan
    Zhou, Dalin
    Liu, Jinguo
    Ju, Zhaojie
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (04) : 1654 - 1665
  • [42] Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition
    Liu, Xingyu
    Zhou, Sanping
    Wang, Le
    Hua, Gang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1379 - 1388
  • [43] Skeleton-based Human Action Recognition A Learning Method based on Active Joints
    Tehrani, Ahmad K. N.
    Aghbolaghi, Maryam Asadi
    Kasaei, Shohreh
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 303 - 310
  • [44] Deep Residual Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Khamsehashari, R.
    Gadzicki, K.
    Zetzsche, C.
    COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 376 - 385
  • [45] Generative Action Description Prompts for Skeleton-based Action Recognition
    Xiang, Wangmeng
    Li, Chao
    Zhou, Yuxuan
    Wang, Biao
    Zhang, Lei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10242 - 10251
  • [46] Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition
    Zhang, Tong
    Zheng, Wenming
    Cui, Zhen
    Zong, Yuan
    Li, Chaolong
    Zhou, Xiaoyan
    Yang, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2926 - 2937
  • [47] Deep Stacked Bidirectional LSTM Neural Network for Skeleton-Based Action Recognition
    Zou, Kai
    Yin, Ming
    Huang, Weitian
    Zeng, Yiqiu
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 676 - 688
  • [48] Fully Attentional Network for Skeleton-Based Action Recognition
    Liu, Caifeng
    Zhou, Hongcheng
    IEEE ACCESS, 2023, 11 : 20478 - 20485
  • [49] Insight on Attention Modules for Skeleton-Based Action Recognition
    Jiang, Quanyan
    Wu, Xiaojun
    Kittler, Josef
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 242 - 255
  • [50] Skeleton-based action recognition with JRR-GCN
    Ye, Fanfan
    Tang, Huiming
    ELECTRONICS LETTERS, 2019, 55 (17) : 933 - 935