Multimodal multilevel attention for semi-supervised skeleton-based gesture recognition

Cited by: 0
Authors
Liu, Jinting [1]
Gan, Minggang [1]
He, Yuxuan [1]
Guo, Jia [1]
Hu, Kang [1]
Affiliations
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing, Peoples R China
Keywords
Gesture recognition; Skeleton; Self-attention; Semi-supervised; Deep learning; NEURAL-NETWORKS; FUSION;
DOI
10.1007/s40747-025-01807-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although supervised skeleton-based gesture recognition has achieved promising results, its reliance on extensive annotated data incurs significant cost. This paper addresses semi-supervised skeleton-based gesture recognition, where the goal is to learn effective feature representations from both labeled and unlabeled data. To this end, we propose a novel multimodal multilevel attention network designed for semi-supervised learning. The model uses the self-attention mechanism to aggregate complementary multimodal and multilevel semantic information from the hand skeleton, and introduces a multimodal multilevel contrastive loss to measure feature similarity. Specifically, our method explores the relationships among the joint, bone, and motion modalities to learn more discriminative feature representations. To account for the hierarchy of the hand skeleton, the skeleton data is divided into multiple levels that capture complementary semantic information. Furthermore, the multimodal contrastive loss measures similarity among these multilevel representations. Experiments on the SHREC-17 and DHG 14/28 datasets show that the proposed method improves performance on semi-supervised skeleton-based gesture recognition.
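The abstract names three skeleton modalities (joint, bone, motion) and a contrastive loss over their representations, but gives no formulas. The sketch below is only an illustration under assumptions that are not taken from the paper: bones are taken as child-minus-parent joint offsets, motion as frame-to-frame joint displacement, and the contrastive term as a generic InfoNCE-style loss between two modality embeddings. The function names, the `parents` list, and the `temperature` value are hypothetical.

```python
import numpy as np


def bone_stream(joints, parents):
    """Bone vectors as child-minus-parent offsets (assumed definition).

    joints:  array of shape (T, J, C) = frames, joints, coordinates.
    parents: length-J sequence; parents[j] is the parent joint of j,
             and the root joint points to itself (so its bone is zero).
    """
    parents = np.asarray(parents)
    return joints - joints[:, parents, :]


def motion_stream(joints):
    """Frame-to-frame joint displacement, zero-padded at the last frame."""
    motion = np.zeros_like(joints)
    motion[:-1] = joints[1:] - joints[:-1]
    return motion


def info_nce(a, b, temperature=0.1):
    """Generic InfoNCE-style loss between two sets of embeddings.

    a, b: arrays of shape (N, D). Row i of a and row i of b are treated as
    a positive pair (same sample, two modalities); all other rows in the
    batch act as negatives. This is a stand-in, not the paper's loss.
    """
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    logits = a @ b.T / temperature
    # Subtract the row-wise max for numerical stability before exponentiating.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy sequence: 4 frames, a 3-joint chain (0 -> 1 -> 2), 3-D coordinates.
    joints = rng.normal(size=(4, 3, 3))
    bones = bone_stream(joints, parents=[0, 0, 1])
    motion = motion_stream(joints)
    print(bones.shape, motion.shape)
```

In a full pipeline, each modality would pass through its own encoder before the loss is applied to the resulting embeddings; the encoders, the attention-based fusion, and the multilevel partition of the hand skeleton are omitted here because the abstract does not specify them.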
Pages: 16
Related papers
50 in total
  • [21] Decoupled Representation Network for Skeleton-Based Hand Gesture Recognition
    Zhong, Zhaochao
    Li, Yangke
    Yang, Jifang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 469 - 480
  • [22] Momentum Contrastive Teacher for Semi-Supervised Skeleton Action Recognition
    Lu, Mingqi
    Lu, Xiaobo
    Liu, Jun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 295 - 305
  • [23] Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition
    Zhu, Shasha
    Sun, Lu
    Ma, Zeyuan
    Li, Chenxi
    He, Dongzhi
    NEUROCOMPUTING, 2025, 611
  • [24] Insight on Attention Modules for Skeleton-Based Action Recognition
    Jiang, Quanyan
    Wu, Xiaojun
    Kittler, Josef
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 242 - 255
  • [25] Memory Attention Networks for Skeleton-Based Action Recognition
    Li, Ce
    Xie, Chunyu
    Zhang, Baochang
    Han, Jungong
    Zhen, Xiantong
    Chen, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4800 - 4814
  • [26] Memory Attention Networks for Skeleton-based Action Recognition
    Xie, Chunyu
    Li, Ce
    Zhang, Baochang
    Chen, Chen
    Han, Jungong
    Liu, Jianzhuang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1639 - 1645
  • [27] Spatial-Temporal Attention Res-TCN for Skeleton-Based Dynamic Hand Gesture Recognition
    Hou, Jingxuan
    Wang, Guijin
    Chen, Xinghao
    Xue, Jing-Hao
    Zhu, Rui
    Yang, Huazhong
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 : 273 - 286
  • [28] Geometric Magnification-Based Attention Graph Convolutional Network for Skeleton-Based Micro-Gesture Recognition
    Jiang, Haolin
    Zheng, Wenming
    Zong, Yuan
    Xu, Xiaolin
    Jiang, Xingxun
    Xue, Yunlong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3404 - 3408
  • [29] Semi-supervised Multimodal Emotion Recognition With Improved Wasserstein GANs
    Liang, Jingjun
    Chen, Shizhe
    Jin, Qin
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 695 - 703
  • [30] Lightweight Online Semi-Supervised Learning Algorithm for Ultrasonic Gesture Recognition
    Kang, Pixi
    Li, Xiangyu
    2021 IEEE SENSORS, 2021,