Multimodal multilevel attention for semi-supervised skeleton-based gesture recognition

被引:0
|
作者
Liu, Jinting [1 ]
Gan, Minggang [1 ]
He, Yuxuan [1 ]
Guo, Jia [1 ]
Hu, Kang [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing, Peoples R China
关键词
Gesture recognition; Skeleton; Self-attention; Semi-supervised; Deep learning; NEURAL-NETWORKS; FUSION;
D O I
10.1007/s40747-025-01807-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although skeleton-based gesture recognition using supervised learning has achieved promising results, the reliance on extensive annotated data poses significant costs. This paper addresses the challenge of semi-supervised skeleton-based gesture recognition, to effectively learn feature representations from labeled and unlabeled data. To resolve this problem, we propose a novel multimodal multilevel attention network designed for semi-supervised learning. This model utilizes the self-attention mechanism to polymerize multimodal and multilevel complementary semantic information of the hand skeleton, designing a multimodal multilevel contrastive loss to measure feature similarity. Specifically, our method explores the relationships between joint, bone, and motion to learn more discriminative feature representations. Considering the hierarchy of the hand skeleton, the skeleton data is divided into multilevel to capture complementary semantic information. Furthermore, the multimodal contrastive loss measures similarity among these multilevel representations. The proposed method demonstrates improved performance in semi-supervised skeleton-based gesture recognition tasks, as evidenced by experiments on the SHREC-17 and DHG 14/28 datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Attention Relational Network for Skeleton-Based Group Activity Recognition
    Wang, Chuanchuan
    Mohamed, Ahmad Sufril Azlan
    IEEE ACCESS, 2023, 11 : 129230 - 129239
  • [42] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
    Zhang, Yujie
    Cai, Haibin
    ELECTRONICS, 2023, 12 (07)
  • [43] Skeleton-Based Attention Mask for Pedestrian Attribute Recognition Network
    Sooksatra, Sorn
    Rujikietgumjorn, Sitapa
    JOURNAL OF IMAGING, 2021, 7 (12)
  • [44] Activity recognition based on semi-supervised learning
    Guan, Donghai
    Yuan, Weiwei
    Lee, Young-Koo
    Gavrilov, Andrey
    Lee, Sungyoung
    13TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2007, : 469 - +
  • [45] Global Spatio-Temporal Deformable Network for Skeleton-Based Gesture Recognition
    Shi D.
    Lin H.
    Liu Y.
    Zhang X.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (01): : 60 - 66
  • [46] A neural network based on SPD manifold learning for skeleton-based hand gesture recognition
    Nguyen, Xuan Son
    Brun, Luc
    Lezoray, Olivier
    Bougleux, Sebastien
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12028 - 12037
  • [47] Semi-supervised Learning with Multimodal Perturbation
    Su, Lei
    Liao, Hongzhi
    Yu, Zhengtao
    Tang, Jiahua
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS, 2009, 5551 : 651 - +
  • [48] TEMPORAL-SPATIAL DEFORMABLE POSE NETWORK FOR SKELETON-BASED GESTURE RECOGNITION
    Lin, Honghui
    Cheng, Jiale
    Li, Yu
    Zhang, Xin
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2324 - 2328
  • [49] Hidden States Exploration for 3D Skeleton-based Gesture Recognition
    Liu, Xin
    Shi, Henglin
    Hong, Xiaopeng
    Chen, Haoyu
    Tao, Dacheng
    Zhao, Guoying
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1846 - 1855
  • [50] Bidirectional Independently Recurrent Neural Network for Skeleton-based Hand Gesture Recognition
    Li, Shuai
    Zheng, Longfei
    Zhu, Ce
    Gao, Yanbo
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,