Multi-grained clip focus for skeleton-based action recognition

被引:10
|
作者
Qiu, Helei [1 ]
Hou, Biao [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding,, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Skeleton; Multi-grain; Self-attention;
D O I
10.1016/j.patcog.2023.110188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint-level and part-level information are crucial for modeling actions with different granularity. In addition, the relevant information on different joints between consecutive frames is very useful for skeleton-based action recognition. To effectively capture the action information, a new multi-grained clip focus network (MGCF-Net) is proposed. Firstly, the skeleton sequence is divided into multiple clips, each containing several consecutive frames. According to the structure of the human body, each clip is divided into several tuples. Then an intra-clip attention module is proposed to capture intra-clip action information. Specifically, multi-head self-attention is divided into two parts, obtaining relevant information at the joint and part levels, and integrating the information captured from these two parts to obtain multi-grained contextual features. In addition, an inter clip focus module is used to capture the key information of several consecutive sub-actions, which will help to distinguish similar actions. On two large-scale benchmarks for skeleton-based action recognition, our method achieves the most advanced performance, and its effectiveness has been verified.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Multi-Grained Temporal Segmentation Attention Modeling for Skeleton-Based Action Recognition
    Lv, Jinrong
    Gong, Xun
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 927 - 931
  • [2] SpatioTemporal focus for skeleton-based action recognition
    Wu, Liyu
    Zhang, Can
    Zou, Yuexian
    PATTERN RECOGNITION, 2023, 136
  • [3] CdCLR: Clip- Driven Contrastive Learning for Skeleton-Based Action Recognition
    Gao, Rong
    Liu, Xin
    Yang, Jingyu
    Yue, Huanjing
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [4] Learning Clip Representations for Skeleton-Based 3D Action Recognition
    Ke, Qiuhong
    Bennamoun, Mohammed
    An, Senjian
    Sohel, Ferdous
    Boussaid, Farid
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) : 2842 - 2855
  • [5] Spatial Focus Attention for Fine-Grained Skeleton-Based Action Tasks
    Liu, Kaiyuan
    Li, Yunheng
    Xu, Yuanfeng
    Liu, Shuai
    Liu, Shenglan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1883 - 1887
  • [6] Fine-Grained Skeleton-Based Human Action Recognition for Figure Skating
    Wei, Zhihong
    Qin, Jianshu
    Lie, Bangmao
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2686 - 2691
  • [7] Revisiting Skeleton-based Action Recognition
    Duan, Haodong
    Zhao, Yue
    Chen, Kai
    Lin, Dahua
    Dai, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
  • [8] Multi-Term Attention Networks for Skeleton-Based Action Recognition
    Diao, Xiaolei
    Li, Xiaoqiang
    Huang, Chen
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [9] Robust Multi-Feature Learning for Skeleton-Based Action Recognition
    Wang, Yingfu
    Xu, Zheyuan
    Li, Li
    Yao, Jian
    IEEE ACCESS, 2019, 7 : 148658 - 148671
  • [10] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Zheng, Wu
    Li, Lin
    Zhang, Zhaoxiang
    Huang, Yan
    Wang, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 826 - 831