Parallel Self-Attention and Spatial-Attention Fusion for Human Pose Estimation and Running Movement Recognition

被引:0
|
作者
Wu, Qingtian [1 ]
Zhang, Yu [2 ,3 ]
Zhang, Liming [1 ]
Yu, Haoyong [4 ]
机构
[1] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Macau, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
[3] Shenyang Univ Chem Technol, Comp Sci & Technol Coll, Shenyang 110142, Peoples R China
[4] Natl Univ Singapore, Dept Biomed Engn, Singapore 119077, Singapore
关键词
Transformers; Semantics; Pose estimation; Feature extraction; Convolutional neural networks; Task analysis; Visualization; Feature fusion; human pose estimation (HPE); running recognition; self-attention; spatial attention;
D O I
10.1109/TCDS.2023.3275652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human pose estimation (HPE) is a fundamental yet promising visual recognition problem. Existing popular methods (e.g., Hourglass and its variants) either attempt to directly add local features element-wisely, or (e.g., vision transformers) try to learn the global relationships among different human parts. However, it remains an open problem to effectively integrate the local-global representations for accurate HPE. In this work, we design four feature fusion strategies on the hierarchical ResNet structure, including direct channel concatenation, element-wise addition, and two parallel structures. Both two parallel structures adopt the naive self-attention encoder to model global dependencies. The difference between them is that one adopts the original ResNet BottleNeck while the other employs a spatial-attention module (named SSF) to learn the local patterns. Experiments on COCO Keypoint 2017 show that our SSF for HPE (named SSPose) achieves the best average precision with acceptable computational cost among the compared state-of-the-art methods. In addition, we build a lightweight running data set to verify the effectiveness of SSPose. Based solely on the keypoints estimated by our SSPose, we propose a regression model to identify valid running movements without training any other classifiers. Our source codes and running data set are publicly available.
引用
收藏
页码:358 / 368
页数:11
相关论文
共 50 条
  • [41] Infrared head pose estimation with multi-scales feature fusion on the IRHP database for human attention recognition
    Liu, Hai
    Wang, Xiang
    Zhang, Wei
    Zhang, Zhaoli
    Li, You-Fu
    [J]. NEUROCOMPUTING, 2020, 411 : 510 - 520
  • [42] Multimodal Fusion Method Based on Self-Attention Mechanism
    Zhu, Hu
    Wang, Ze
    Shi, Yu
    Hua, Yingying
    Xu, Guoxia
    Deng, Lizhen
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020 (2020):
  • [43] An efficient parallel self-attention transformer for CSI feedback
    Liu, Ziang
    Song, Tianyu
    Zhao, Ruohan
    Jin, Jiyu
    Jin, Guiyue
    [J]. PHYSICAL COMMUNICATION, 2024, 66
  • [44] Self-attention feature fusion network for semantic segmentation
    Zhou, Zhen
    Zhou, Yan
    Wang, Dongli
    Mu, Jinzhen
    Zhou, Haibin
    [J]. NEUROCOMPUTING, 2021, 453 : 50 - 59
  • [45] Crop Diseases Recognition Method via Fusion Color Mask and Self-attention Mechanism
    Yu, Ming
    Li, Ruoxi
    Yan, Gang
    Wang, Yan
    Wang, Jianchun
    Li, Yang
    [J]. Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2022, 53 (08): : 337 - 344
  • [46] Pest Identification Based on Fusion of Self-Attention With ResNet
    Hassan, Sk Mahmudul
    Maji, Arnab Kumar
    [J]. IEEE ACCESS, 2024, 12 : 6036 - 6050
  • [47] Dual Stream Spatio-Temporal Motion Fusion With Self-Attention For Action Recognition
    Jalal, Md Asif
    Aftab, Waqas
    Moore, Roger K.
    Mihaylova, Lyudmila
    [J]. 2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [48] Advancing classroom fatigue recognition: A multimodal fusion approach using self-attention mechanism
    Cao, Lei
    Wang, Wenrong
    Dong, Yilin
    Fan, Chunjiang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [49] Self-attention and forgetting fusion knowledge tracking algorithm
    Song, Jianfeng
    Wang, Yukai
    Zhang, Chu
    Xie, Kun
    [J]. INFORMATION SCIENCES, 2024, 680
  • [50] Lightweight Human Pose Estimation with Attention Mechanism
    Chu Xiaoshuai
    Ji Ruirui
    Dong Danyang
    Xi Yuzhuo
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705