Hybrid attention adaptive sampling network for human pose estimation in videos

被引:0
|
作者
Song, Qianyun [1 ]
Zhang, Hao [1 ]
Liu, Yanan [1 ]
Sun, Shouzheng [1 ]
Xu, Dan [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Univ City East Outer Ring South Rd, Kunming, Yunnan, Peoples R China
关键词
adaptive sampling; attention mechanism; human pose estimation; videos;
D O I
10.1002/cav.2244
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human pose estimation in videos often uses sampling strategies like sparse uniform sampling and keyframe selection. Sparse uniform sampling can miss spatial-temporal relationships, while keyframe selection using CNNs struggles to fully capture these relationships and is costly. Neither strategy ensures the reliability of pose data from single-frame estimators. To address these issues, this article proposes an efficient and effective hybrid attention adaptive sampling network. This network includes a dynamic attention module and a pose quality attention module, which comprehensively consider the dynamic information and the quality of pose data. Additionally, the network improves efficiency through compact uniform sampling and parallel mechanism of multi-head self-attention. Our network is compatible with various video-based pose estimation frameworks and demonstrates greater robustness in high degree of occlusion, motion blur, and illumination changes, achieving state-of-the-art performance on Sub-JHMDB dataset. The article introduces a hybrid attention adaptive sampling network for video-based human pose estimation, integrating dynamic and pose quality attention modules to enhance data quality and dynamic capture. This approach outperforms traditional sampling strategies, demonstrating robust performance even under challenging conditions like occlusion, motion blur, and illumination variance, achieving state-of-the-art results on Sub-JHMDB. image
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos
    Jin, Kyung-Min
    Lim, Byoung-Sung
    Lee, Gun-Hee
    Kang, Tae-Kyung
    Lee, Seong-Whan
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5714 - 5723
  • [2] Adaptive occlusion hybrid second-order attention network for head pose estimation
    Qi Fu
    Kai Xie
    Chang Wen
    Jianbiao He
    Wei Zhang
    Hongling Tian
    Sheng Yang
    [J]. International Journal of Machine Learning and Cybernetics, 2024, 15 : 667 - 683
  • [3] Adaptive occlusion hybrid second-order attention network for head pose estimation
    Fu, Qi
    Xie, Kai
    Wen, Chang
    He, Jianbiao
    Zhang, Wei
    Tian, Hongling
    Yang, Sheng
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (02) : 667 - 683
  • [4] Attention Refined Network for Human Pose Estimation
    Wang, Xiangyang
    Tong, Jiangwei
    Wang, Rui
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (04) : 2853 - 2872
  • [5] Attention Refined Network for Human Pose Estimation
    Xiangyang Wang
    Jiangwei Tong
    Rui Wang
    [J]. Neural Processing Letters, 2021, 53 : 2853 - 2872
  • [6] Multistage attention network for human pose estimation
    Zhou, Jingyang
    Wen, Guangzhao
    Zhang, Yu
    Geng, Xin
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [7] Human Pose Estimation in Videos
    Zhang, Dong
    Shah, Mubarak
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2012 - 2020
  • [8] TEMPORAL FEATURE ENHANCING NETWORK FOR HUMAN POSE ESTIMATION IN VIDEOS
    Li, Haihan
    Yang, Wenming
    Liao, Qingmin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 579 - 583
  • [9] Self-Attention Network for Human Pose Estimation
    Xia, Hailun
    Zhang, Tianyang
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 15
  • [10] Human Pose Estimation Fusing Weight Adaptive Loss and Attention
    Jiang, Chunling
    Zeng, Bi
    Yao, Zhuangze
    Deng, Bin
    [J]. Computer Engineering and Applications, 2023, 59 (18) : 145 - 153