InferTrans: Hierarchical structural fusion transformer for crowded human pose estimation

被引:0
|
作者
Li, Muyu [1 ,2 ]
Wang, Yingfeng [4 ]
Hu, Henan [3 ]
Zhao, Xudong [1 ,2 ]
机构
[1] Institute of Intelligent Science and Technology, School of Control Science and Engineering, Dalian University of Technology, Liaoning, Dalian,116024, China
[2] Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Liaoning, Dalian,116024, China
[3] School of Mechanical Engineering, Dalian Jiaotong University, Liaoning, Dalian,116028, China
[4] Center for Intelligent Multidimensional Data Analysis, Hong Kong Science Park, Hong Kong
关键词
D O I
10.1016/j.inffus.2024.102878
中图分类号
学科分类号
摘要
Human pose estimation in crowded scenes presents unique challenges due to frequent occlusions and complex interactions between individuals. To address these issues, we introduce InferTrans, a hierarchical structural fusion Transformer designed to improve crowded human pose estimation. InferTrans integrates semantic features into structural information using a hierarchical joint-limb-semantic fusion module. By reorganizing joints and limbs into a tree structure, the fusion module facilitates effective information exchange across different structural levels, and leverage both global structural information and local contextual details. Furthermore, we explicitly model limb structural patterns separately from joints, treating limbs as vectors with defined lengths and orientations. This allows our model to infer complete human poses from minimal input, significantly enhancing pose refinement tasks. Extensive experiments on multiple datasets demonstrate that InferTrans outperforms existing pose estimation techniques in crowded and occluded scenarios. The proposed InferTrans serves as a robust post-processing technique, and is capable of improving the accuracy and robustness of pose estimation in challenging environments. © 2024 Elsevier B.V.
引用
下载
收藏
相关论文
共 50 条
  • [1] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
    Zhou, Kangkang
    Zhang, Lijun
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
  • [2] Aggregation Transformer for Human Pose Estimation
    Dong, Hao
    Wang, Guodong
    Zhang, Xinyue
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3660 - 3667
  • [3] Pose Relation Transformer Refine Occlusions for Human Pose Estimation
    Chi, Hyung-gun
    Chi, Seunggeun
    Chan, Stanley
    Ramani, Karthik
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6138 - 6145
  • [4] MsF-HigherHRNet: Multi-scale Feature Fusion for Human Pose Estimation in Crowded Scenes
    Yu, Cuihong
    Han, Cheng
    Zhang, Qi
    Zhang, Chao
    COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, CAD/GRAPHICS 2023, 2024, 14250 : 16 - 29
  • [5] Human Pose Estimation for Real-World Crowded Scenarios
    Golda, Thomas
    Kalb, Tobias
    Schumann, Arne
    Beyerer, Juergen
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [6] Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
    Chang, Shuning
    Yuan, Li
    Nie, Xuecheng
    Huang, Ziyuan
    Zhou, Yichen
    Chen, Yupeng
    Feng, Jiashi
    Yan, Shuicheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4630 - 4634
  • [7] A Fast and Effective Transformer for Human Pose Estimation
    Wang, Dong
    Xie, Wenjun
    Cai, Youcheng
    Liu, Xiaoping
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 992 - 996
  • [8] Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation
    Fu, Qichen
    Liu, Xingyu
    Xu, Ran
    Niebles, Juan Carlos
    Kitani, Kris M.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23543 - 23554
  • [9] Gated Region-Refine pose transformer for human pose estimation
    Wang, Tianfeng
    Zhang, Xiaoxu
    NEUROCOMPUTING, 2023, 530 : 37 - 47
  • [10] Efficient Human Pose Estimation in Hierarchical Context
    Zhang, Feng
    Zhu, Xiatian
    Ye, Mao
    IEEE ACCESS, 2019, 7 : 29365 - 29373