Relation-aware interaction spatio-temporal network for 3D human pose estimation

Cited by: 1
Authors
Zhang, Hehao [1]
Hu, Zhengping [1]
Bi, Shuai [1]
Di, Jirui [1]
Sun, Zhe [1]
Affiliations
[1] Yanshan Univ, Dept Informat Sci & Engn, Qinhuangdao 066000, Hebei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bi-directional interaction module; Spatial kinematics modeling block; Temporal trajectory modeling block; Video processing
DOI
10.1016/j.dsp.2024.104764
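Chinese Library Classification (CLC) numbers
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract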
3D human pose estimation is a fundamental task in analyzing human behavior and has many practical applications. However, existing methods suffer from high time complexity and a weak capability to capture relations at the human joint level and the spatio-temporal level. To this end, the Relation-aware Interaction Spatio-temporal Network (RISNet) is presented to achieve a better speed-accuracy trade-off in a parallel interactive architecture. Firstly, the Spatial Kinematics Modeling Block (SKMB) is proposed to encode spatial positional correlations among human joints, thereby capturing cross-joint kinematic dependencies in each frame. Secondly, the Temporal Trajectory Modeling Block (TTMB) is employed to further process the temporal motion trajectory of individual joints at several frame scales. In addition, bi-directional interaction modules across branches are presented to enhance modeling ability at the spatio-temporal level. Experiments on the Human3.6M, HumanEva-I and MPI-INF-3DHP benchmarks indicate that RISNet achieves significant improvements over several state-of-the-art techniques. In conclusion, the proposed approach extracts critical features of body joints in the spatio-temporal domain with fewer model parameters and lower time complexity.
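The parallel layout described in the abstract can be pictured with a toy NumPy sketch. This is not the authors' RISNet implementation: the function names (`spatial_branch`, `temporal_branch`, `interact`, `toy_block`), the parameter-free attention, the moving-average windows, and the sigmoid gating are all assumptions chosen only to illustrate a spatial branch over joints, a multi-scale temporal branch over each joint's trajectory, and a bi-directional exchange between the two.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def spatial_branch(x):
    """Hypothetical stand-in for a spatial block: parameter-free
    self-attention across joints within each frame. x: (T, J, C)."""
    scores = x @ x.transpose(0, 2, 1) / np.sqrt(x.shape[-1])  # (T, J, J)
    return softmax(scores) @ x                                # (T, J, C)


def temporal_branch(x, scales=(1, 3, 9)):
    """Hypothetical stand-in for a temporal block: moving averages of each
    joint's trajectory at several odd window sizes. x: (T, J, C)."""
    T = x.shape[0]
    outs = []
    for k in scales:
        pad = k // 2
        xp = np.pad(x, ((pad, pad), (0, 0), (0, 0)), mode="edge")
        outs.append(np.stack([xp[t:t + k].mean(axis=0) for t in range(T)]))
    return np.mean(outs, axis=0)                              # (T, J, C)


def interact(s, t):
    """Assumed bi-directional exchange: each branch receives a
    sigmoid-gated copy of the other branch's features."""
    def gate(z):
        return 1.0 / (1.0 + np.exp(-z))
    return s + gate(t) * t, t + gate(s) * s


def toy_block(x):
    # Run both branches in parallel, exchange features, then fuse.
    s, t = spatial_branch(x), temporal_branch(x)
    s, t = interact(s, t)
    return 0.5 * (s + t)
```

Calling `toy_block` on an array of shape `(frames, joints, channels)` returns an array of the same shape. A real model would use learned projections and a regression head, which this sketch omits.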
Pages: 13
Related papers
50 in total
  • [21] An Articulated Structure-aware Network for 3D Human Pose Estimation
    Tang, Zhenhua
    Zhang, Xiaoyan
    Hou, Junhui
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 48 - 63
  • [22] MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video
    Zhang, Jinlu
    Tu, Zhigang
    Yang, Jianyu
    Chen, Yujin
    Yuan, Junsong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13222 - 13232
  • [23] Multi-view 3D Smooth Human Pose Estimation based on Heatmap Filtering and Spatio-temporal Information
    Niu, Zehai
    Lu, Ke
    Xue, Jian
    Ma, Haifeng
    Wei, Runchen
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 442 - 450
  • [24] A Study on 3D Human Pose Estimation with a Hybrid Algorithm of Spatio-temporal Semantic Graph Attention and Deep Learning
    Lin, Shengqing
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (04):
  • [25] Domain-Guided Spatio-Temporal Self-Attention for Egocentric 3D Pose Estimation
    Park, Jinman
    Kaai, Kimathi
    Hossain, Saad
    Sumi, Norikatsu
    Rambhatla, Sirisha
    Fieguth, Paul
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1837 - 1849
  • [26] STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation (vol 149, 105142, 2024)
    Hao, Feng
    Zhong, Fujin
    Wang, Yunhe
    Yu, Hong
    Hu, Jun
    Yang, Yan
    IMAGE AND VISION COMPUTING, 2024, 151
  • [27] ARHPE: Asymmetric Relation-Aware Representation Learning for Head Pose Estimation in Industrial Human-Computer Interaction
    Liu, Hai
    Liu, Tingting
    Zhang, Zhaoli
    Sangaiah, Arun Kumar
    Yang, Bing
    Li, Youfu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 7107 - 7117
  • [28] A cross-feature interaction network for 3D human pose estimation
    Peng, Jihua
    Zhou, Yanghong
    Mok, P. Y.
    PATTERN RECOGNITION LETTERS, 2025, 189 : 175 - 181
  • [29] A Spatio-temporal Transformer for 3D Human Motion Prediction
    Aksan, Emre
    Kaufmann, Manuel
    Cao, Peng
    Hilliges, Otmar
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574
  • [30] Relation-balanced graph convolutional network for 3D human pose estimation
    Chen, Lu
    Liu, Qiong
    IMAGE AND VISION COMPUTING, 2023, 140