Relation-aware interaction spatio-temporal network for 3D human pose estimation

Cited by: 1
Authors
Zhang, Hehao [1]
Hu, Zhengping [1]
Bi, Shuai [1]
Di, Jirui [1]
Sun, Zhe [1]
Affiliations
[1] Yanshan Univ, Dept Informat Sci & Engn, Qinhuangdao 066000, Hebei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bi-directional interaction module; Spatial kinematics modeling block; Temporal trajectory modeling block; Video processing
DOI
10.1016/j.dsp.2024.104764
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
3D human pose estimation is a fundamental task in analyzing human behavior and has many practical applications. However, existing methods suffer from high time complexity and a weak capability to capture relations at the human joint level and the spatio-temporal level. To this end, the Relation-aware Interaction Spatio-temporal Network (RISNet) is presented to achieve a better speed-accuracy trade-off in a parallel interactive architecture. Firstly, the Spatial Kinematics Modeling Block (SKMB) is proposed to encode spatial positional correlations among human joints, thereby capturing cross-joint kinematic dependencies in each frame. Secondly, the Temporal Trajectory Modeling Block (TTMB) is employed to further process the temporal motion trajectory of individual joints at several frame scales. In addition, bi-directional interaction modules across the two branches are presented to enhance modeling ability at the spatio-temporal level. Experiments on the Human3.6M, HumanEva-I and MPI-INF-3DHP benchmarks indicate that RISNet achieves significant improvement over several state-of-the-art techniques. In conclusion, the proposed approach extracts critical features of body joints in the spatio-temporal domain with fewer model parameters and lower time complexity.
Pages: 13
Related papers
50 in total
  • [31] 3D human pose estimation via human structure-aware fully connected network
    Zhang, Xiaoyan
    Tang, Zhenhua
    Hou, Junhui
    Hao, Yanbin
    PATTERN RECOGNITION LETTERS, 2019, 125 : 404 - 410
  • [32] A 3D spatio-temporal motion estimation algorithm for video coding
    Lee, Gwo Giun
    Wang, Ming-Jiun
    Lin, He-Yuan
    Su, Drew Wei-Chi
    Lin, Bo-Yun
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 741 - +
  • [33] 3D Human Pose Estimation with Spatial and Temporal Transformers
    Zheng, Ce
    Zhu, Sijie
    Mendieta, Matias
    Yang, Taojiannan
    Chen, Chen
    Ding, Zhengming
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11636 - 11645
  • [34] Exploiting Temporal Information for 3D Human Pose Estimation
    Hossain, Mir Rayat Imtiaz
    Little, James J.
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
  • [35] Exploiting Temporal Correlations for 3D Human Pose Estimation
    Wang, Ruibin
    Ying, Xianghua
    Xing, Bowei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4527 - 4539
  • [36] Video-based body geometric aware network for 3D human pose estimation
    Chaonan Li
    Sheng Liu
    Lu Yao
    Siyu Zou
    Optoelectronics Letters, 2022, 18 : 313 - 320
  • [37] Video-based body geometric aware network for 3D human pose estimation
    LI Chaonan
    LIU Sheng
    YAO Lu
    ZOU Siyu
    Optoelectronics Letters, 2022, 18 (05) : 313 - 320
  • [38] Spatio-temporal SRU with global context-aware attention for 3D human action recognition
    She, Qingshan
    Mu, Gaoyuan
    Gan, Haitao
    Fan, Yingle
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 12349 - 12371
  • [39] Spatio-temporal SRU with global context-aware attention for 3D human action recognition
    Qingshan She
    Gaoyuan Mu
    Haitao Gan
    Yingle Fan
    Multimedia Tools and Applications, 2020, 79 : 12349 - 12371
  • [40] Attention guided spatio-temporal network for 3D signature recognition
    Singh, Aradhana Kumari
    Koundal, Deepika
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33985 - 33997