Relation-aware interaction spatio-temporal network for 3D human pose estimation

被引:1
|
作者
Zhang, Hehao [1 ]
Hu, Zhengping [1 ]
Bi, Shuai [1 ]
Di, Jirui [1 ]
Sun, Zhe [1 ]
机构
[1] Yanshan Univ, Dept Informat Sci & Engn, Qinhuangdao 066000, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Bi-directional interaction module; Spatial kinematics modeling block; Temporal trajectory modeling block; Video processing;
D O I
10.1016/j.dsp.2024.104764
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
3D human pose estimation is a fundamental task in analyzing human behavior, which has many practical applications. However, existing methods suffer from high time complexity and weak capability to acquire the relations at the human joint level and the spatio-temporal level. To this end, the R elation-aware I nteraction S patio-temporal Net work (RISNet) is presented to achieve a better speed-accuracy trade-off in a parallel interactive architecture. Firstly, the Spatial Kinematics Modeling Block (SKMB) is proposed to encode spatially positional correlations among human joints, thereby capturing cross-joint kinematic dependencies in each frame. Secondly, the Temporal Trajectory Modeling Block (TTMB) is employed to further process the temporal motion trajectory of individual joints at several various frame scales. Besides, the bi-directional interaction modules across branches are presented to enhance modeling abilities at the spatio-temporal level. Experiments on Human 3.6M, HumanEva-I and MPI-INF-3DHP benchmarks indicate that the RISNet gains significant improvement compared to several state-of-the-art techniques. In conclusion, the proposed approach elegantly extracts critical features of body joints in the spatio-temporal domain with fewer model parameters and lower time complexity.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Attention guided spatio-temporal network for 3D signature recognition
    Aradhana Kumari Singh
    Deepika Koundal
    Multimedia Tools and Applications, 2024, 83 : 33985 - 33997
  • [42] Scene-aware Egocentric 3D Human Pose Estimation
    Wang, Jian
    Luvizon, Diogo
    Xu, Weipeng
    Liu, Lingjie
    Sarkar, Kripasindhu
    Theobalt, Christian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13031 - 13040
  • [43] Kinematics-aware spatial-temporal feature transform for 3D human pose estimation
    Du, Songlin
    Yuan, Zhiwei
    Ikenaga, Takeshi
    PATTERN RECOGNITION, 2024, 150
  • [44] Temporal Relation-Aware Global Attention Network for Temporal Action Detection
    Xu, Weijie
    Tan, Jingwei
    Wang, Shulin
    Yang, Sheng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 257 - 269
  • [45] Preterm Infants' Pose Estimation With Spatio-Temporal Features
    Moccia, Sara
    Migliorelli, Lucia
    Carnielli, Virgilio
    Frontoni, Emanuele
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (08) : 2370 - 2380
  • [46] U-shaped spatial–temporal transformer network for 3D human pose estimation
    Honghong Yang
    Longfei Guo
    Yumei Zhang
    Xiaojun Wu
    Machine Vision and Applications, 2022, 33
  • [47] Position constrained network for 3D human pose estimation
    Xiena Dong
    Jun Yu
    Jian Zhang
    Multimedia Systems, 2023, 29 : 459 - 468
  • [48] Position constrained network for 3D human pose estimation
    Dong, Xiena
    Yu, Jun
    Zhang, Jian
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 459 - 468
  • [49] Optimizing Network Structure for 3D Human Pose Estimation
    Ci, Hai
    Wang, Chunyu
    Ma, Xiaoxuan
    Wang, Yizhou
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
  • [50] PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis
    Cheng, Silin
    Chen, Xiwu
    He, Xinwei
    Liu, Zhe
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4436 - 4448