Utilizing motion segmentation for optimizing the temporal adjacency matrix in 3D human pose estimation

被引:0
|
作者
Wang, Yingfeng [1 ]
Li, Muyu [3 ]
Yan, Hong [1 ,2 ]
机构
[1] Hong Kong Sci Pk, Ctr Intelligent Multidimens Data Anal, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[3] Dalian Univ Technol, Inst Intelligent Sci & Technol, Sch Control Sci & Engn, Dalian, Peoples R China
关键词
3D human pose estimation; Temporal adjacency matrix; Motion segmentation;
D O I
10.1016/j.neucom.2024.128153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In monocular 3D human pose estimation, modeling the temporal relation of human joints is crucial for prediction accuracy. Currently, most methods utilize transformer to model the temporal relation among joints. However, existing transformer-based methods have limitations. The temporal adjacency matrix utilized within the self-attention of the temporal transformer inaccurately models the temporal relationships between frames, particularly in cases where distinct motions exhibit significant correlation despite having different physical interpretations and large temporal spans. To address this issue, we construct an artificial temporal adjacency matrix based on input data and introduce a temporal adjacency matrix hybrid module to blend this matrix with the model's inherent temporal adjacency matrix, resulting in a novel composite temporal adjacency matrix. Through extensive experiments on Human3.6M and MPI-INF-3DHP datasets using state-of-the-art methods as benchmarks, our proposed method demonstrates a maximum improvement of up to 5.6% compared to the original approach.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Optimizing Network Structure for 3D Human Pose Estimation
    Ci, Hai
    Wang, Chunyu
    Ma, Xiaoxuan
    Wang, Yizhou
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
  • [2] 3D Human Pose Estimation with Spatial and Temporal Transformers
    Zheng, Ce
    Zhu, Sijie
    Mendieta, Matias
    Yang, Taojiannan
    Chen, Chen
    Ding, Zhengming
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11636 - 11645
  • [3] Exploiting Temporal Information for 3D Human Pose Estimation
    Hossain, Mir Rayat Imtiaz
    Little, James J.
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
  • [4] Exploiting Temporal Correlations for 3D Human Pose Estimation
    Wang, Ruibin
    Ying, Xianghua
    Xing, Bowei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4527 - 4539
  • [5] Part Segmentation of Visual Hull for 3D Human Pose Estimation
    Kanaujia, Atul
    Kittens, Nicholas
    Ramanathan, Narayanan
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 542 - 549
  • [6] HuMoR: 3D Human Motion Model for Robust Pose Estimation
    Rempe, Davis
    Birdal, Tolga
    Hertzmann, Aaron
    Yang, Jimei
    Sridhar, Srinath
    Guibas, Leonidas J.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11468 - 11479
  • [7] On the Effect of Temporal Information on Monocular 3D Human Pose Estimation
    Brauer, Juergen
    Gong, Wenjuan
    Gonzalez, Jordi
    Arens, Michael
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [8] Exploiting temporal context for 3D human pose estimation in the wild
    Arnab, Anurag
    Doersch, Carl
    Zisserman, Andrew
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3390 - 3399
  • [9] 3D Human Pose Estimation in Video with Temporal and Spatial Transformer
    Peng, Sha
    Hu, Jiwei
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [10] Pose Estimation and Segmentation of People in 3D Movies
    Alahari, Karteek
    Seguin, Guillaume
    Sivic, Josef
    Laptev, Ivan
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2112 - 2119