Utilizing motion segmentation for optimizing the temporal adjacency matrix in 3D human pose estimation

被引:0
|
作者
Wang, Yingfeng [1 ]
Li, Muyu [3 ]
Yan, Hong [1 ,2 ]
机构
[1] Hong Kong Sci Pk, Ctr Intelligent Multidimens Data Anal, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[3] Dalian Univ Technol, Inst Intelligent Sci & Technol, Sch Control Sci & Engn, Dalian, Peoples R China
关键词
3D human pose estimation; Temporal adjacency matrix; Motion segmentation;
D O I
10.1016/j.neucom.2024.128153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In monocular 3D human pose estimation, modeling the temporal relation of human joints is crucial for prediction accuracy. Currently, most methods utilize transformer to model the temporal relation among joints. However, existing transformer-based methods have limitations. The temporal adjacency matrix utilized within the self-attention of the temporal transformer inaccurately models the temporal relationships between frames, particularly in cases where distinct motions exhibit significant correlation despite having different physical interpretations and large temporal spans. To address this issue, we construct an artificial temporal adjacency matrix based on input data and introduce a temporal adjacency matrix hybrid module to blend this matrix with the model's inherent temporal adjacency matrix, resulting in a novel composite temporal adjacency matrix. Through extensive experiments on Human3.6M and MPI-INF-3DHP datasets using state-of-the-art methods as benchmarks, our proposed method demonstrates a maximum improvement of up to 5.6% compared to the original approach.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Assembly Manipulation Understanding Based on 3D Object Pose Estimation and Human Motion Estimation
    Yamazaki, Kimitoshi
    Higashide, Taichi
    Tanaka, Daisuke
    Nagahama, Kotaro
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 802 - 807
  • [22] 3D Pose Estimation and Segmentation using Specular Cues
    Chang, Ju Yong
    Raskar, Ramesh
    Agrawal, Amit
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 1706 - +
  • [23] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Wei, Wen-Li
    Lin, Jen-Chun
    Liu, Tyng-Luh
    Liao, Hong-Yuan Mark
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, 2022-June : 13201 - 13210
  • [24] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Wei, Wen-Li
    Lin, Jen-Chun
    Liu, Tyng-Luh
    Liao, Hong-Yuan Mark
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13201 - 13210
  • [25] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Wei, Wen-Li
    Lin, Jen-Chun
    Liu, Tyng-Luh
    Liao, Hong-Yuan Mark
    arXiv, 2022,
  • [26] 3D Human Pose Estimation=2D Pose Estimation plus Matching
    Chen, Ching-Hang
    Ramanan, Deva
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5759 - 5767
  • [27] 3D human pose estimation in motion based on multi-stage regression
    Zhang, Yongtao
    Li, Shuang
    Long, Peng
    DISPLAYS, 2021, 69
  • [28] Temporal 3D Human Pose Estimation for Action Recognition from Arbitrary Viewpoints
    Musallam, Mohamed Adel
    Baptista, Renato
    Al Ismaeil, Kassem
    Aouada, Djamila
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 253 - 258
  • [29] REAL-TIME 3D RECONSTRUCTION AND POSE ESTIMATION FOR HUMAN MOTION ANALYSIS
    Graf, Holger
    Yoon, Sang Min
    Malerczyk, Cornelius
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3981 - 3984
  • [30] SPATIO-TEMPORAL ATTENTION GRAPH FOR MONOCULAR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Shao, Xiaohu
    Li, Zhenghao
    Zhou, Xiang-Dong
    Shi, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1231 - 1235