Utilizing motion segmentation for optimizing the temporal adjacency matrix in 3D human pose estimation

Cited by: 0

Authors:

Wang, Yingfeng [1]

Li, Muyu [3]

Yan, Hong [1,2]

Affiliations:

[1] Hong Kong Sci Pk, Ctr Intelligent Multidimens Data Anal, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

[3] Dalian Univ Technol, Inst Intelligent Sci & Technol, Sch Control Sci & Engn, Dalian, Peoples R China

Source:

NEUROCOMPUTING | 2024 / Vol. 600

Keywords:

3D human pose estimation; Temporal adjacency matrix; Motion segmentation;

DOI:

10.1016/j.neucom.2024.128153

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In monocular 3D human pose estimation, modeling the temporal relations of human joints is crucial for prediction accuracy. Most current methods use transformers to model the temporal relations among joints. However, existing transformer-based methods have limitations: the temporal adjacency matrix used within the self-attention of the temporal transformer models the temporal relationships between frames inaccurately, particularly when distinct motions exhibit significant correlation despite having different physical interpretations and large temporal spans. To address this issue, we construct an artificial temporal adjacency matrix based on the input data and introduce a temporal adjacency matrix hybrid module that blends this matrix with the model's inherent temporal adjacency matrix, yielding a novel composite temporal adjacency matrix. In extensive experiments on the Human3.6M and MPI-INF-3DHP datasets, using state-of-the-art methods as baselines, the proposed method achieves an improvement of up to 5.6% over the original approach.
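The blending idea in the abstract can be illustrated with a small sketch. The abstract does not say how the motion segmentation is computed or how the hybrid module combines the two matrices, so everything below is an assumption made for illustration: the threshold-based segmentation, the "same segment means connected" rule, the `alpha`-weighted sum, and all function names are hypothetical and are not taken from the paper.

```python
import math


def segment_labels(velocities, threshold):
    """Assumed segmentation rule: start a new segment whenever a frame
    switches between 'moving' (velocity > threshold) and 'still'."""
    labels, current, prev_moving = [], 0, None
    for v in velocities:
        moving = v > threshold
        if prev_moving is not None and moving != prev_moving:
            current += 1
        labels.append(current)
        prev_moving = moving
    return labels


def artificial_adjacency(labels):
    """Row-normalised matrix in which frame i is connected only to frames
    that share its segment label (an assumed construction)."""
    rows = []
    for li in labels:
        row = [1.0 if li == lj else 0.0 for lj in labels]
        total = sum(row)  # never zero: a frame always shares its own label
        rows.append([x / total for x in row])
    return rows


def softmax_rows(scores):
    """Row-wise softmax, standing in for the attention matrix a temporal
    transformer would produce from its query-key scores."""
    out = []
    for row in scores:
        m = max(row)
        exps = [math.exp(x - m) for x in row]
        s = sum(exps)
        out.append([e / s for e in exps])
    return out


def blend(attention, artificial, alpha):
    """Assumed hybrid rule: convex combination of the two matrices.
    Rows still sum to 1 because both inputs are row-normalised."""
    return [
        [(1.0 - alpha) * a + alpha * b for a, b in zip(ra, rb)]
        for ra, rb in zip(attention, artificial)
    ]


# Toy input: per-frame motion magnitudes for five frames (made-up values).
velocities = [0.1, 0.2, 1.5, 1.7, 0.1]
labels = segment_labels(velocities, threshold=0.5)

n = len(velocities)
attention = softmax_rows([[0.0] * n for _ in range(n)])  # uniform placeholder
artificial = artificial_adjacency(labels)
hybrid = blend(attention, artificial, alpha=0.5)
```

With these toy inputs, frames in the same segment receive more weight in `hybrid` than the uniform placeholder attention alone would give them. A convex combination is only one plausible choice here, picked because it keeps each row a valid probability distribution.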


Pages: 12
