A cross-feature interaction network for 3D human pose estimation

被引:0
|
作者
Peng, Jihua [1 ]
Zhou, Yanghong [3 ]
Mok, P. Y. [1 ,2 ,4 ,5 ]
机构
[1] Hong Kong Polytech Univ, Sch Fash & Text, Hong Kong, Peoples R China
[2] Lab Artificial Intelligence Design, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Res Ctr Text Future Fash, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, Res Inst Sports Sci & Technol, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Div Integrat Syst & Design, Hong Kong, Peoples R China
关键词
3D human pose estimation; graph convolutional network (GCN); self-attention; cross-attention;
D O I
10.1016/j.patrec.2025.01.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of estimating 3D human poses from single monocular images is challenging because, unlike video sequences, single images can hardly provide any temporal information for the prediction. Most existing methods attempt to predict 3D poses by modeling the spatial dependencies inherent in the anatomical structure of the human skeleton, yet these methods fail to capture the complex local and global relationships that exist among various joints. To solve this problem, we propose a novel Cross-Feature Interaction Network to effectively model spatial correlations between body joints. Specifically, we exploit graph convolutional networks (GCNs) to learn the local features between neighboring joints and the self-attention structure to learn the global features among all joints. We then design a cross-feature interaction (CFI) module to facilitate cross-feature communications among the three different features, namely the local features, global features, and initial 2D pose features, aggregating them to form enhanced spatial representations of human pose. Furthermore, a novel graph-enhanced module (GraMLP) with parallel GCN and multi-layer perceptron is introduced to inject the skeletal knowledge of the human body into the final representation of 3D pose. Extensive experiments on two datasets (Human3.6M (Ionescu et al., 2013) and MPI-INF-3DHP (Mehta et al., 2017)) show the superior performance of our method in comparison to existing state-of-the-art (SOTA) models. The code and data are shared at https://github.com/JihuaPeng/CFI-3DHPE
引用
收藏
页码:175 / 181
页数:7
相关论文
共 50 条
  • [1] Feature Boosting Network For 3D Pose Estimation
    Liu, Jun
    Ding, Henghui
    Shahroudy, Amir
    Duan, Ling-Yu
    Jiang, Xudong
    Wang, Gang
    Kot, Alex C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 494 - 501
  • [2] Cross View Fusion for 3D Human Pose Estimation
    Qiu, Haibo
    Wang, Chunyu
    Wang, Jingdong
    Wang, Naiyan
    Zeng, Wenjun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4341 - 4350
  • [3] Position constrained network for 3D human pose estimation
    Xiena Dong
    Jun Yu
    Jian Zhang
    Multimedia Systems, 2023, 29 : 459 - 468
  • [4] Position constrained network for 3D human pose estimation
    Dong, Xiena
    Yu, Jun
    Zhang, Jian
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 459 - 468
  • [5] Optimizing Network Structure for 3D Human Pose Estimation
    Ci, Hai
    Wang, Chunyu
    Ma, Xiaoxuan
    Wang, Yizhou
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
  • [6] MCFNet: Multi-scale Cross Fusion Network for 3D Human Pose Estimation
    Wang, Dazhong
    Liu, Rui
    Yi, Pengfei
    Dong, Jing
    Zhou, Dongsheng
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 684 - 688
  • [7] A New 3D Human Pose Estimation Network for Knee Posture Estimation
    Cui, Shangqi
    Peng, Fan
    Yang, Zhi
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 734 - 739
  • [8] A Baseline for Cross-Database 3D Human Pose Estimation
    Rapczynski, Michal
    Werner, Philipp
    Handrich, Sebastian
    Al-Hamadi, Ayoub
    SENSORS, 2021, 21 (11)
  • [9] Global and local feature communications with transformers for 3D human pose estimation
    No, Changho
    Lee, Minsik
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [10] 3D Human Pose Estimation Based on Multi-feature Extraction
    Ge, Senlin
    Yu, Huan
    Zhang, Yuanming
    Shi, Huitao
    Gao, Hao
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 570 - 581