3D Human Pose Estimation Using Improved Semantic Graph Convolutional Based on Fusing Non-local Neural Network and Multi-Head Attention

被引:0
|
作者
Gui W. [1 ]
Luo Y. [1 ]
机构
[1] School of Electrical and Information Engineering, Zhengzhou University, 100 Avenue of Science, Zhengzhou
关键词
3D human pose estimation; Multi-head attention mechanism; Non-local neural networks; Semantic graph convolutional networks;
D O I
10.1007/s40031-024-01050-x
中图分类号
学科分类号
摘要
Although semantic graph convolutions networks can effectively learn the dependencies between joints and bones, their accuracy in estimating human body coordinates is not high. Aiming at solving the above problem, this paper studies semantic graph convolutional networks and discovers the limitations of capturing complex long-range dependencies and assigning appropriate importance weights across graph nodes. To overcome these issues, a novel module, NMHA, is built by fusing multi-head attention and non-local neural networks to enhance the relational modeling capabilities of semantic graph convolutional networks. Furthermore, this paper proposes a new 3D human pose estimation model, NMHA-SemGCN, which incorporates NMHA to better address the defects of human pose estimation. Detailed experiments conducted on the Human3.6M and HumanEva-I datasets reveal that NMHA-SemGCN achieves significant improvements in accuracy over the previous approach. These results show the effectiveness and innovation of our method. Moreover, the paper presents a comprehensive approach for estimating human poses from monocular images to 3D skeletal coordinates utilizing the NMHA-SemGCN model, demonstrating its potential for practical applications. © The Institution of Engineers (India) 2024.
引用
收藏
页码:1109 / 1119
页数:10
相关论文
共 50 条
  • [21] Arabic cyberbullying detection system using convolutional neural network and multi-head attention
    Azzeh M.
    Alhijawi B.
    Tabbaza A.
    Alabboshi O.
    Hamdan N.
    Jaser D.
    International Journal of Speech Technology, 2024, 27 (03) : 521 - 537
  • [22] Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
    Liang Lingyu
    Zhang Tiantian
    He Wei
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (13)
  • [23] Multi-Label Patent Categorization with Non-Local Attention-Based Graph Convolutional Network
    Tang, Pingjie
    Jiang, Meng
    Xia, Bryan
    Pitera, Jed W.
    Welser, Jeffrey
    Chawla, Nitesh, V
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9024 - 9031
  • [24] Robust 3D Shape Classification via Non-local Graph Attention Network
    Qin, Shengwei
    Li, Zhong
    Liu, Ligang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5374 - 5383
  • [25] Research on power generation prediction of hydropower in river basin based on multi-head attention graph convolutional neural network
    Chen, Zhiliang
    Wang, Juan
    Wei, Miao
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (02) : 797 - 811
  • [26] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [27] Prediction of circRNA-Disease Associations Based on the Combination of Multi-Head Graph Attention Network and Graph Convolutional Network
    Cao, Ruifen
    He, Chuan
    Wei, Pijing
    Su, Yansen
    Xia, Junfeng
    Zheng, Chunhou
    BIOMOLECULES, 2022, 12 (07)
  • [28] Multi-hop Modulated Graph Convolutional Networks for 3D Human Pose Estimation
    Lee, Jae Yung
    Kim, I. Gil
    BMVC 2022 - 33rd British Machine Vision Conference Proceedings, 2022,
  • [29] SINGLE IMAGE SUPER-RESOLUTION USING A NON-LOCAL 3D CONVOLUTIONAL NEURAL NETWORK
    Xiong, Zhuang
    Tao, Xiaoming
    Zhao, Nan
    Lin, Baihong
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 31 - 35
  • [30] Deep Semantic Graph Transformer for Multi-View 3D Human Pose Estimation
    Zhang, Lijun
    Zhou, Kangkang
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7205 - 7214