3D Human Pose Estimation Using Improved Semantic Graph Convolutional Based on Fusing Non-local Neural Network and Multi-Head Attention

被引：0

作者：

Gui W. ^{[1
]}

Luo Y. ^{[1
]}

机构：

[1] School of Electrical and Information Engineering, Zhengzhou University, 100 Avenue of Science, Zhengzhou

来源：

Journal of The Institution of Engineers (India): Series B | 2024年 / 105卷 / 05期

关键词：

3D human pose estimation; Multi-head attention mechanism; Non-local neural networks; Semantic graph convolutional networks;

D O I：

10.1007/s40031-024-01050-x

中图分类号：

学科分类号：

摘要：

Although semantic graph convolutions networks can effectively learn the dependencies between joints and bones, their accuracy in estimating human body coordinates is not high. Aiming at solving the above problem, this paper studies semantic graph convolutional networks and discovers the limitations of capturing complex long-range dependencies and assigning appropriate importance weights across graph nodes. To overcome these issues, a novel module, NMHA, is built by fusing multi-head attention and non-local neural networks to enhance the relational modeling capabilities of semantic graph convolutional networks. Furthermore, this paper proposes a new 3D human pose estimation model, NMHA-SemGCN, which incorporates NMHA to better address the defects of human pose estimation. Detailed experiments conducted on the Human3.6M and HumanEva-I datasets reveal that NMHA-SemGCN achieves significant improvements in accuracy over the previous approach. These results show the effectiveness and innovation of our method. Moreover, the paper presents a comprehensive approach for estimating human poses from monocular images to 3D skeletal coordinates utilizing the NMHA-SemGCN model, demonstrating its potential for practical applications. © The Institution of Engineers (India) 2024.

引用

页码：1109 / 1119

页数：10

共 50 条

[21] Arabic cyberbullying detection system using convolutional neural network and multi-head attention
Azzeh M.
Alhijawi B.
Tabbaza A.
Alabboshi O.
Hamdan N.
Jaser D.
International Journal of Speech Technology, 2024, 27 (03) : 521 - 537
[22] Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
Liang Lingyu
Zhang Tiantian
He Wei
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (13)
[23] Multi-Label Patent Categorization with Non-Local Attention-Based Graph Convolutional Network
Tang, Pingjie
Jiang, Meng
Xia, Bryan
Pitera, Jed W.
Welser, Jeffrey
Chawla, Nitesh, V
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9024 - 9031
[24] Robust 3D Shape Classification via Non-local Graph Attention Network
Qin, Shengwei
Li, Zhong
Liu, Ligang
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5374 - 5383
[25] Research on power generation prediction of hydropower in river basin based on multi-head attention graph convolutional neural network
Chen, Zhiliang
Wang, Juan
Wei, Miao
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (02) : 797 - 811
[26] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
Zou, Zhiming
Liu, Tianqi
Wu, Dapeng
Tang, Wei
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[27] Prediction of circRNA-Disease Associations Based on the Combination of Multi-Head Graph Attention Network and Graph Convolutional Network
Cao, Ruifen
He, Chuan
Wei, Pijing
Su, Yansen
Xia, Junfeng
Zheng, Chunhou
BIOMOLECULES, 2022, 12 (07)
[28] Multi-hop Modulated Graph Convolutional Networks for 3D Human Pose Estimation
Lee, Jae Yung
Kim, I. Gil
BMVC 2022 - 33rd British Machine Vision Conference Proceedings, 2022,
[29] SINGLE IMAGE SUPER-RESOLUTION USING A NON-LOCAL 3D CONVOLUTIONAL NEURAL NETWORK
Xiong, Zhuang
Tao, Xiaoming
Zhao, Nan
Lin, Baihong
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 31 - 35
[30] Deep Semantic Graph Transformer for Multi-View 3D Human Pose Estimation
Zhang, Lijun
Zhou, Kangkang
Lu, Feng
Zhou, Xiang-Dong
Shi, Yu
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7205 - 7214

← 1 2 3 4 5 →