PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation

被引:0
|
作者
Zhu, Yiran [1 ]
Xu, Xing [1 ]
Shen, Fumin [1 ]
Ji, Yanli [1 ]
Gao, Lianli [1 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph neural networks (GNNs) have been widely used in the 3D human pose estimation task, since the pose representation of a human body can be naturally modeled by the graph structure. Generally, most of the existing GNN-based models utilize the restricted receptive fields of filters and single-scale information, while neglecting the valuable multiscale contextual information. To tackle this issue, we propose a novel model named Graph Transformer Encoder-Decoder with Atrous Convolution (PoseGTAC), to effectively extract multi-scale context and long-range information. Specifically, our PoseGTAC model has two key components: Graph Atrous Convolution (GAC) and Graph Transformer Layer (GTL), which are respectively for the extraction of local multi-scale and global long-range information. They are combined and stacked in an encoder-decoder structure, where graph pooling and unpooling are adopted for the interaction of multi-scale information from local to global aspect (e.g., part-scale and body-scale). Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets demonstrate that the proposed PoseGTAC model achieves state-of-the-art performance.
引用
收藏
页码:1359 / 1365
页数:7
相关论文
共 50 条
  • [21] Super-Resolution-Aided Sea Ice Concentration Estimation From AMSR2 Images by Encoder-Decoder Networks With Atrous Convolution
    Feng, Tiantian
    Liu, Xiaomin
    Li, Rongxing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 962 - 973
  • [22] Swin transformer with multiscale 3D atrous convolution for hyperspectral image classification
    Farooque, Ghulam
    Liu, Qichao
    Sargano, Allah Bux
    Xiao, Liang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [23] 3D hand pose estimation algorithm based on cascaded features and graph convolution
    Lin, Yi-lin
    Lin, Shan-ling
    Lin, Zhi-xian
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (06) : 736 - 745
  • [24] Dual-Path Transformer for 3D Human Pose Estimation
    Zhou, Lu
    Chen, Yingying
    Wang, Jinqiao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3260 - 3270
  • [25] Neuron segmentation using 3D wavelet integrated encoder-decoder network
    Li, Qiufu
    Shen, Linlin
    BIOINFORMATICS, 2022, 38 (03) : 809 - 817
  • [26] End-to-end 3D Human Pose Estimation with Transformer
    Zhang, Bowei
    Cui, Peng
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4529 - 4536
  • [27] 3D Human Pose Estimation in Video with Temporal and Spatial Transformer
    Peng, Sha
    Hu, Jiwei
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [28] Joint graph convolution networks and transformer for human pose estimation in sports technique analysis
    Cheng, Hongren
    Wang, Jing
    Zhao, Anran
    Zhong, Yaping
    Li, Jingli
    Dong, Liangshan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (10)
  • [29] Modulated Graph Convolutional Network for 3D Human Pose Estimation
    Zou, Zhiming
    Tang, Wei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11457 - 11467
  • [30] Flexible Graph Convolutional Network for 3D Human Pose Estimation
    Shahjahan, Abu Taib Mohammed
    Hamza, A. Ben
    arXiv,