PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation

被引:0
|
作者
Zhu, Yiran [1 ]
Xu, Xing [1 ]
Shen, Fumin [1 ]
Ji, Yanli [1 ]
Gao, Lianli [1 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph neural networks (GNNs) have been widely used in the 3D human pose estimation task, since the pose representation of a human body can be naturally modeled by the graph structure. Generally, most of the existing GNN-based models utilize the restricted receptive fields of filters and single-scale information, while neglecting the valuable multiscale contextual information. To tackle this issue, we propose a novel model named Graph Transformer Encoder-Decoder with Atrous Convolution (PoseGTAC), to effectively extract multi-scale context and long-range information. Specifically, our PoseGTAC model has two key components: Graph Atrous Convolution (GAC) and Graph Transformer Layer (GTL), which are respectively for the extraction of local multi-scale and global long-range information. They are combined and stacked in an encoder-decoder structure, where graph pooling and unpooling are adopted for the interaction of multi-scale information from local to global aspect (e.g., part-scale and body-scale). Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets demonstrate that the proposed PoseGTAC model achieves state-of-the-art performance.
引用
收藏
页码:1359 / 1365
页数:7
相关论文
共 50 条
  • [31] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [32] Graph Stacked Hourglass Networks for 3D Human Pose Estimation
    Xu, Tianhan
    Takano, Wataru
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16100 - 16109
  • [33] Iterative graph filtering network for 3D human pose estimation
    Islam, Zaedul
    Ben Hamza, A.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [34] SGAT: Semantic Graph Attention for 3D human pose estimation
    Schirmer, Luiz
    Lucio, Djalma
    Cruz, Leandro
    Raposo, Alberto
    Velho, Luiz
    Lopes, Helio
    2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 255 - 262
  • [35] Iterative Graph Filtering Network for 3D Human Pose Estimation
    Islam, Zaedul
    Ben Hamza, A.
    arXiv, 2023,
  • [36] Regular Splitting Graph Network for 3D Human Pose Estimation
    Hassan, Md. Tanvir
    Ben Hamza, A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4212 - 4222
  • [37] Automatic Evaluation Method for Functional Movement Screening Based on Multi-Scale Lightweight 3D Convolution and an Encoder-Decoder
    Lin, Xiuchun
    Liu, Yichao
    Feng, Chen
    Chen, Zhide
    Yang, Xu
    Cui, Hui
    ELECTRONICS, 2024, 13 (10)
  • [38] Human pose estimation based on parallel atrous convolution and body structure constraints
    Zhang, Min
    Yang, Haijie
    Li, Pengfei
    Jiang, Ming
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 5553 - 5563
  • [39] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
    Chung, Yi-Hao
    Chen, Yen-Lin
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,
  • [40] 3D Image Inpainting for Rotor Detection using 3D Encoder-Decoder Generative Adversarial Network
    Chung, Yi-Hao
    Chen, Yen-Lin
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,