PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation

被引:34
|
作者
Zhou, Guangyuan [1 ]
Wang, Huiqun [1 ,2 ]
Chen, Jiaxin [2 ]
Huang, Di [1 ,2 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV48922.2021.00279
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-D based 6D pose estimation has recently achieved remarkable progress, but still suffers from two major limitations: (1) ineffective representation of depth data and (2) insufficient integration of different modalities. This paper proposes a novel deep learning approach, namely Graph Convolutional Network with Point Refinement (PR-GCN), to simultaneously address the issues above in a unified way. It first introduces the Point Refinement Network (PRN) to polish 3D point clouds, recovering missing parts with noise removed. Subsequently, the Multi-Modal Fusion Graph Convolutional Network (MMF-GCN) is presented to strengthen RGB-D combination, which captures geometry-aware inter-modality correlation through local information propagation in the graph convolutional network. Extensive experiments are conducted on three widely used benchmarks, and state-of-the-art performance is reached. Besides, it is also shown that the proposed PRN and MMF-GCN modules are well generalized to other frameworks.
引用
收藏
页码:2773 / 2782
页数:10
相关论文
共 50 条
  • [21] Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
    Lipson, Lahav
    Teed, Zachary
    Goyal, Ankit
    Deng, Jia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6718 - 6727
  • [22] Prior-information-guided corresponding point regression network for 6D pose estimation
    Gan, Haiqing
    Wang, Lihui
    Su, Yuzuwei
    Ruan, Wenjun
    Jiao, Xize
    COMPUTERS & GRAPHICS-UK, 2024, 121
  • [23] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
    Liu, Minghao
    Wang, Wenshan
    Zhao, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3627 - 3641
  • [24] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
    Minghao Liu
    Wenshan Wang
    Wei Zhao
    Signal, Image and Video Processing, 2024, 18 : 3627 - 3641
  • [25] Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation
    Corsetti, Jaime
    Boscaini, Davide
    Poiesi, Fabio
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2095 - 2104
  • [26] CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers
    Castro, Pedro
    Kim, Tae-Kyun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5735 - 5744
  • [27] Deep Fusion for Multi-Modal 6D Pose Estimation
    Lin, Shifeng
    Wang, Zunran
    Zhang, Shenghao
    Ling, Yonggen
    Yang, Chenguang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6540 - 6549
  • [28] Deep Refinement Convolutional Networks for Human Pose Estimation
    Marras, Ioannis
    Palasek, Petar
    Patras, Ioannis
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 446 - 453
  • [29] Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation
    Wursthorn, Kira
    Hillemann, Markus
    Ulrich, Markus
    ISPRS ANNALS OF THE PHOTOGRAMMETRY, REMOTE SENSING AND SPATIAL INFORMATION SCIENCES: VOLUME X-2-2024, 2024, : 223 - 230
  • [30] RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering
    Iwase, Shun
    Liu, Xingyu
    Khirodkar, Rawal
    Yokota, Rio
    Kitani, Kris M.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3283 - 3292