PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation

Cited by: 34

Authors:

Zhou, Guangyuan ^{[1]}

Wang, Huiqun ^{[1,2]}

Chen, Jiaxin ^{[2]}

Huang, Di ^{[1,2]}

Affiliations:

[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China

[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1109/ICCV48922.2021.00279

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

RGB-D based 6D pose estimation has recently achieved remarkable progress, but it still suffers from two major limitations: (1) ineffective representation of depth data and (2) insufficient integration of different modalities. This paper proposes a novel deep learning approach, namely Graph Convolutional Network with Point Refinement (PR-GCN), to address both issues in a unified way. It first introduces the Point Refinement Network (PRN) to polish 3D point clouds, recovering missing parts and removing noise. The Multi-Modal Fusion Graph Convolutional Network (MMF-GCN) is then presented to strengthen RGB-D fusion; it captures geometry-aware inter-modality correlation through local information propagation in the graph convolutional network. Extensive experiments on three widely used benchmarks show state-of-the-art performance. The proposed PRN and MMF-GCN modules are also shown to generalize well to other frameworks.

Pages: 2773 - 2782

Page count: 10

50 records in total

[21] Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
Lipson, Lahav
Teed, Zachary
Goyal, Ankit
Deng, Jia
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6718 - 6727
[22] Prior-information-guided corresponding point regression network for 6D pose estimation
Gan, Haiqing
Wang, Lihui
Su, Yuzuwei
Ruan, Wenjun
Jiao, Xize
COMPUTERS & GRAPHICS-UK, 2024, 121
[23] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
Liu, Minghao
Wang, Wenshan
Zhao, Wei
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3627 - 3641
[24] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
Minghao Liu
Wenshan Wang
Wei Zhao
Signal, Image and Video Processing, 2024, 18 : 3627 - 3641
[25] Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation
Corsetti, Jaime
Boscaini, Davide
Poiesi, Fabio
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2095 - 2104
[26] CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers
Castro, Pedro
Kim, Tae-Kyun
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5735 - 5744
[27] Deep Fusion for Multi-Modal 6D Pose Estimation
Lin, Shifeng
Wang, Zunran
Zhang, Shenghao
Ling, Yonggen
Yang, Chenguang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6540 - 6549
[28] Deep Refinement Convolutional Networks for Human Pose Estimation
Marras, Ioannis
Palasek, Petar
Patras, Ioannis
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 446 - 453
[29] Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation
Wursthorn, Kira
Hillemann, Markus
Ulrich, Markus
ISPRS ANNALS OF THE PHOTOGRAMMETRY, REMOTE SENSING AND SPATIAL INFORMATION SCIENCES: VOLUME X-2-2024, 2024, : 223 - 230
[30] RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering
Iwase, Shun
Liu, Xingyu
Khirodkar, Rawal
Yokota, Rio
Kitani, Kris M.
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3283 - 3292