GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

被引：0

作者：

Wang, Pengyuan ^{[1
]}

Ikeda, Takuya ^{[2
]}

Lee, Robert ^{[2
]}

Nishiwaki, Koichi ^{[2
]}

机构：

[1] Tech Univ Munich, Munich, Germany

[2] Woven Toyota, Tokyo, Japan

来源：

COMPUTER VISION - ECCV 2024, PT XXVII | 2025年 / 15085卷

关键词：

D O I：

10.1007/978-3-031-73383-3_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Category-level pose estimation is a challenging task with many potential applications in computer vision and robotics. Recently, deep-learning-based approaches have made great progress, but are typically hindered by the need for large datasets of either pose-labelled real images or carefully tuned photorealistic simulators. This can be avoided by using only geometry inputs such as depth images to reduce the domain-gap but these approaches suffer from a lack of semantic information, which can be vital in the pose estimation problem. To resolve this conflict, we propose to utilize both geometric and semantic features obtained from a pre-trained foundation model. Our approach projects 2D semantic features into object models as 3D semantic point clouds. Based on the novel 3D representation, we further propose a self-supervision pipeline, and match the fused semantic point clouds against their synthetic rendered partial observations from synthetic object models. The learned knowledge from synthetic data generalizes to observations of unseen objects in the real scenes, without any fine-tuning. We demonstrate this with a rich evaluation on the NOCS, Wild6D and SUN RGB-D benchmarks, showing superior performance over geometric-only and semantic-only baselines with significantly fewer training objects.

引用

页码：108 / 126

页数：19

共 50 条

[1] Category-Level Articulated Object Pose Estimation
Li, Xiaolong
Wang, He
Yi, Li
Guibas, Leonidas
Abbott, A. Lynn
Song, Shuran
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3703 - 3712
[2] Category-Level Object Pose Estimation with Statistic Attention
Jiang, Changhong
Mu, Xiaoqiao
Zhang, Bingbing
Liang, Chao
Xie, Mujun
SENSORS, 2024, 24 (16)
[3] GenPose: Generative Category-level Object Pose Estimation via Diffusion Models
Zhang, Jiyao
Wu, Mingdong
Dong, Hao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[4] iCaps: Iterative Category-Level Object Pose and Shape Estimation
Deng, Xinke
Geng, Junyi
Bretl, Timothy
Xiang, Yu
Fox, Dieter
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 1784 - 1791
[5] A Visual Navigation Perspective for Category-Level Object Pose Estimation
Guo, Jiaxin
Zhong, Fangxun
Xiong, Rong
Liu, Yunhui
Wang, Yue
Liao, Yiyi
COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
[6] Zero-Shot Category-Level Object Pose Estimation
Goodwin, Walter
Vaze, Sagar
Havoutis, Ioannis
Posner, Ingmar
COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 516 - 532
[7] Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Li, Guanglin
Li, Yifeng
Ye, Zhichao
Zhang, Qihang
Kong, Tao
Cui, Zhaopeng
Zhang, Guofeng
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1390 - 1400
[8] TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
Zhan, Yue
Wang, Xin
Nie, Lang
Zhao, Yang
Yang, Tangwen
Ruan, Qiuqi
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9749 - 9762
[9] Category-Level Metric Scale Object Shape and Pose Estimation
Lee, Taeyeop
Lee, Byeong-Uk
Kim, Myungchul
Kweon, I. S.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 8575 - 8582
[10] HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation
Zheng, Linfang
Wang, Chen
Sun, Yinghan
Dasgupta, Esha
Chen, Hua
Leonardis, Ales
Zhang, Wei
Chang, Hyung Jin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17163 - 17173

← 1 2 3 4 5 →