GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

被引:0
|
作者
Wang, Pengyuan [1 ]
Ikeda, Takuya [2 ]
Lee, Robert [2 ]
Nishiwaki, Koichi [2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Woven Toyota, Tokyo, Japan
来源
关键词
D O I
10.1007/978-3-031-73383-3_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level pose estimation is a challenging task with many potential applications in computer vision and robotics. Recently, deep-learning-based approaches have made great progress, but are typically hindered by the need for large datasets of either pose-labelled real images or carefully tuned photorealistic simulators. This can be avoided by using only geometry inputs such as depth images to reduce the domain-gap but these approaches suffer from a lack of semantic information, which can be vital in the pose estimation problem. To resolve this conflict, we propose to utilize both geometric and semantic features obtained from a pre-trained foundation model. Our approach projects 2D semantic features into object models as 3D semantic point clouds. Based on the novel 3D representation, we further propose a self-supervision pipeline, and match the fused semantic point clouds against their synthetic rendered partial observations from synthetic object models. The learned knowledge from synthetic data generalizes to observations of unseen objects in the real scenes, without any fine-tuning. We demonstrate this with a rich evaluation on the NOCS, Wild6D and SUN RGB-D benchmarks, showing superior performance over geometric-only and semantic-only baselines with significantly fewer training objects.
引用
收藏
页码:108 / 126
页数:19
相关论文
共 50 条
  • [1] Category-Level Articulated Object Pose Estimation
    Li, Xiaolong
    Wang, He
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3703 - 3712
  • [2] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    SENSORS, 2024, 24 (16)
  • [3] GenPose: Generative Category-level Object Pose Estimation via Diffusion Models
    Zhang, Jiyao
    Wu, Mingdong
    Dong, Hao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] iCaps: Iterative Category-Level Object Pose and Shape Estimation
    Deng, Xinke
    Geng, Junyi
    Bretl, Timothy
    Xiang, Yu
    Fox, Dieter
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 1784 - 1791
  • [5] A Visual Navigation Perspective for Category-Level Object Pose Estimation
    Guo, Jiaxin
    Zhong, Fangxun
    Xiong, Rong
    Liu, Yunhui
    Wang, Yue
    Liao, Yiyi
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
  • [6] Zero-Shot Category-Level Object Pose Estimation
    Goodwin, Walter
    Vaze, Sagar
    Havoutis, Ioannis
    Posner, Ingmar
    COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 516 - 532
  • [7] Generative Category-Level Shape and Pose Estimation with Semantic Primitives
    Li, Guanglin
    Li, Yifeng
    Ye, Zhichao
    Zhang, Qihang
    Kong, Tao
    Cui, Zhaopeng
    Zhang, Guofeng
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1390 - 1400
  • [8] TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
    Zhan, Yue
    Wang, Xin
    Nie, Lang
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9749 - 9762
  • [9] Category-Level Metric Scale Object Shape and Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Kim, Myungchul
    Kweon, I. S.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 8575 - 8582
  • [10] HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation
    Zheng, Linfang
    Wang, Chen
    Sun, Yinghan
    Dasgupta, Esha
    Chen, Hua
    Leonardis, Ales
    Zhang, Wei
    Chang, Hyung Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17163 - 17173