SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct Category-Level Object Pose Estimation

被引:9
|
作者
Zhang, Ruida [1 ]
Di, Yan [2 ]
Manhardt, Fabian [3 ]
Tombari, Federico [2 ,3 ]
Ji, Xiangyang [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Tech Univ Munich, Munich, Germany
[3] Google, Mountain View, CA 94043 USA
关键词
D O I
10.1109/IROS47612.2022.9981506
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Category-level pose estimation is a challenging problem due to intra-class shape variations. Recent methods deform pre-computed shape priors to map the observed point cloud into the normalized object coordinate space and then retrieve the pose via post-processing, i.e., Umeyama's Algorithm. The shortcomings of this two-stage strategy lie in two aspects: 1) The surrogate supervision on the intermediate results can not directly guide the learning of pose, resulting in large pose error after post-processing. 2) The inference speed is limited by the post-processing step. In this paper, to handle these shortcomings, we propose an end-to-end trainable network SSP-Pose for category-level pose estimation, which integrates shape priors into a direct pose regression network. SSP-Pose stacks four individual branches on a shared feature extractor, where two branches are designed to deform and match the prior model with the observed instance, and the other two branches are applied for directly regressing the totally 9 degrees-of-freedom pose and performing symmetry reconstruction and point-wise inlier mask prediction respectively. Consistency loss terms are then naturally exploited to align the outputs of different branches and promote the performance. During inference, only the direct pose regression branch is needed. In this manner, SSP-Pose not only learns category-level pose-sensitive characteristics to boost performance but also keeps a real-time inference speed. Moreover, we utilize the symmetry information of each category to guide the shape prior deformation, and propose a novel symmetry-aware loss to mitigate the matching ambiguity. Extensive experiments on public datasets demonstrate that SSP-Pose produces superior performance compared with competitors with a real-time inference speed at about 25Hz. The codes will be released soon.
引用
收藏
页码:7452 / 7459
页数:8
相关论文
共 50 条
  • [1] iCaps: Iterative Category-Level Object Pose and Shape Estimation
    Deng, Xinke
    Geng, Junyi
    Bretl, Timothy
    Xiang, Yu
    Fox, Dieter
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 1784 - 1791
  • [2] Category-Level Metric Scale Object Shape and Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Kim, Myungchul
    Kweon, I. S.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 8575 - 8582
  • [3] Category-Level Articulated Object Pose Estimation
    Li, Xiaolong
    Wang, He
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3703 - 3712
  • [4] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    [J]. SENSORS, 2024, 24 (16)
  • [5] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [6] SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation
    Li, Guowei
    Zhu, Dongchen
    Zhang, Guanghui
    Shi, Wenjun
    Zhang, Tianyu
    Zhang, Xiaolin
    Li, Jiamao
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5674 - 5683
  • [7] A Visual Navigation Perspective for Category-Level Object Pose Estimation
    Guo, Jiaxin
    Zhong, Fangxun
    Xiong, Rong
    Liu, Yunhui
    Wang, Yue
    Liao, Yiyi
    [J]. COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
  • [8] Zero-Shot Category-Level Object Pose Estimation
    Goodwin, Walter
    Vaze, Sagar
    Havoutis, Ioannis
    Posner, Ingmar
    [J]. COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 516 - 532
  • [9] TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
    Zhan, Yue
    Wang, Xin
    Nie, Lang
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9749 - 9762
  • [10] Optimal Pose and Shape Estimation for Category-level 3D Object Perception
    Shi, Jingnan
    Yang, Heng
    Carlone, Luca
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,