Optimal Pose and Shape Estimation for Category-level 3D Object Perception

被引:0
|
作者
Shi, Jingnan [1 ]
Yang, Heng [1 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, Cambridge, MA 02139 USA
关键词
MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). We consider an active shape model, where -for an object category- we are given a library of potential CAD models describing objects in that category, and we adopt a standard formulation where pose and shape estimation are formulated as a non-convex optimization. Our first contribution is to provide the first certifiably optimal solver for pose and shape estimation. In particular, we show that rotation estimation can be decoupled from the estimation of the object translation and shape, and we demonstrate that (i) the optimal object rotation can be computed via a tight (small-size) semidefinite relaxation, and (ii) the translation and shape parameters can be computed in closed-form given the rotation. Our second contribution is to add an outlier rejection layer to our solver, hence making it robust to a large number of misdetections. Towards this goal, we wrap our optimal solver in a robust estimation scheme based on graduated non-convexity. To further enhance robustness to outliers, we also develop the first graph-theoretic formulation to prune outliers in category-level perception, which removes outliers via convex hull and maximum clique computations; the resulting approach is robust to 70 - 90% outliers. Our third contribution is an extensive experimental evaluation. Besides providing an ablation study on a simulated dataset and on the PASCAL3D+ dataset, we combine our solver with a deep-learned keypoint detector, and show that the resulting approach improves over the state of the art in vehicle pose estimation in the ApolloScape datasets.
引用
收藏
页数:17
相关论文
共 50 条
  • [11] A Visual Navigation Perspective for Category-Level Object Pose Estimation
    Guo, Jiaxin
    Zhong, Fangxun
    Xiong, Rong
    Liu, Yunhui
    Wang, Yue
    Liao, Yiyi
    [J]. COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
  • [12] Zero-Shot Category-Level Object Pose Estimation
    Goodwin, Walter
    Vaze, Sagar
    Havoutis, Ioannis
    Posner, Ingmar
    [J]. COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 516 - 532
  • [13] MSSPA-GC: Multi-Scale Shape Prior Adaptation with 3D Graph Convolutions for Category-Level Object Pose Estimation
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    [J]. NEURAL NETWORKS, 2023, 166 : 609 - 621
  • [14] Flexible object models for category-level 3D object recognition
    Kushal, Akash
    Schmid, Cordelia
    Ponce, Jean
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1370 - +
  • [15] Median-shape Representation Learning for Category-level Object Pose Estimation in Cluttered Environments
    Tatemichi, Hiroki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Amma, Ayako
    Murase, Hiroshi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4473 - 4480
  • [16] Open-Vocabulary Category-Level Object Pose and Size Estimation
    Cai, Junhao
    He, Yisheng
    Yuan, Weihao
    Zhu, Siyu
    Dong, Zilong
    Bo, Liefeng
    Chen, Qifeng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7661 - 7668
  • [17] Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation
    Tang, Kaifeng
    Xu, Chi
    Chen, Ming
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 53043 - 53063
  • [18] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [19] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
    Wang, He
    Sridhar, Srinath
    Huang, Jingwei
    Valentin, Julien
    Song, Shuran
    Guibas, Leonidas J.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
  • [20] TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
    Zhan, Yue
    Wang, Xin
    Nie, Lang
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9749 - 9762