CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation

被引:34
|
作者
Irshad, Muhammad Zubair [1 ]
Kollar, Thomas [2 ]
Laskey, Michael [2 ]
Stone, Kevin [2 ]
Kira, Zsolt [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA USA
[2] Toyota Res Inst, Los Altos, CA USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022 | 2022年
关键词
D O I
10.1109/ICRA46639.2022.9811799
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the complex task of simultaneous multi-object 3D reconstruction, 6D pose and size estimation from a single-view RGB-D observation. In contrast to instance-level pose estimation, we focus on a more challenging problem where CAD models are not available at inference time. Existing approaches mainly follow a complex multi-stage pipeline which first localizes and detects each object instance in the image and then regresses to either their 3D meshes or 6D poses. These approaches suffer from high-computational cost and low performance in complex multi-object scenarios, where occlusions can be present. Hence, we present a simple onestage approach to predict both the 3D shape and estimate the 6D pose and size jointly in a bounding-box free manner. In particular, our method treats object instances as spatial centers where each center denotes the complete shape of an object along with its 6D pose and size. Through this perpixel representation, our approach can reconstruct in realtime (40 FPS) multiple novel object instances and predict their 6D pose and sizes in a single-forward pass. Through extensive experiments, we demonstrate that our approach significantly outperforms all shape completion and categorical 6D pose and size estimation baselines on multi-object ShapeNet and NOCS datasets respectively with a 12.6% absolute improvement in mAP for 6D pose for novel real-world object instances.
引用
收藏
页码:10632 / 10640
页数:9
相关论文
共 50 条
  • [1] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [2] PoET: Pose Estimation Transformer for Single-View, Multi-Object 6D Pose Estimation
    Jantos, Thomas
    Hamdad, Mohamed Amin
    Granig, Wolfgang
    Weiss, Stephan
    Steinbrener, Jan
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1060 - 1070
  • [3] Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
    Lipson, Lahav
    Teed, Zachary
    Goyal, Ankit
    Deng, Jia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6718 - 6727
  • [4] 6D pose estimation and 3D object reconstruction from 2D shape for robotic grasping of objects
    Wolnitza, Marcell
    Kaya, Osman
    Kulvicius, Tomas
    Woergoetter, Florentin
    Dellen, Babette
    2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC, 2022, : 67 - 71
  • [5] YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation
    Maji, Debapriya
    Nagori, Soyeb
    Mathew, Manu
    Poddar, Deepak
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1616 - 1625
  • [6] Learning 6D Object Pose Estimation Using 3D Object Coordinates
    Brachmann, Eric
    Krull, Alexander
    Michel, Frank
    Gumhold, Stefan
    Shotton, Jamie
    Rother, Carsten
    COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 : 536 - 551
  • [7] 6D Object Pose Estimation using Few-Shot Instance Segmentation and 3D Matching
    Li, Wanyi
    Sun, Jia
    Luo, Yongkang
    Wang, Peng
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1071 - 1077
  • [8] Single-shot 3D multi-person pose estimation in complex images
    Benzine, Abdallah
    Luvison, Bertrand
    Pham, Quoc Cuong
    Achard, Catherine
    PATTERN RECOGNITION, 2021, 112
  • [9] ACCURATE 6D OBJECT POSE ESTIMATION BY POSE CONDITIONED MESH RECONSTRUCTION
    Castro, Pedro
    Armagan, Anil
    Kim, Tae-Kyun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4147 - 4151
  • [10] Estimation method of category-level multi-object rigid body 6D pose
    Cheng, Shuo
    Jia, Di
    Yang, Liu
    He, Dekun
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2025, 40 (03) : 457 - 471