VI-Net: Boosting Category-level 6D Object Pose Estimation via Learning Decoupled Rotations on the Spherical Representations

被引:3
|
作者
Lin, Jiehong [1 ,2 ]
Wei, Zewei [1 ]
Zhang, Yabin [3 ]
Jia, Kui [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] DexForce Technol Co Ltd, Shenzhen, Peoples R China
[3] Hong Kong Polytech Univ, Hong Kong, Peoples R China
关键词
D O I
10.1109/ICCV51070.2023.01287
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rotation estimation of high precision from an RGB-D object observation is a huge challenge in 6D object pose estimation, due to the difficulty of learning in the non-linear space of SO(3). In this paper, we propose a novel rotation estimation network, termed as VI- Net, to make the task easier by decoupling the rotation as the combination of a viewpoint rotation and an in-plane rotation. More specifically, VI-Net bases the feature learning on the sphere with two individual branches for the estimates of two factorized rotations, where a V-Branch is employed to learn the viewpoint rotation via binary classification on the spherical signals, while another I-Branch is used to estimate the in-plane rotation by transforming the signals to view from the zenith direction. To process the spherical signals, a Spherical Feature Pyramid Network is constructed based on a novel design of SPAtial Spherical Convolution (SPA-SConv), which settles the boundary problem of spherical signals via feature padding and realizes viewpoint-equivariant feature extraction by symmetric convolutional operations. We apply the proposed VI-Net to the challenging task of category-level 6D object pose estimation for predicting the poses of unknown objects without available CAD models; experiments on the benchmarking datasets confirm the efficacy of our method, which outperforms the existing ones with a large margin in the regime of high precision.
引用
收藏
页码:13955 / 13965
页数:11
相关论文
共 50 条
  • [21] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [22] Attention-guided RGB-D Fusion Network for Category-level 6D Object Pose Estimation
    Wang, Hao
    Li, Weiming
    Kim, Jiyeon
    Wang, Qiang
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10651 - 10658
  • [23] StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS
    Chen, Kai
    James, Stephen
    Sui, Congying
    Liu, Yun-Hui
    Abbeel, Pieter
    Dou, Qi
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2855 - 2861
  • [24] DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation
    Zhou, Lei
    Liu, Zhiyang
    Gan, Runze
    Wang, Haozhe
    Ang, Marcelo H., Jr.
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1192 - 1199
  • [25] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    [J]. IEEE Transactions on Multimedia, 2024, 26 : 1665 - 1680
  • [26] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1665 - 1680
  • [27] Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
    Lin, Jiehong
    Wei, Zewei
    Ding, Changxing
    Jia, Kui
    [J]. COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 19 - 34
  • [28] GenPose: Generative Category-level Object Pose Estimation via Diffusion Models
    Zhang, Jiyao
    Wu, Mingdong
    Dong, Hao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features
    Ma, Wufei
    Wang, Angtian
    Yuille, Alan
    Kortylewski, Adam
    [J]. COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 492 - 508
  • [30] SOCS: Semantically-aware Object Coordinate Space for Category-Level 6D Object Pose Estimation under Large Shape Variations
    Wan, Boyan
    Shi, Yifei
    Xu, Kai
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14019 - 14028