A Visual Navigation Perspective for Category-Level Object Pose Estimation

被引:1
|
作者
Guo, Jiaxin [1 ]
Zhong, Fangxun [2 ]
Xiong, Rong [1 ]
Liu, Yunhui [2 ]
Wang, Yue [1 ]
Liao, Yiyi [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
来源
关键词
Category-level object pose estimation; Analysis-by-synthesis;
D O I
10.1007/978-3-031-20068-7_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies category-level object pose estimation based on a single monocular image. Recent advances in pose-aware generative models have paved the way for addressing this challenging task using analysis-by-synthesis. The idea is to sequentially update a set of latent variables, e.g., pose, shape, and appearance, of the generative model until the generated image best agrees with the observation. However, convergence and efficiency are two challenges of this inference procedure. In this paper, we take a deeper look at the inference of analysis-by-synthesis from the perspective of visual navigation, and investigate what is a good navigation policy for this specific task. We evaluate three different strategies, including gradient descent, reinforcement learning and imitation learning, via thorough comparisons in terms of convergence, robustness and efficiency. Moreover, we show that a simple hybrid approach leads to an effective and efficient solution. We further compare these strategies to state-of-the-art methods, and demonstrate superior performance on synthetic and real-world datasets leveraging off-the-shelf pose-aware generative models.
引用
收藏
页码:123 / 141
页数:19
相关论文
共 50 条
  • [41] GarmentTracking: Category-Level Garment Pose Tracking
    Xue, Han
    Xu, Wenqiang
    Zhang, Jieyi
    Tang, Tutian
    Li, Yutong
    Du, Wenxin
    Ye, Ruolin
    Lu, Cewu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21233 - 21242
  • [42] DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency
    Lin, Jiehong
    Wei, Zewei
    Li, Zhihao
    Xu, Songcen
    Jia, Kui
    Li, Yuanqing
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3540 - 3549
  • [43] Single-Stage Keypoint-Based Category-Level Object Pose Estimation from an RGB Image
    Lin, Yunzhi
    Tremblay, Jonathan
    Tyree, Stephen
    Vela, Patricio A.
    Birchfield, Stan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 1547 - 1553
  • [44] KGNet: Knowledge-Guided Networks for Category-Level 6D Object Pose and Size Estimation
    Meng, Qiwei
    Gu, Jason
    Zhu, Shiqiang
    Liao, Jianfeng
    Jin, Tianlei
    Guo, Fangtai
    Wang, Wen
    Song, Wei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6102 - 6108
  • [45] SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation
    Chen, Kai
    Dou, Qi
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2753 - 2762
  • [46] Category-Level Object Pose Estimation in Heavily Cluttered Scenes by Generalized Two-Stage Shape Reconstructor
    Tatemichi, Hiroki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Murase, Hiroshi
    [J]. IEEE ACCESS, 2024, 12 : 33440 - 33448
  • [47] Adversarial imitation learning-based network for category-level 6D object pose estimation
    Sun, Shantong
    Bao, Xu
    Kaushik, Aryan
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [48] Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation
    Tang, Kaifeng
    Xu, Chi
    Chen, Ming
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 53043 - 53063
  • [49] Fine segmentation and difference-aware shape adjustment for category-level 6DoF object pose estimation
    Chongpei Liu
    Wei Sun
    Jian Liu
    Xing Zhang
    Shimeng Fan
    Qiang Fu
    [J]. Applied Intelligence, 2023, 53 : 23711 - 23728
  • [50] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090