Adversarial imitation learning-based network for category-level 6D object pose estimation

被引:0
|
作者
Sun, Shantong [1 ]
Bao, Xu [1 ]
Kaushik, Aryan [2 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang 212013, Peoples R China
[2] Univ Sussex, Sch Engn & Informat, Brighton BN19RH, England
关键词
Category-level; Object pose estimation; Analysis-by-synthesis; Adversarial imitation learning; MACHINE VISION; REALITY;
D O I
10.1007/s00138-024-01592-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation is a very fundamental and key research in computer vision. In order to get rid of the dependence on the object 3D models, analysis-by-synthesis object pose estimation methods have recently been widely studied. While these methods have certain improvements in generalization, the accuracy of category-level object pose estimation still needs to be improved. In this paper, we propose a category-level 6D object pose estimation network based on adversarial imitation learning, named AIL-Net. AIL-Net adopts the state-action distribution matching criterion and is able to perform expert actions that have not appeared in the dataset. This prevents the object pose estimation from falling into a bad state. We further design a framework for estimating object pose through generative adversarial imitation learning. This method is able to distinguish between expert policy and imitation policy in AIL-Net. Experimental results show that our approach achieves competitive category-level object pose estimation performance on REAL275 dataset and Cars dataset.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    [J]. IEEE Transactions on Multimedia, 2024, 26 : 1665 - 1680
  • [42] Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image
    Fan, Zhaoxin
    Song, Zhenbo
    Xu, Jian
    Wang, Zhicheng
    Wu, Kejian
    Liu, Hongyan
    He, Jun
    [J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 220 - 236
  • [43] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1665 - 1680
  • [44] Optimal Pose and Shape Estimation for Category-level 3D Object Perception
    Shi, Jingnan
    Yang, Heng
    Carlone, Luca
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [45] Robotic Grasp Detection Based on Category-Level Object Pose Estimation With Self-Supervised Learning
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (01) : 625 - 635
  • [46] Graph neural network for 6D object pose estimation
    Yin, Pengshuai
    Ye, Jiayong
    Lin, Guoshen
    Wu, Qingyao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [47] Generalizable and Accurate 6D Object Pose Estimation Network
    Fu, Shouxu
    Li, Xiaoning
    Yu, Xiangdong
    Cao, Lu
    Li, Xingxing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 312 - 324
  • [48] Open-Vocabulary Category-Level Object Pose and Size Estimation
    Cai, Junhao
    He, Yisheng
    Yuan, Weihao
    Zhu, Siyu
    Dong, Zilong
    Bo, Liefeng
    Chen, Qifeng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7661 - 7668
  • [49] PointPoseNet: Point Pose Network for Robust 6D Object Pose Estimation
    Chen, Wei
    Duan, Jinming
    Basevi, Hector
    Chang, Hyung Jin
    Leonardis, Ales
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2813 - 2822
  • [50] A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation
    Trabelsi, Ameni
    Chaabane, Mohamed
    Blanchard, Nathaniel
    Beveridge, Ross
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2381 - 2390