GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation

被引:7
|
作者
Wen, Yongming [1 ]
Fang, Yiquan [1 ]
Cai, Junhao [1 ]
Tung, Kimwa [1 ]
Cheng, Hui [1 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
关键词
6D Pose Estimation; Co-attention Mechanism; Object Model Priors; Geometric Constraint;
D O I
10.1145/3474085.3475209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In 6D object pose estimation task, object models are usually available and represented as the point cloud set in canonical object frame, which are important references for estimating object poses to the camera frame. However, directly introducing object models as the prior knowledge (i.e., object model point cloud) will cause potential perturbations and even degenerate pose estimation performance. To make the most of object model priors and eliminate the problem, we present an end-to-end deep learning approach called the Geometric Constraint Co-attention Network (GCCN) for 6D object pose estimation. GCCN is designed to explicitly leverage the object model priors effectively with the co-attention mechanism. We add explicit geometric constraints to a co-attention module to inform the geometric correspondence relationships between points in the scene and object model priors and develop a novel geometric constraint loss to guide the training. In this manner, our method effectively eliminates the side effect of directly introducing the object model priors into the network. Experiments on the YCB-Video and LineMOD datasets demonstrate that our GCCN substantially improves the performance of pose estimation and is robust against heavy occlusions. We also demonstrate that GCCN is accurate and robust enough to be deployed in real-world robotic tasks.
引用
收藏
页码:2671 / 2679
页数:9
相关论文
共 50 条
  • [41] Focal segmentation for robust 6D object pose estimation
    Ye, Yuning
    Park, Hanhoon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 47563 - 47585
  • [42] Segmentation-driven 6D Object Pose Estimation
    Hu, Yinlin
    Hugonot, Joachim
    Fua, Pascal
    Salzmann, Mathieu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3380 - 3389
  • [43] Fundamental Coordinate Space for Object 6D Pose Estimation
    Wan, Boyan
    Zhang, Chen
    IEEE ACCESS, 2024, 12 : 146430 - 146440
  • [44] 6D Object Pose Estimation for Robot Programming by Demonstration
    Ghahramani, Mohammad
    Vakanski, Aleksandar
    Janabi-Sharifi, Farrokh
    PROGRESS IN OPTOMECHATRONIC TECHNOLOGIES, 2019, 233 : 93 - 101
  • [45] RobotP: A Benchmark Dataset for 6D Object Pose Estimation
    Yuan, Honglin
    Hoogenkamp, Tim
    Veltkamp, Remco C.
    SENSORS, 2021, 21 (04) : 1 - 26
  • [46] Focal segmentation for robust 6D object pose estimation
    Yuning Ye
    Hanhoon Park
    Multimedia Tools and Applications, 2024, 83 : 47563 - 47585
  • [47] Open-vocabulary object 6D pose estimation
    Corsetti, Jaime
    Boscaini, Davide
    Oh, Changjae
    Cavallaro, Andrea
    Poiesi, Fabio
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18071 - 18080
  • [48] Single-Stage 6D Object Pose Estimation
    Hu, Yinlin
    Fua, Pascal
    Wang, Wei
    Salzmann, Mathieu
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2927 - 2936
  • [49] Sparse Keypoint Models for 6D Object Pose Estimation
    Sadran, Emal
    Wurm, Kai M.
    Burschka, Darius
    2013 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR 2013), 2013, : 307 - 312
  • [50] ACCURATE 6D OBJECT POSE ESTIMATION BY POSE CONDITIONED MESH RECONSTRUCTION
    Castro, Pedro
    Armagan, Anil
    Kim, Tae-Kyun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4147 - 4151