GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation

Cited by: 7
Authors
Wen, Yongming [1]
Fang, Yiquan [1]
Cai, Junhao [1]
Tung, Kimwa [1]
Cheng, Hui [1]
Affiliations
[1] Sun Yat-sen University, Guangzhou, People's Republic of China
Keywords
6D Pose Estimation; Co-attention Mechanism; Object Model Priors; Geometric Constraint
DOI
10.1145/3474085.3475209
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the 6D object pose estimation task, object models are usually available and represented as point clouds in the canonical object frame; they serve as important references for estimating object poses relative to the camera frame. However, directly introducing object models as prior knowledge (i.e., the object model point cloud) can cause perturbations and even degrade pose estimation performance. To make the most of object model priors and eliminate this problem, we present an end-to-end deep learning approach called the Geometric Constraint Co-attention Network (GCCN) for 6D object pose estimation. GCCN is designed to explicitly and effectively leverage object model priors through a co-attention mechanism. We add explicit geometric constraints to a co-attention module to inform the geometric correspondences between points in the scene and the object model priors, and we develop a novel geometric constraint loss to guide training. In this manner, our method effectively eliminates the side effects of directly introducing object model priors into the network. Experiments on the YCB-Video and LineMOD datasets demonstrate that GCCN substantially improves pose estimation performance and is robust against heavy occlusions. We also demonstrate that GCCN is accurate and robust enough to be deployed in real-world robotic tasks.
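The abstract does not give the network's equations, so the following is only a generic illustration of the two ideas it names: co-attention between scene points and object model points, and a loss that ties the attention weights to geometric correspondences. Everything here is an assumption made for illustration (the function names `co_attention` and `geometric_constraint_loss`, the scaled dot-product affinity, and the soft-correspondence distance used as the loss); it is not the GCCN architecture or the paper's loss.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def co_attention(scene_feat, model_feat):
    """Generic co-attention between two point sets (illustrative only).

    scene_feat: (N, d) per-point features of the observed scene points.
    model_feat: (M, d) per-point features of the object model points.
    """
    d = scene_feat.shape[1]
    # Affinity between every scene point and every model point: (N, M).
    affinity = scene_feat @ model_feat.T / np.sqrt(d)
    # Each scene point attends over model points (rows sum to 1).
    scene_to_model = softmax(affinity, axis=1)
    # Each model point attends over scene points (columns sum to 1).
    model_to_scene = softmax(affinity, axis=0)
    scene_ctx = scene_to_model @ model_feat      # (N, d)
    model_ctx = model_to_scene.T @ scene_feat    # (M, d)
    return scene_to_model, model_to_scene, scene_ctx, model_ctx


def geometric_constraint_loss(scene_to_model, scene_pts, model_pts, R, t):
    """Hypothetical geometric loss, NOT the loss from the paper.

    The attention weights define a soft-matched model point for each scene
    point. Transforming it by the ground-truth pose (R, t) should land near
    the observed scene point; the loss is the mean Euclidean distance.
    """
    matched = scene_to_model @ model_pts         # (N, 3), canonical frame
    pred = matched @ R.T + t                     # (N, 3), camera frame
    return float(np.mean(np.linalg.norm(pred - scene_pts, axis=1)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    scene_feat = rng.normal(size=(5, 8))
    model_feat = rng.normal(size=(7, 8))
    s2m, m2s, scene_ctx, model_ctx = co_attention(scene_feat, model_feat)
    print(s2m.shape, m2s.shape, scene_ctx.shape, model_ctx.shape)
```

In this sketch the loss is zero exactly when the attention-weighted model points, moved into the camera frame, coincide with the scene points, which is one simple way attention weights could be pushed toward geometrically meaningful correspondences.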
Pages: 2671-2679
Number of pages: 9