6D Pose Estimation with Correlation Fusion

被引:4
|
作者
Cheng, Yi [1 ]
Zhu, Hongyuan [1 ]
Sun, Ying [1 ]
Acar, Cihan [2 ]
Jing, Wei [2 ]
Wu, Yan [2 ]
Li, Liyuan [1 ]
Tan, Cheston [1 ]
Lim, Joo-Hwee [1 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Visual Intelligence, Singapore, Singapore
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Robot & Autonomous Syst, Singapore, Singapore
关键词
object pose estimation; RGB-D; correlation fusion;
D O I
10.1109/ICPR48806.2021.9412238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
6D object pose estimation is widely applied in robotic tasks such as grasping and manipulation. Prior methods using RGB-only images are vulnerable to heavy occlusion and poor illumination, so it is important to complement them with depth information. However, existing methods using RGB-D data cannot adequately exploit consistent and complementary information between RGB and depth modalities. In this paper, we present a novel method to effectively consider the correlation within and across both modalities with attention mechanism to learn discriminative and compact multi-modal features. Then, effective fusion strategies for intra- and inter-correlation modules are explored to ensure efficient information flow between RGB and depth. To our best knowledge, this is the first work to explore effective intra- and inter-modality fusion in 6D pose estimation. The experimental results show that our method can achieve the state-of-the-art performance on LineMOD and YCB-Video dataset. We also demonstrate that the proposed method can benefit a real-world robot grasping task by providing accurate object pose estimation.
引用
收藏
页码:2988 / 2994
页数:7
相关论文
共 50 条
  • [21] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [22] Survey on 6D Pose Estimation of Rigid Object
    Chen, Jiale
    Zhang, Lijun
    Liu, Yi
    Xu, Chi
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7440 - 7445
  • [23] Orientation Keypoints for 6D Human Pose Estimation
    Fisch, Martin
    Clark, Ronald
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10145 - 10158
  • [24] Mitigating imbalances in heterogeneous feature fusion for multi-class 6D pose estimation
    Wang, Huafeng
    Zhang, Haodu
    Liu, Wanquan
    Lv, Weifeng
    Gu, Xianfeng
    Guo, Kexin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 297
  • [25] A Lightweight Two-End Feature Fusion Network for Object 6D Pose Estimation
    Zuo, Ligang
    Xie, Lun
    Pan, Hang
    Wang, Zhiliang
    [J]. MACHINES, 2022, 10 (04)
  • [26] A Transformer-based multi-modal fusion network for 6D pose estimation
    Hong, Jia-Xin
    Zhang, Hong-Bo
    Liu, Jing-Hua
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    [J]. INFORMATION FUSION, 2024, 105
  • [27] Homologous multimodal fusion network with geometric constraint keypoints selection for 6D pose estimation
    Yi, Guo
    Wang, Fei
    Ding, Qichuan
    [J]. Expert Systems with Applications, 2025, 266
  • [28] ACCURATE 6D OBJECT POSE ESTIMATION BY POSE CONDITIONED MESH RECONSTRUCTION
    Castro, Pedro
    Armagan, Anil
    Kim, Tae-Kyun
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4147 - 4151
  • [29] A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation
    Trabelsi, Ameni
    Chaabane, Mohamed
    Blanchard, Nathaniel
    Beveridge, Ross
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2381 - 2390
  • [30] PointPoseNet: Point Pose Network for Robust 6D Object Pose Estimation
    Chen, Wei
    Duan, Jinming
    Basevi, Hector
    Chang, Hyung Jin
    Leonardis, Ales
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2813 - 2822