A RGB-D feature fusion network for occluded object 6D pose estimation

被引:0
|
作者
Song, Yiwei [1 ]
Tang, Chunhui [1 ]
机构
[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China
关键词
6D Pose estimation; Implicit fusion; Local region feature; Transformer;
D O I
10.1007/s11760-024-03318-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
6D pose estimation using RGB-D data has been widely utilized in various scenarios, with keypoint-based methods receiving significant attention due to their exceptional performance. However, these methods still face numerous challenges, especially when the object is heavily occluded or truncated. To address this issue, we propose a novel cross-modal fusion network. Specifically, our approach initially employs object detection to identify the potential position of the object and randomly samples within this region. Subsequently, a specially designed feature extraction network is utilized to extract appearance features from the RGB image and geometry features from the depth image respectively; these features are then implicitly aggregated through cross-modal fusion. Finally, keypoints are employed for estimating the pose of the object. The proposed method undergoes extensive testing on Occlusion Linemod and Truncation Linemod datasets. Experimental results demonstrate that our method has made significant advancements, thereby validating the effectiveness of cross-modal feature fusion strategy in enhancing the accuracy of RGB-D image pose estimation based on keypoints.
引用
收藏
页码:6309 / 6319
页数:11
相关论文
共 50 条
  • [41] RFF-PoseNet: A 6D Object Pose Estimation Network Based on Robust Feature Fusion in Complex Scenes
    Lei, Xiaomei
    Lu, Wenhuan
    Yong, Jiu
    Wei, Jianguo
    [J]. ELECTRONICS, 2024, 13 (17)
  • [42] Optimizing RGB-D Fusion for Accurate 6DoF Pose Estimation
    Saadi, Lounes
    Besbes, Bassem
    Kramm, Sebastien
    Bensrhair, Abdelaziz
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2413 - 2420
  • [43] An RGB-D Refinement Solution for Accurate Object Pose Estimation
    Saadi, Lounes
    Besbes, Bassem
    Kramm, Sebastien
    Bensrhair, Abdelaziz
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT PROCEEDINGS (ISMAR-ADJUNCT 2021), 2021, : 189 - 194
  • [44] MFFNet: Multimodal feature fusion network for RGB-D transparent object detection
    Zhu, Li
    Li, Tuanjie
    Ning, Yuming
    Zhang, Yan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
  • [45] Multiple geometry representations for 6D object pose estimation in occluded or truncated scenes
    Wang, Jichun
    Qiu, Lemiao
    Yi, Guodong
    Zhang, Shuyou
    Wang, Yang
    [J]. PATTERN RECOGNITION, 2022, 132
  • [46] 6D Pose Estimation with Correlation Fusion
    Cheng, Yi
    Zhu, Hongyuan
    Sun, Ying
    Acar, Cihan
    Jing, Wei
    Wu, Yan
    Li, Liyuan
    Tan, Cheston
    Lim, Joo-Hwee
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2988 - 2994
  • [47] 6D Robotic Assembly Based on RGB-only Object Pose Estimation
    Fu, Bowen
    Leong, Sek Kun
    Lian, Xiaocong
    Ji, Xiangyang
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 4736 - 4742
  • [48] T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects
    Hodan, Tomas
    Haluza, Pavel
    Obdrzalek, Stepan
    Matas, Jiri
    Lourakis, Manolis
    Zabulis, Xenophon
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 880 - 888
  • [49] Discriminative feature fusion for RGB-D salient object detection
    Chen, Zeyu
    Zhu, Mingyu
    Chen, Shuhan
    Lu, Lu
    Tang, Haonan
    Hu, Xuelong
    Ji, Chunfan
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [50] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245