A RGB-D feature fusion network for occluded object 6D pose estimation

被引:0
|
作者
Song, Yiwei [1 ]
Tang, Chunhui [1 ]
机构
[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China
关键词
6D Pose estimation; Implicit fusion; Local region feature; Transformer;
D O I
10.1007/s11760-024-03318-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
6D pose estimation using RGB-D data has been widely utilized in various scenarios, with keypoint-based methods receiving significant attention due to their exceptional performance. However, these methods still face numerous challenges, especially when the object is heavily occluded or truncated. To address this issue, we propose a novel cross-modal fusion network. Specifically, our approach initially employs object detection to identify the potential position of the object and randomly samples within this region. Subsequently, a specially designed feature extraction network is utilized to extract appearance features from the RGB image and geometry features from the depth image respectively; these features are then implicitly aggregated through cross-modal fusion. Finally, keypoints are employed for estimating the pose of the object. The proposed method undergoes extensive testing on Occlusion Linemod and Truncation Linemod datasets. Experimental results demonstrate that our method has made significant advancements, thereby validating the effectiveness of cross-modal feature fusion strategy in enhancing the accuracy of RGB-D image pose estimation based on keypoints.
引用
收藏
页码:6309 / 6319
页数:11
相关论文
共 50 条
  • [21] A Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation
    Zhou, Guangliang
    Yan, Yi
    Wang, Deming
    Chen, Qijun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1630 - 1639
  • [22] HFF6D: Hierarchical Feature Fusion Network for Robust 6D Object Pose Tracking
    Liu, Jian
    Sun, Wei
    Liu, Chongpei
    Zhang, Xing
    Fan, Shimeng
    Wu, Wei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7719 - 7731
  • [23] RGB-D object pose estimation in unstructured environments
    Choi, Changhyun
    Christensen, Henrik I.
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 595 - 613
  • [24] 3D Object Detection and 6D Pose Estimation Using RGB-D Images and Mask R-CNN
    Tran, Van Luan
    Lin, Huei-Yung
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [25] Geometric-aware dense matching network for 6D pose estimation of objects from RGB-D images
    Wu, Chenrui
    Chen, Long
    Wang, Shenglong
    Yang, Han
    Jiang, Junjie
    [J]. PATTERN RECOGNITION, 2023, 137
  • [26] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
    Xiao-Long Yang
    Xiao-Hong Jia
    Yuan Liang
    Lu-Bin Fan
    [J]. Journal of Computer Science and Technology, 2022, 37 : 719 - 730
  • [27] Graph neural network for 6D object pose estimation
    Yin, Pengshuai
    Ye, Jiayong
    Lin, Guoshen
    Wu, Qingyao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [28] Generalizable and Accurate 6D Object Pose Estimation Network
    Fu, Shouxu
    Li, Xiaoning
    Yu, Xiangdong
    Cao, Lu
    Li, Xingxing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 312 - 324
  • [29] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
    Yang, Xiao-Long
    Jia, Xiao-Hong
    Liang, Yuan
    Fan, Lu-Bin
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (03) : 719 - 730
  • [30] DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
    Wang, Chen
    Xu, Danfei
    Zhu, Yuke
    Martin-Martin, Roberto
    Lu, Cewu
    Li Fei-Fei
    Savarese, Silvio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3338 - 3347