A RGB-D feature fusion network for occluded object 6D pose estimation

Cited by: 0
Authors
Song, Yiwei [1]
Tang, Chunhui [1]
Affiliations
[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China
Keywords
6D Pose estimation; Implicit fusion; Local region feature; Transformer
DOI
10.1007/s11760-024-03318-7
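Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract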
6D pose estimation using RGB-D data has been widely utilized in various scenarios, with keypoint-based methods receiving significant attention due to their exceptional performance. However, these methods still face numerous challenges, especially when the object is heavily occluded or truncated. To address this issue, we propose a novel cross-modal fusion network. Specifically, our approach first employs object detection to identify the potential position of the object and randomly samples within this region. Subsequently, a specially designed feature extraction network extracts appearance features from the RGB image and geometry features from the depth image; these features are then implicitly aggregated through cross-modal fusion. Finally, keypoints are employed to estimate the pose of the object. The proposed method is tested extensively on the Occlusion Linemod and Truncation Linemod datasets. Experimental results demonstrate that our method has made significant advancements, thereby validating the effectiveness of the cross-modal feature fusion strategy in enhancing the accuracy of keypoint-based pose estimation from RGB-D images.
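The abstract does not say how the pose is computed from the keypoints. As an illustration only, the sketch below uses least-squares rigid alignment (the Kabsch algorithm), a common choice in keypoint-based RGB-D pipelines; it is an assumption, not the authors' stated method. The function name `fit_rigid_pose` and the synthetic keypoints are hypothetical.

```python
import numpy as np


def fit_rigid_pose(model_kps, scene_kps):
    """Least-squares rotation R and translation t with scene ≈ R @ model + t.

    Assumption: keypoints are given as matched (N, 3) arrays, N >= 3,
    not all collinear. This is the Kabsch algorithm, used here only
    as an illustrative final step of a keypoint-based pipeline.
    """
    model_kps = np.asarray(model_kps, dtype=float)
    scene_kps = np.asarray(scene_kps, dtype=float)

    # Center both keypoint sets on their centroids.
    mu_m = model_kps.mean(axis=0)
    mu_s = scene_kps.mean(axis=0)

    # 3x3 cross-covariance between centered model and scene keypoints.
    H = (model_kps - mu_m).T @ (scene_kps - mu_s)

    # SVD gives the optimal rotation; D corrects a possible reflection.
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    D = np.diag([1.0, 1.0, d])
    R = Vt.T @ D @ U.T
    t = mu_s - R @ mu_m
    return R, t


# Synthetic check: rotate hypothetical keypoints 30 degrees about z, then shift.
rng = np.random.default_rng(0)
model = rng.normal(size=(8, 3))
theta = np.deg2rad(30.0)
R_true = np.array([
    [np.cos(theta), -np.sin(theta), 0.0],
    [np.sin(theta), np.cos(theta), 0.0],
    [0.0, 0.0, 1.0],
])
t_true = np.array([0.1, -0.2, 0.5])
scene = model @ R_true.T + t_true

R_est, t_est = fit_rigid_pose(model, scene)
```

With noise-free correspondences the recovered `R_est` and `t_est` match the ground-truth pose up to floating-point error; with predicted keypoints the same call returns the best fit in the least-squares sense.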
Pages: 6309-6319
Number of pages: 11