Self Attention Guided Depth Completion using RGB and Sparse LiDAR Point Clouds

被引:1
|
作者
Srivastava, Siddharth [1 ]
Sharma, Gaurav [2 ,3 ]
机构
[1] Ctr Dev Adv Comp, Noida, India
[2] TensorTour Inc, Gurgaon, Haryana, India
[3] IIT Kanpur, Kanpur, Uttar Pradesh, India
关键词
IMAGE;
D O I
10.1109/IROS51168.2021.9636310
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of completing per pixel dense depth map using a single RGB image and the sparse point cloud of the scene. Depth prediction from RGB image is a hard problem and while dense point clouds obtained from LiDAR sensors can be used in addition to RGB image, the cost of such sensors is a significant barrier. Having LiDAR sensors which capture sparse point clouds is a reasonable middle ground. We propose a novel architecture which incorporates geometric primitives and self attention mechanisms, to improve the prediction. The motivation of self attention is to capture the correlations between scene and object elements, e.g. between the right and left window of car, early on in the network. While that for using geometric primitives is to have a high level clustering cue to enable the network to exploit similar correlations. In addition, we enforce complimentarity in the predictions made with RGB and sparse LiDAR respectively, this forces the two corresponding branches to focus on hard areas which are not already well predicted by the other branch. With exhaustive experiments on KITTI depth completion benchmark, NYU v2 and Matterport3D we show that the proposed method provides state-of-the-art results.
引用
收藏
页码:2643 / 2650
页数:8
相关论文
共 50 条
  • [41] Semantically aware multilateral filter for depth upsampling in automotive LiDAR point clouds
    Dimitrievski, Martin
    Veelaert, Peter
    Philips, Wilfried
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1058 - 1063
  • [42] Adaptive Selection of Color Images or Depth to Align RGB-D Point Clouds
    Perafan Villota, Juan Carlos
    Reali Costa, Anna Helena
    2014 2ND BRAZILIAN ROBOTICS SYMPOSIUM (SBR) / 11TH LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS) / 6TH ROBOCONTROL WORKSHOP ON APPLIED ROBOTICS AND AUTOMATION, 2014, : 175 - 180
  • [43] Edge-guided generative network with attention for point cloud completion
    Li, Jianliang
    Zhang, Jinming
    Zhang, Xiaohai
    Chen, Ming
    VISUAL COMPUTER, 2025, 41 (02): : 785 - 798
  • [44] Attention Unet plus plus for lightweight depth estimation from sparse depth samples and a single RGB image
    Zhao, Tao
    Pan, Shuguo
    Gao, Wang
    Sheng, Chao
    Sun, Yingchun
    Wei, Jiansheng
    VISUAL COMPUTER, 2022, 38 (05): : 1619 - 1630
  • [45] SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
    Nazir, Danish
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    IEEE ACCESS, 2022, 10 : 120781 - 120791
  • [46] Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion
    Lee, Sihaeng
    Lee, Janghyeon
    Kim, Doyeon
    Kim, Junmo
    IEEE ACCESS, 2020, 8 : 79801 - 79810
  • [47] DEPTH ENHANCEMENT USING RGB-D GUIDED FILTERING
    Hui, Tak-Wai
    Ngan, King Ngi
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3832 - 3836
  • [48] Sparse Depth-Guided Image Enhancement Using Incremental GP with Informative Point Selection
    Yang, Geonmo
    Lee, Juhui
    Kim, Ayoung
    Cho, Younggun
    SENSORS, 2023, 23 (03)
  • [49] Point-AGM : Attention Guided Masked Auto-Encoder for Joint Self-supervised Learning on Point Clouds
    Liu, Jie
    Yang, Mengna
    Tian, Yu
    Li, Yancui
    Song, Da
    Li, Kang
    Cao, Xin
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [50] SemanticFlow: Semantic Segmentation of Sequential LiDAR Point Clouds From Sparse Frame Annotations
    Zhao, Junhao
    Huang, Weijie
    Wu, Hai
    Wen, Chenglu
    Yang, Bo
    Guo, Yulan
    Wang, Cheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61