Self Attention Guided Depth Completion using RGB and Sparse LiDAR Point Clouds

被引:1
|
作者
Srivastava, Siddharth [1 ]
Sharma, Gaurav [2 ,3 ]
机构
[1] Ctr Dev Adv Comp, Noida, India
[2] TensorTour Inc, Gurgaon, Haryana, India
[3] IIT Kanpur, Kanpur, Uttar Pradesh, India
关键词
IMAGE;
D O I
10.1109/IROS51168.2021.9636310
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of completing per pixel dense depth map using a single RGB image and the sparse point cloud of the scene. Depth prediction from RGB image is a hard problem and while dense point clouds obtained from LiDAR sensors can be used in addition to RGB image, the cost of such sensors is a significant barrier. Having LiDAR sensors which capture sparse point clouds is a reasonable middle ground. We propose a novel architecture which incorporates geometric primitives and self attention mechanisms, to improve the prediction. The motivation of self attention is to capture the correlations between scene and object elements, e.g. between the right and left window of car, early on in the network. While that for using geometric primitives is to have a high level clustering cue to enable the network to exploit similar correlations. In addition, we enforce complimentarity in the predictions made with RGB and sparse LiDAR respectively, this forces the two corresponding branches to focus on hard areas which are not already well predicted by the other branch. With exhaustive experiments on KITTI depth completion benchmark, NYU v2 and Matterport3D we show that the proposed method provides state-of-the-art results.
引用
收藏
页码:2643 / 2650
页数:8
相关论文
共 50 条
  • [21] Real-time Interpolation Method For Sparse LiDAR Point Cloud Using RGB Camera
    Hasegawa, Tomohiko
    Emaru, Takanori
    Ravankar, Ankit A.
    2021 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2021, : 421 - 425
  • [22] Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds
    Yang, Xuemeng
    Zou, Hao
    Kong, Xin
    Huang, Tianxin
    Liu, Yong
    Li, Wanlong
    Wen, Feng
    Zhang, Hongbo
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3555 - 3562
  • [23] Roof Classification From 3-D LiDAR Point Clouds Using Multiview CNN With Self-Attention
    Shajahan, Dimple A.
    Nayel, Vaibhav
    Muthuganapathy, Ramanathan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (08) : 1465 - 1469
  • [24] Stereo-augmented Depth Completion from a Single RGB-LiDAR image
    Choi, Keunhoon
    Jeong, Somi
    Kim, Youngjung
    Sohn, Kwanghoon
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13641 - 13647
  • [25] GRAYSCALE AND NORMAL GUIDED DEPTH COMPLETION WITH A LOW-COST LIDAR
    Yu, Qingyang
    Chu, Lei
    Wu, Qi
    Pei, Ling
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 979 - 983
  • [26] Novel Moving-Target Detection Using A Hybrid of RGB Images and LiDAR Point-Clouds
    Tian, Rui
    Wu, Hsiao-Chun
    Ye, Jinwei
    Wu, Yiyan
    2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
  • [27] An Efficient Information-Reinforced Lidar Deep Completion Network without RGB Guided
    Wei, Ming
    Zhu, Ming
    Zhang, Yaoyuan
    Sun, Jiaqi
    Wang, Jiarong
    REMOTE SENSING, 2022, 14 (19)
  • [28] Model driven reconstruction of roofs from sparse LIDAR point clouds
    Henn, Andre
    Groeger, Gerhard
    Stroh, Viktor
    Pluemer, Lutz
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 76 : 17 - 29
  • [29] ESC-Net: Alleviating Triple Sparsity on 3D LiDAR Point Clouds for Extreme Sparse Scene Completion
    An, Pei
    Zhu, Di
    Quan, Siwen
    Ding, Junfeng
    Ma, Jie
    Yang, You
    Liu, Qiong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6799 - 6810
  • [30] SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds
    Wang, Yanbo
    Zhao, Wentao
    Cao, Chuan
    Deng, Tianchen
    Wang, Jingchuan
    Chen, Weidong
    COMPUTER VISION - ECCV 2024, PT V, 2025, 15063 : 403 - 421