Self Attention Guided Depth Completion using RGB and Sparse LiDAR Point Clouds

被引：1

作者：

Srivastava, Siddharth ^{[1
]}

Sharma, Gaurav ^{[2
,3
]}

机构：

[1] Ctr Dev Adv Comp, Noida, India

[2] TensorTour Inc, Gurgaon, Haryana, India

[3] IIT Kanpur, Kanpur, Uttar Pradesh, India

来源：

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年

关键词：

IMAGE;

D O I：

10.1109/IROS51168.2021.9636310

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We address the problem of completing per pixel dense depth map using a single RGB image and the sparse point cloud of the scene. Depth prediction from RGB image is a hard problem and while dense point clouds obtained from LiDAR sensors can be used in addition to RGB image, the cost of such sensors is a significant barrier. Having LiDAR sensors which capture sparse point clouds is a reasonable middle ground. We propose a novel architecture which incorporates geometric primitives and self attention mechanisms, to improve the prediction. The motivation of self attention is to capture the correlations between scene and object elements, e.g. between the right and left window of car, early on in the network. While that for using geometric primitives is to have a high level clustering cue to enable the network to exploit similar correlations. In addition, we enforce complimentarity in the predictions made with RGB and sparse LiDAR respectively, this forces the two corresponding branches to focus on hard areas which are not already well predicted by the other branch. With exhaustive experiments on KITTI depth completion benchmark, NYU v2 and Matterport3D we show that the proposed method provides state-of-the-art results.

引用

页码：2643 / 2650

页数：8

共 50 条

[41] Semantically aware multilateral filter for depth upsampling in automotive LiDAR point clouds
Dimitrievski, Martin
Veelaert, Peter
Philips, Wilfried
2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1058 - 1063
[42] Adaptive Selection of Color Images or Depth to Align RGB-D Point Clouds
Perafan Villota, Juan Carlos
Reali Costa, Anna Helena
2014 2ND BRAZILIAN ROBOTICS SYMPOSIUM (SBR) / 11TH LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS) / 6TH ROBOCONTROL WORKSHOP ON APPLIED ROBOTICS AND AUTOMATION, 2014, : 175 - 180
[43] Edge-guided generative network with attention for point cloud completion
Li, Jianliang
Zhang, Jinming
Zhang, Xiaohai
Chen, Ming
VISUAL COMPUTER, 2025, 41 (02): : 785 - 798
[44] Attention Unet plus plus for lightweight depth estimation from sparse depth samples and a single RGB image
Zhao, Tao
Pan, Shuguo
Gao, Wang
Sheng, Chao
Sun, Yingchun
Wei, Jiansheng
VISUAL COMPUTER, 2022, 38 (05): : 1619 - 1630
[45] SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
Nazir, Danish
Pagani, Alain
Liwicki, Marcus
Stricker, Didier
Afzal, Muhammad Zeshan
IEEE ACCESS, 2022, 10 : 120781 - 120791
[46] Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion
Lee, Sihaeng
Lee, Janghyeon
Kim, Doyeon
Kim, Junmo
IEEE ACCESS, 2020, 8 : 79801 - 79810
[47] DEPTH ENHANCEMENT USING RGB-D GUIDED FILTERING
Hui, Tak-Wai
Ngan, King Ngi
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3832 - 3836
[48] Sparse Depth-Guided Image Enhancement Using Incremental GP with Informative Point Selection
Yang, Geonmo
Lee, Juhui
Kim, Ayoung
Cho, Younggun
SENSORS, 2023, 23 (03)
[49] Point-AGM : Attention Guided Masked Auto-Encoder for Joint Self-supervised Learning on Point Clouds
Liu, Jie
Yang, Mengna
Tian, Yu
Li, Yancui
Song, Da
Li, Kang
Cao, Xin
COMPUTER GRAPHICS FORUM, 2024, 43 (07)
[50] SemanticFlow: Semantic Segmentation of Sequential LiDAR Point Clouds From Sparse Frame Annotations
Zhao, Junhao
Huang, Weijie
Wu, Hai
Wen, Chenglu
Yang, Bo
Guo, Yulan
Wang, Cheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61

← 1 2 3 4 5 →