Multi-View Depth Estimation by Using Adaptive Point Graph to Fuse Single-View Depth Probabilities

被引：0

作者：

Wang, Ke ^{[1
]}

Liu, Chuhao ^{[2
]}

Liu, Zhanwen ^{[1
]}

Xiao, Fangwei ^{[1
]}

An, Yisheng ^{[1
]}

Zhao, Xiangmo ^{[1
]}

Shen, Shaojie ^{[2
]}

机构：

[1] Changan Univ, Coll Informat Engn, Xian 710064, Peoples R China

[2] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 07期

基金：

国家重点研发计划;

关键词：

Computer vision for automation; range sensing; visual learning and deep learning for visual perception;

D O I：

10.1109/LRA.2024.3405332

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Recently, some methods estimate depth maps by fusing several adjacent single-view depth probabilities. They have achieved promising performance in multi-view inconsistent areas, such as texture-less surfaces, reflective surfaces, and moving objects. However, these methods involve two new problems: their thin cost volumes contain many invalid values, and the depths of adjacent volume units tend to be very different, which hinders the valid fusion of multi-view information. To deal with these issues, we design a novel point graph based single-views fusing method to estimate depth maps from several sequential images. Our method first estimates the initial probabilistic distribution of the depth map for input images, the distribution is parameterized as a pixel-wise depth and uncertainty. Then, we sample non-uniform depth candidates from the reference image's initial distribution. Diverse from the popular 3D cost volume, we utilize sampled depth candidates to construct an adaptive local point graph to represent multi-view geometric constraints. For pixels with multi-view consistency, we aggregate their local graphs to update their initial depths. And take the updated pixels as control points to refine the depth of the remaining pixels. We demonstrate the effectiveness of the proposed method by quantitative and qualitative comparisons with recent baseline works on the KITTI Odometry dataset and the DADD dataset, and our results surpass all competing methods even without 3D cost volume.

引用

页码：6400 / 6407

页数：8

共 50 条

[1] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
Bae, Gwangbin
Budvytis, Ignas
Cipolla, Roberto
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2832 - 2841
[2] Single-View and Multi-View Depth Fusion
Facil, Jose M.
Concha, Alejo
Montesano, Luis
Civera, Javier
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 1994 - 2001
[3] Adaptive depth estimation for pyramid multi-view stereo
Liao, Jie
Fu, Yanping
Yan, Qingan
Luo, Fei
Xiao, Chunxia
[J]. COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
[4] Multi-view depth video coding using depth view synthesis
Na, Sang-Tae
Oh, Kwan-Jung
Lee, Cheon
Ho, Yo-Sung
[J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1400 - 1403
[5] Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs
Ge, Liuhao
Liang, Hui
Yuan, Junsong
Thalmann, Daniel
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3593 - 3601
[6] Are Multi-view Edges Incomplete for Depth Estimation?
Khan, Numair
Kim, Min H.
Tompkin, James
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2639 - 2673
[7] Continuous Depth Estimation for Multi-view Stereo
Liu, Yebin
Cao, Xun
Dai, Qionghai
Xu, Wenli
[J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2121 - 2128
[8] ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval
Zhang, Song
Xu, Wenjia
Wei, Zhiwei
Zhang, Lili
Wang, Yang
Liu, Junyi
[J]. PATTERN RECOGNITION, 2023, 144
[9] Mono-SF: Multi-View Geometry Meets Single-View Depth for Monocular Scene Flow Estimation of Dynamic Traffic Scenes
Brickwedde, Fabian
Abraham, Steffen
Mester, Rudolf
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2780 - 2790
[10] Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes
Li, Rui
Gong, Dong
Yin, Wei
Chen, Hao
Zhu, Yu
Wang, Kaixuan
Chen, Xiaozhi
Sun, Jinqiu
Zhang, Yanning
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21539 - 21548

← 1 2 3 4 5 →