Multi-View Depth Estimation by Using Adaptive Point Graph to Fuse Single-View Depth Probabilities

被引:0
|
作者
Wang, Ke [1 ]
Liu, Chuhao [2 ]
Liu, Zhanwen [1 ]
Xiao, Fangwei [1 ]
An, Yisheng [1 ]
Zhao, Xiangmo [1 ]
Shen, Shaojie [2 ]
机构
[1] Changan Univ, Coll Informat Engn, Xian 710064, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
来源
基金
国家重点研发计划;
关键词
Computer vision for automation; range sensing; visual learning and deep learning for visual perception;
D O I
10.1109/LRA.2024.3405332
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Recently, some methods estimate depth maps by fusing several adjacent single-view depth probabilities. They have achieved promising performance in multi-view inconsistent areas, such as texture-less surfaces, reflective surfaces, and moving objects. However, these methods involve two new problems: their thin cost volumes contain many invalid values, and the depths of adjacent volume units tend to be very different, which hinders the valid fusion of multi-view information. To deal with these issues, we design a novel point graph based single-views fusing method to estimate depth maps from several sequential images. Our method first estimates the initial probabilistic distribution of the depth map for input images, the distribution is parameterized as a pixel-wise depth and uncertainty. Then, we sample non-uniform depth candidates from the reference image's initial distribution. Diverse from the popular 3D cost volume, we utilize sampled depth candidates to construct an adaptive local point graph to represent multi-view geometric constraints. For pixels with multi-view consistency, we aggregate their local graphs to update their initial depths. And take the updated pixels as control points to refine the depth of the remaining pixels. We demonstrate the effectiveness of the proposed method by quantitative and qualitative comparisons with recent baseline works on the KITTI Odometry dataset and the DADD dataset, and our results surpass all competing methods even without 3D cost volume.
引用
收藏
页码:6400 / 6407
页数:8
相关论文
共 50 条
  • [1] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
    Bae, Gwangbin
    Budvytis, Ignas
    Cipolla, Roberto
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2832 - 2841
  • [2] Single-View and Multi-View Depth Fusion
    Facil, Jose M.
    Concha, Alejo
    Montesano, Luis
    Civera, Javier
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 1994 - 2001
  • [3] Adaptive depth estimation for pyramid multi-view stereo
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Luo, Fei
    Xiao, Chunxia
    [J]. COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
  • [4] Multi-view depth video coding using depth view synthesis
    Na, Sang-Tae
    Oh, Kwan-Jung
    Lee, Cheon
    Ho, Yo-Sung
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1400 - 1403
  • [5] Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3593 - 3601
  • [6] Are Multi-view Edges Incomplete for Depth Estimation?
    Khan, Numair
    Kim, Min H.
    Tompkin, James
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2639 - 2673
  • [7] Continuous Depth Estimation for Multi-view Stereo
    Liu, Yebin
    Cao, Xun
    Dai, Qionghai
    Xu, Wenli
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2121 - 2128
  • [8] ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval
    Zhang, Song
    Xu, Wenjia
    Wei, Zhiwei
    Zhang, Lili
    Wang, Yang
    Liu, Junyi
    [J]. PATTERN RECOGNITION, 2023, 144
  • [9] Mono-SF: Multi-View Geometry Meets Single-View Depth for Monocular Scene Flow Estimation of Dynamic Traffic Scenes
    Brickwedde, Fabian
    Abraham, Steffen
    Mester, Rudolf
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2780 - 2790
  • [10] Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes
    Li, Rui
    Gong, Dong
    Yin, Wei
    Chen, Hao
    Zhu, Yu
    Wang, Kaixuan
    Chen, Xiaozhi
    Sun, Jinqiu
    Zhang, Yanning
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21539 - 21548