Adaptive depth estimation for pyramid multi-view stereo

被引:12
|
作者
Liao, Jie [1 ]
Fu, Yanping [2 ]
Yan, Qingan [3 ]
Luo, Fei [1 ]
Xiao, Chunxia [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[3] JD Com, Silicon Valley Res Ctr Multimedia Software, Beijing, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2021年 / 97卷
关键词
3D Reconstruction; Multi-View Stereo; Deep Learning; RECONSTRUCTION;
D O I
10.1016/j.cag.2021.04.016
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a Multi-View Stereo (MVS) network which can perform efficient high-resolution depth estimation with low memory consumption. Classical learning-based MVS approaches typically construct 3D cost volumes to regress depth information, making the output resolution rather limited as the memory consumption grows cubically with the input resolution. Although recent approaches have made significant progress in scalability by introducing the coarse-to-fine fashion or sequential cost map regularization, the memory consumption still grows quadratically with input resolution and is not friendly for commodity GPU. Observing that the surfaces of most objects in real world are locally smooth, we assume that most of the depth hypotheses upsampled from a well-estimated depth map are accurate. Based on the assumption, we propose a pyramid MVS network based on the adaptive depth estimation, which gradually refines and upsamples the depth map to the desired resolution. Instead of estimating depth hypotheses for all pixels in the depth map, our method only performs prediction at adaptively selected locations, alleviating excessive computation on well-estimated positions. To estimate depth hypotheses for sparse selected locations, we propose the lightweight pixelwise depth estimation network, which can estimate depth value for each selected location independently. Experiments demonstrate that our method can generate results comparable with the state-of-the-art learning-based methods while reconstructing more geometric details and consuming less GPU memory. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:268 / 278
页数:11
相关论文
共 50 条
  • [41] IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo
    Wang, Fangjinhua
    Galliani, Silvano
    Vogel, Christoph
    Pollefeys, Marc
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8596 - 8605
  • [42] A Benchmark and a Baseline for Robust Multi-view Depth Estimation
    Schroeppel, Philipp
    Bechtold, Jan
    Amiranashvili, Artemij
    Brox, Thomas
    [J]. 2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 637 - 645
  • [43] Self-supervised Learning of Depth Inference for Multi-view Stereo
    Yang, Jiayu
    Alvarez, Jose M.
    Liu, Miaomiao
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7522 - 7530
  • [44] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    [J]. FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [45] Multi-view Stereo by Fusing Monocular and a Combination of Depth Representation Methods
    Yu, Fanqi
    Sun, Xinyang
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 298 - 309
  • [46] Monocular depth estimation with multi-view attention autoencoder
    Geunho Jung
    Sang Min Yoon
    [J]. Multimedia Tools and Applications, 2022, 81 : 33759 - 33770
  • [47] Monocular depth estimation with multi-view attention autoencoder
    Jung, Geunho
    Yoon, Sang Min
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33759 - 33770
  • [48] Deep Multi-view Depth Estimation with Predicted Uncertainty
    Tong Ke
    Tien Do
    Khiem Vuong
    Sartipi, Kourosh
    Roumeliotis, Stergios, I
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9235 - 9241
  • [49] PDE-based multi-view depth estimation
    Strecha, C
    Van Gool, L
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 416 - 425
  • [50] BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo
    Li, Yinhao
    Bao, Han
    Ge, Zheng
    Yang, Jinrong
    Sun, Jianjian
    Li, Zeming
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1486 - 1494