Adaptive depth estimation for pyramid multi-view stereo

Cited by: 12
Authors
Liao, Jie [1]
Fu, Yanping [2]
Yan, Qingan [3]
Luo, Fei [1]
Xiao, Chunxia [1]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[3] JD Com, Silicon Valley Res Ctr Multimedia Software, Beijing, Peoples R China
Source
COMPUTERS & GRAPHICS-UK | 2021 / Vol. 97
Keywords
3D Reconstruction; Multi-View Stereo; Deep Learning; RECONSTRUCTION;
DOI
10.1016/j.cag.2021.04.016
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
In this paper, we propose a Multi-View Stereo (MVS) network that performs efficient high-resolution depth estimation with low memory consumption. Classical learning-based MVS approaches typically construct 3D cost volumes to regress depth information, which limits the output resolution because memory consumption grows cubically with the input resolution. Although recent approaches have made significant progress in scalability by adopting a coarse-to-fine scheme or sequential cost map regularization, their memory consumption still grows quadratically with the input resolution and is poorly suited to commodity GPUs. Observing that the surfaces of most objects in the real world are locally smooth, we assume that most of the depth hypotheses upsampled from a well-estimated depth map are accurate. Based on this assumption, we propose a pyramid MVS network based on adaptive depth estimation, which gradually refines and upsamples the depth map to the desired resolution. Instead of estimating depth hypotheses for all pixels in the depth map, our method performs prediction only at adaptively selected locations, avoiding excessive computation at well-estimated positions. To estimate depth hypotheses for the sparsely selected locations, we propose a lightweight pixelwise depth estimation network, which can estimate the depth value for each selected location independently. Experiments demonstrate that our method generates results comparable with state-of-the-art learning-based methods while reconstructing more geometric details and consuming less GPU memory. © 2021 Elsevier Ltd. All rights reserved.
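The refinement step described in the abstract (upsample a coarse depth map, then re-estimate depth only at adaptively selected locations) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not state how locations are selected, so the neighbour-difference criterion in `select_locations` is an assumption made purely for illustration, and `estimate_pixel` is a hypothetical stand-in for the paper's pixelwise depth estimation network. All function names here are invented for this example.

```python
from typing import Callable, List, Tuple

DepthMap = List[List[float]]


def upsample_nearest(depth: DepthMap, factor: int = 2) -> DepthMap:
    """Upsample by repeating each depth value over a factor x factor block."""
    out: DepthMap = []
    for row in depth:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def select_locations(depth: DepthMap, threshold: float) -> List[Tuple[int, int]]:
    """Pick pixels whose depth differs from any 4-neighbour by more than threshold.

    ASSUMPTION: this criterion is hypothetical. It only encodes the idea that
    locally smooth regions are likely already well estimated.
    """
    h, w = len(depth), len(depth[0])
    picked: List[Tuple[int, int]] = []
    for y in range(h):
        for x in range(w):
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                inside = 0 <= ny < h and 0 <= nx < w
                if inside and abs(depth[y][x] - depth[ny][nx]) > threshold:
                    picked.append((y, x))
                    break
    return picked


def refine_level(
    coarse: DepthMap,
    estimate_pixel: Callable[[int, int, float], float],
    threshold: float = 0.5,
    factor: int = 2,
) -> Tuple[DepthMap, List[Tuple[int, int]]]:
    """One pyramid level: upsample, then re-estimate only the selected pixels."""
    up = upsample_nearest(coarse, factor)
    selected = select_locations(up, threshold)
    refined = [row[:] for row in up]
    for y, x in selected:
        # Each selected pixel is handled independently of the others.
        refined[y][x] = estimate_pixel(y, x, up[y][x])
    return refined, selected


# Usage: a 2x2 coarse map with one depth discontinuity.
coarse = [[1.0, 1.0], [1.0, 5.0]]
refined, selected = refine_level(coarse, lambda y, x, d: d + 0.1)
print(f"re-estimated {len(selected)} of {len(refined) * len(refined[0])} pixels")
```

In this toy setting only the pixels near the discontinuity are re-estimated, and the smooth region keeps its upsampled values. That is the source of the savings the abstract attributes to skipping well-estimated positions.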
Pages: 268-278
Page count: 11