MVSNet: Depth Inference for Unstructured Multi-view Stereo

被引:631
|
作者
Yao, Yao [1 ]
Luo, Zixin [1 ]
Li, Shiwei [1 ]
Fang, Tian [2 ]
Quan, Long [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Shenzhen Zhuke Innovat Technol Altizure, Shenzhen, Peoples R China
来源
关键词
Multi-view stereo; Depth map; Deep learning; RECONSTRUCTION;
D O I
10.1007/978-3-030-01237-3_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an end-to-end deep learning architecture for depth map inference from multi-view images. In the network, we first extract deep visual image features, and then build the 3D cost volume upon the reference camera frustum via the differentiable homography warping. Next, we apply 3D convolutions to regularize and regress the initial depth map, which is then refined with the reference image to generate the final output. Our framework flexibly adapts arbitrary N-view inputs using a variance-based cost metric that maps multiple features into one cost feature. The proposed MVSNet is demonstrated on the large-scale indoor DTU dataset. With simple post-processing, our method not only significantly outperforms previous state-of-the-arts, but also is several times faster in runtime. We also evaluate MVSNet on the complex outdoor Tanks and Temples dataset, where our method ranks first before April 18, 2018 without any fine-tuning, showing the strong generalization ability of MVSNet.
引用
收藏
页码:785 / 801
页数:17
相关论文
共 50 条
  • [31] Joint bilateral propagation upsampling for unstructured multi-view stereo
    Mengqiang Wei
    Qingan Yan
    Fei Luo
    Chengfang Song
    Chunxia Xiao
    [J]. The Visual Computer, 2019, 35 : 797 - 809
  • [32] Joint bilateral propagation upsampling for unstructured multi-view stereo
    Wei, Mengqiang
    Yan, Qingan
    Luo, Fei
    Song, Chengfang
    Xiao, Chunxia
    [J]. VISUAL COMPUTER, 2019, 35 (6-8): : 797 - 809
  • [33] Bi-directional Recurrent MVSNet for High-resolution Multi-view Stereo
    Fujitomi, Taku
    Ito, Seiya
    Kaneko, Naoshi
    Sumi, Kazuhiko
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [34] Adaptive depth estimation for pyramid multi-view stereo
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Luo, Fei
    Xiao, Chunxia
    [J]. COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
  • [35] Learning Depth for Multi-View Stereo with Adversarial Training
    Wang, Liang
    Fan, Deqiao
    Li, Jianshu
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1674 - 1679
  • [36] REVISED DEPTH MAP ESTIMATION FOR MULTI-VIEW STEREO
    Yao, Yao
    Zhu, Hao
    Nie, Yongming
    Ji, Xiaoli
    Cao, Xun
    [J]. 2014 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2014,
  • [37] Edge aware depth inference for large-scale aerial building multi-view stereo
    Zhang, Song
    Wei, Zhiwei
    Xu, Wenjia
    Zhang, Lili
    Wang, Yang
    Zhang, Jinming
    Liu, Junyi
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 207 : 27 - 42
  • [38] Multi-View Guided Multi-View Stereo
    Poggi, Matteo
    Conti, Andrea
    Mattoccia, Stefano
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8391 - 8398
  • [39] NTPP-MVSNet: Multi-View Stereo Network Based on Neighboring Tangent Plane Propagation
    Zhao, Qi
    Deng, Yangyan
    Yang, Yifan
    Li, Yawei
    Yuan, Ding
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [40] Multi-View Stereo and Depth Priors Guided NeRF for View Synthesis
    Deng, Wang
    Zhang, Xuetao
    Guo, Yu
    Lu, Zheng
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3922 - 3928