PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference

被引:2
|
作者
Jie, Leiping [1 ,2 ]
Zhang, Hui [2 ]
机构
[1] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[2] BNU HKBU United Int Coll, Guangdong Key Lab Interdisciplinary Res & Applica, Zhuhai, Peoples R China
基金
中国国家自然科学基金;
关键词
Depth estimation; Patch-based similarity; Dynamic depth range; Multi-view stereo;
D O I
10.1007/978-3-031-15919-0_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes PSP-MVSNet for depth inference problem in multi-view stereo (MVS). We first introduce a novel patch-based similarity perceptual (PSP) module for effectively constructing 3D cost volume. Unlike previous methods that leverage variance-based operators to fuse feature volumes of different views, our method leverages a cosine similarity measure to calculate matching scores for pairs of deep feature vectors and then treats these scores as weights for constructing the 3D cost volume. This is based on an important observation that many performance degradation factors, e.g., illumination changes or occlusions, will lead to pixel differences between multi-view images. We demonstrate that a patch-based cosine similarity can be used as explicit supervision for feature learning and can help speed up convergence. Furthermore, To adaptively set different depth ranges for different pixels, we extend an existing dynamic depth range searching method with a simple yet effective improvement. We can use this improved searching method to train our model in an end-to-end manner and further improve the performance of our method. Experimental results show that our method achieves state-of-the-art performance on the DTU dataset and comparative results on the intermediate set of Tanks and Temples dataset.
引用
收藏
页码:316 / 328
页数:13
相关论文
共 50 条
  • [1] MVSNet: Depth Inference for Unstructured Multi-view Stereo
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Fang, Tian
    Quan, Long
    [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 785 - 801
  • [2] Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Shen, Tianwei
    Fang, Tian
    Quan, Long
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5520 - 5529
  • [3] DRI-MVSNet: A depth residual inference network for multi-view stereo images
    Li, Ying
    Li, Wenyue
    Zhao, Zhijie
    Fan, JiaHao
    [J]. PLOS ONE, 2022, 17 (03):
  • [4] DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis
    Li, Jingliang
    Lu, Zhengda
    Wang, Yiqun
    Wang, Ying
    Xiao, Jun
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5593 - 5601
  • [5] EPP-MVSNet: Epipolar-assembling based Depth Prediction for Multi-view Stereo
    Ma, Xinjun
    Gong, Yue
    Wang, Qirui
    Huang, Jingwei
    Chen, Lei
    Yu, Fan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5712 - 5720
  • [6] NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement
    Li, Jingliang
    Lu, Zhengda
    Wang, Yiqun
    Xiao, Jun
    Wang, Ying
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2649 - 2662
  • [7] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose M.
    Liu, Miaomiao
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4876 - 4885
  • [8] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose
    Liu, Miaomiao
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4748 - 4760
  • [9] MVSNet plus plus : Learning Depth-Based Attention Pyramid Features for Multi-View Stereo
    Chen, Po-Heng
    Yang, Hsiao-Chien
    Chen, Kuan-Wen
    Chen, Yong-Sheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7261 - 7273
  • [10] ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval
    Zhang, Song
    Xu, Wenjia
    Wei, Zhiwei
    Zhang, Lili
    Wang, Yang
    Liu, Junyi
    [J]. PATTERN RECOGNITION, 2023, 144