GeoMVSNet: Learning Multi-View Stereo with Geometry Perception

被引:12
|
作者
Zhang, Zhe [1 ]
Peng, Rui [1 ]
Hu, Yuxi [2 ]
Wang, Ronggang [1 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
SURFACE RECONSTRUCTION;
D O I
10.1109/CVPR52729.2023.02060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent cascade Multi-View Stereo (MVS) methods can efficiently estimate high-resolution depth maps through narrowing hypothesis ranges. However, previous methods ignored the vital geometric information embedded in coarse stages, leading to vulnerable cost matching and sub-optimal reconstruction results. In this paper, we propose a geometry awareness model, termed GeoMVSNet, to explicitly integrate geometric clues implied in coarse stages for delicate depth estimation. In particular, we design a two-branch geometry fusion network to extract geometric priors from coarse estimations to enhance structural feature extraction at finer stages. Besides, we embed the coarse probability volumes, which encode valuable depth distribution attributes, into the lightweight regularization network to further strengthen depth-wise geometry intuition. Meanwhile, we apply the frequency domain filtering to mitigate the negative impact of the high-frequency regions and adopt the curriculum learning strategy to progressively boost the geometry integration of the model. To intensify the full-scene geometry perception of our model, we present the depth distribution similarity loss based on the Gaussian-Mixture Model assumption. Extensive experiments on DTU and Tanks and Temples (T&T) datasets demonstrate that our GeoMVSNet achieves state-of-the-art results and ranks first on the T&T-Advanced set. Code is available at https://github.com/doubleZ0108/GeoMVSNet.
引用
收藏
页码:21508 / 21518
页数:11
相关论文
共 50 条
  • [1] Learning a Multi-View Stereo Machine
    Kar, Abhishek
    Hane, Christian
    Malik, Jitendra
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Multi-View Guided Multi-View Stereo
    Poggi, Matteo
    Conti, Andrea
    Mattoccia, Stefano
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8391 - 8398
  • [3] Learning Depth for Multi-View Stereo with Adversarial Training
    Wang, Liang
    Fan, Deqiao
    Li, Jianshu
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1674 - 1679
  • [4] Learning Patch Reconstructability for Accelerating Multi-View Stereo
    Poms, Alex
    Wu, Chenglei
    Yu, Shoou-I
    Sheikh, Yaser
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3041 - 3050
  • [5] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    [J]. 2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [6] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
  • [7] Planar Catadioptric Stereo: Single and Multi-View Geometry for Calibration and Localization
    Mariottini, Gian Luca
    Scheggi, Stefano
    Morbidi, Fabio
    Prattichizzo, Domenico
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 1510 - 1515
  • [8] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    [J]. FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [9] Learning Efficient Photometric Feature Transform for Multi-view Stereo
    Kang, Kaizhang
    Xie, Cihui
    Zhu, Ruisheng
    Ma, Xiaohe
    Tan, Ping
    Wu, Hongzhi
    Zhou, Kun
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5936 - 5945
  • [10] Learning Descriptor, Confidence, and Depth Estimation in Multi-view Stereo
    Choi, Sungil
    Kim, Seungryong
    Park, Kihong
    Sohn, Kwanghoon
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 389 - 395