360MVSNet: Deep Multi-view Stereo Network with 360° Images for Indoor Scene Reconstruction

被引:1
|
作者
Chiu, Ching-Ya [1 ]
Wu, Yu-Ting [2 ]
Shen, I-Chao [3 ]
Chuang, Yung-Yu [1 ]
机构
[1] Natl Taiwan Univ, New Taipei, Taiwan
[2] Natl Taipei Univ, New Taipei, Taiwan
[3] Univ Tokyo, Tokyo, Japan
关键词
D O I
10.1109/WACV56688.2023.00307
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent multi-view stereo methods have achieved promising results with the advancement of deep learning techniques. Despite of the progress, due to the limited fields of view of regular images, reconstructing large indoor environments still requires collecting many images with sufficient visual overlap, which is quite labor-intensive. 360 degrees images cover a much larger field of view than regular images and would facilitate the capture process. In this paper, we present 360MVSNet, the first deep learning network for multi-view stereo with 360 degrees images. Our method combines uncertainty estimation with a spherical sweeping module for 360 degrees images captured from multiple viewpoints in order to construct multi-scale cost volumes. By regressing volumes in a coarse-to-fine manner, high-resolution depth maps can be obtained. Furthermore, we have constructed EQMVS, a large-scale synthetic dataset that consists of over 50K pairs of RGB and depth maps in equirectangular projection. Experimental results demonstrate that our method can reconstruct large synthetic and real-world indoor scenes with significantly better completeness than previous traditional and learning-based methods while saving both time and effort in the data acquisition process.
引用
收藏
页码:3056 / 3065
页数:10
相关论文
共 50 条
  • [31] MODE: Multi-view Omnidirectional Depth Estimation with 360° Cameras
    Li, Ming
    Jin, Xueqian
    Hu, Xuejiao
    Dai, Jingzhao
    Du, Sidan
    Li, Yang
    COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 : 197 - 213
  • [32] EA-MVSNet: Learning Error-Awareness for Enhanced Multi-View Stereo
    Gu, Wencong
    Xiao, Haihong
    Zhao, Xueyan
    Kang, Wenxiong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12127 - 12141
  • [33] PA-MVSNet: Sparse-to-Dense Multi-View Stereo With Pyramid Attention
    Zhang, Ke
    Liu, Mengyu
    Zhang, Jinlai
    Dong, Zhenbiao
    IEEE ACCESS, 2021, 9 : 27908 - 27915
  • [34] Bi-directional Recurrent MVSNet for High-resolution Multi-view Stereo
    Fujitomi, Taku
    Ito, Seiya
    Kaneko, Naoshi
    Sumi, Kazuhiko
    PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [35] Projection-based registration using a multi-view camera for indoor scene reconstruction
    Kim, S
    Woo, W
    FIFTH INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 2005, : 484 - 491
  • [36] DeepC-MVS: Deep Confidence Prediction for Multi-View Stereo Reconstruction
    Kuhn, Andreas
    Sormann, Christian
    Rossi, Mattia
    Erdler, Oliver
    Fraundorfer, Friedrich
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 404 - 413
  • [37] Deep Multi-View Stereo Gone Wild
    Darmon, Francois
    Bascle, Benedicte
    Devaux, Jean-Clement
    Monasse, Pascal
    Aubry, Mathieu
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 484 - 493
  • [38] Deep scene-scale material estimation from multi-view indoor captures
    Prakash, Siddhant
    Rainer, Gilles
    Bousseau, Adrien
    Drettakis, George
    COMPUTERS & GRAPHICS-UK, 2022, 109 : 15 - 29
  • [39] Multi-view stereo network with point attention
    Zhao, Rong
    Gu, Zhuoer
    Han, Xie
    He, Ligang
    Sun, Fusheng
    Jiao, Shichao
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26622 - 26636
  • [40] Multi-view stereo network with point attention
    Rong Zhao
    Zhuoer Gu
    Xie Han
    Ligang He
    Fusheng Sun
    Shichao Jiao
    Applied Intelligence, 2023, 53 : 26622 - 26636