Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction

Cited by: 1
Authors
Lu, Ping [1]
Cai, Youcheng [2]
Yang, Jiale [3]
Wang, Dong [4]
Wu, Tingting [5]
Affiliations
[1] State Key Lab Mobile Network & Mobile Multimedia T, Shenzhen, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Anhui Jianzhu Univ, Hefei, Peoples R China
[5] Anhui Agr Univ, Hefei, Peoples R China
Source
Keywords
Multi-view stereo; Uncertainty; Group-wise correlation; Cost volume aggregation; NETWORK
DOI
10.1007/s00371-024-03678-8
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Multi-view stereo (MVS), which aims to reconstruct a 3D point cloud model from multi-view images, plays a vital role in 3D reconstruction. Recently, learning-based MVS methods have demonstrated excellent performance compared with traditional MVS methods. Almost all current learning-based MVS methods focus on improving the accuracy and completeness of the reconstruction results. However, scalability remains a major limitation because of memory constraints. In this paper, a cascaded network with uncertainty-aware cost volume aggregation, named UANet, is proposed for efficient and effective dense 3D reconstruction. In particular, we present a novel uncertainty-aware cost volume aggregation approach that takes pair-wise uncertainty maps as guidance to adaptively aggregate cost volumes. Instead of applying 3D convolutional neural networks (CNNs), we compute the uncertainty maps from the feature difference with a shallow 2D CNN, which ensures both efficiency and effectiveness. Moreover, we adopt a coarse-to-fine strategy and use group-wise correlation to construct lightweight cost volumes, thus reducing memory consumption and enabling high-resolution depth map inference. Finally, an uncertainty loss is designed to construct the uncertainty map, which further boosts performance. Experimental results show that UANet outperforms the previous state-of-the-art methods on three benchmarks: the DTU dataset, the Tanks and Temples dataset, and the BlendedMVS dataset. In addition, the runtime and memory requirements validate the effectiveness of UANet.
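The two cost-volume ideas named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the tensor layouts, and the `exp(-uncertainty)` weighting are assumptions made for illustration, and the uncertainty maps are taken as given inputs rather than predicted by a 2D CNN.

```python
import numpy as np


def groupwise_correlation(ref, src, groups):
    """Build a lightweight cost volume by group-wise correlation.

    ref: reference features, shape (C, H, W).
    src: source features already warped to D depth hypotheses, shape (C, D, H, W).
    Returns a volume of shape (groups, D, H, W) instead of (C, D, H, W).
    """
    c, d, h, w = src.shape
    if c % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = c // groups
    ref_g = ref.reshape(groups, per_group, 1, h, w)
    src_g = src.reshape(groups, per_group, d, h, w)
    # Mean of the channel-wise products inside each group.
    return (ref_g * src_g).mean(axis=1)


def uncertainty_weighted_aggregation(volumes, uncertainties, eps=1e-6):
    """Fuse pair-wise cost volumes with per-pixel weights.

    volumes: shape (N, G, D, H, W), one volume per reference/source pair.
    uncertainties: shape (N, H, W), non-negative, one map per pair.
    Assumption: weight = exp(-uncertainty), normalised over the N pairs,
    so that less certain pairs contribute less at each pixel.
    """
    weights = np.exp(-uncertainties)
    weights = weights / (weights.sum(axis=0, keepdims=True) + eps)
    # Broadcast the (N, H, W) weights over the group and depth axes.
    return (volumes * weights[:, None, None, :, :]).sum(axis=0)
```

With 8 feature channels and 4 groups, for example, the volume keeps 4 channels per depth hypothesis rather than 8, which is where the memory saving of group-wise correlation comes from.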
Pages: 4567 - 4580
Number of pages: 14
Related papers
50 in total
  • [41] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
    Bo Yang
    Sen Wang
    Andrew Markham
    Niki Trigoni
    International Journal of Computer Vision, 2020, 128 : 53 - 73
  • [42] Efficient Multi-view Stereo by Iterative Dynamic Cost Volume
    Wang, Shaoqian
    Li, Bo
    Dai, Yuchao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8645 - 8654
  • [43] An attention-based and deep sparse priori cascade multi-view stereo network for 3D reconstruction
    Wang, Yadong
    Ran, Teng
    Liang, Yuan
    Zheng, Guoquan
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 383 - 392
  • [44] Topology-based UAV path planning for multi-view stereo 3D reconstruction of complex structures
    Shang, Zhexiong
    Shen, Zhigang
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 909 - 926
  • [45] A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images
    Gao, Jian
    Liu, Jin
    Ji, Shunping
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 195 : 446 - 461
  • [46] Topology-based UAV path planning for multi-view stereo 3D reconstruction of complex structures
    Zhexiong Shang
    Zhigang Shen
    Complex & Intelligent Systems, 2023, 9 : 909 - 926
  • [47] Research on Multi-View 3D Reconstruction Technology Based on SFM
    Gao, Lei
    Zhao, Yingbao
    Han, Jingchang
    Liu, Huixian
    SENSORS, 2022, 22 (12)
  • [48] Unsupervised 3D reconstruction method based on multi-view propagation
    Luo J.
    Yuan D.
    Zhang L.
    Qu Y.
    Su S.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2024, 42 (01) : 129 - 137
  • [49] FLAME-Based Multi-view 3D Face Reconstruction
    Zheng, Wenzhuo
    Zhao, Junhao
    Liu, Xiaohong
    Pan, Yongyang
    Gan, Zhenghao
    Han, Haozhe
    Liu, Ning
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT IV, 2024, 14498 : 327 - 339
  • [50] Enhancing 3D reconstruction of textureless indoor scenes with IndoReal multi-view stereo (MVS)
    Wang, Tao
    Gan, Vincent J. L.
    AUTOMATION IN CONSTRUCTION, 2024, 166