Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction

被引:1
|
作者
Lu, Ping [1 ]
Cai, Youcheng [2 ]
Yang, Jiale [3 ]
Wang, Dong [4 ]
Wu, Tingting [5 ]
机构
[1] State Key Lab Mobile Network & Mobile Multimedia T, Shenzhen, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Anhui Jianzhu Univ, Hefei, Peoples R China
[5] Anhui Agr Univ, Hefei, Peoples R China
来源
关键词
Multi-view stereo; Uncertainty; Group-wise correlation; Cost volume aggregation; NETWORK;
D O I
10.1007/s00371-024-03678-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-view stereo (MVS) plays a vital role in 3D reconstruction, which aims to reconstruct the 3D point cloud model from multi-view images. Recently, learning-based MVS methods have demonstrated excellent performance compared with traditional MVS methods. Almost all current learning-based MVS methods focus on improving the accuracy and completeness of the reconstruction results. However, scalability remains a major limitation due to the memory constraint. In this paper, a cascaded network with an uncertainty-aware cost volume aggregation named UANet is proposed for efficient and effective dense 3D reconstruction. In particular, we present a novel uncertainty-aware cost volume aggregation approach that takes pair-wise uncertainty maps as guidance to adaptively aggregate cost volumes. Instead of applying 3D convolutional neural networks (CNNs), we introduce the feature difference with a shallow 2D CNN to compute uncertainty maps, which guarantees both efficiency and effectiveness. Moreover, we adopt a coarse-to-fine strategy and use a group-wise correlation to construct lightweight cost volumes, thus reducing the memory consumption and enabling high-resolution depth map inference. Finally, an uncertainty loss is designed to construct the uncertainty map, which can further boost the performance. Experimental results show that UANet outperforms the previous state-of-the-art methods on three benchmarks of DTU benchmark dataset, Tanks and Temples dataset, and BlendedMVS dataset. Besides, the runtime and memory requirements validate the effectiveness of UANet.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Attention aware cost volume pyramid based multi-view stereo network for 3D reconstruction
    Yu, Anzhu
    Guo, Wenyue
    Liu, Bing
    Chen, Xin
    Wang, Xin
    Cao, Xuefeng
    Jiang, Bingchuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 175 : 448 - 460
  • [2] Uncertainty-Aware Deep Multi-View Photometric Stereo
    Kaya, Berk
    Kumar, Suryansh
    Oliveira, Carlos
    Ferrari, Vittorio
    Van Gool, Luc
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12591 - 12601
  • [3] Underwater 3D reconstruction based on multi-view stereo
    Gu, Feifei
    Zhao, Juan
    Xu, Pei
    Huang, Shulan
    Zhang, Gaopeng
    Song, Zhan
    OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
  • [4] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [5] 3D Face Reconstruction based on Multi-View Stereo Algorithm
    Peng, Keju
    Guan, Tao
    Xu, Chao
    Zhou, Dongxiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 2310 - 2314
  • [6] Uncertainty-Aware Multi-View Representation Learning
    Geng, Yu
    Han, Zongbo
    Zhang, Changqing
    Hu, Qinghua
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7545 - 7553
  • [7] 3D Semi-Supervised Learning with Uncertainty-Aware Multi-View Co-Training
    Xia, Yingda
    Liu, Fengze
    Yang, Dong
    Cai, Jinzheng
    Yu, Lequan
    Zhu, Zhuotun
    Xu, Daguang
    Yuille, Alan
    Roth, Holger
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3635 - 3644
  • [8] Accurate Multi-View Stereo 3D Reconstruction for Cost-Effective Plant Phenotyping
    Lou, Lu
    Liu, Yonghuai
    Han, Jiwan
    Doonan, John H.
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT II, 2014, 8815 : 349 - 356
  • [9] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Youcheng Cai
    Lin Li
    Dong Wang
    Xiaoping Liu
    Applied Intelligence, 2023, 53 : 4289 - 4301
  • [10] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Cai, Youcheng
    Li, Lin
    Wang, Dong
    Liu, Xiaoping
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4289 - 4301