Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction

被引:1
|
作者
Lu, Ping [1 ]
Cai, Youcheng [2 ]
Yang, Jiale [3 ]
Wang, Dong [4 ]
Wu, Tingting [5 ]
机构
[1] State Key Lab Mobile Network & Mobile Multimedia T, Shenzhen, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Anhui Jianzhu Univ, Hefei, Peoples R China
[5] Anhui Agr Univ, Hefei, Peoples R China
来源
关键词
Multi-view stereo; Uncertainty; Group-wise correlation; Cost volume aggregation; NETWORK;
D O I
10.1007/s00371-024-03678-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-view stereo (MVS) plays a vital role in 3D reconstruction, which aims to reconstruct the 3D point cloud model from multi-view images. Recently, learning-based MVS methods have demonstrated excellent performance compared with traditional MVS methods. Almost all current learning-based MVS methods focus on improving the accuracy and completeness of the reconstruction results. However, scalability remains a major limitation due to the memory constraint. In this paper, a cascaded network with an uncertainty-aware cost volume aggregation named UANet is proposed for efficient and effective dense 3D reconstruction. In particular, we present a novel uncertainty-aware cost volume aggregation approach that takes pair-wise uncertainty maps as guidance to adaptively aggregate cost volumes. Instead of applying 3D convolutional neural networks (CNNs), we introduce the feature difference with a shallow 2D CNN to compute uncertainty maps, which guarantees both efficiency and effectiveness. Moreover, we adopt a coarse-to-fine strategy and use a group-wise correlation to construct lightweight cost volumes, thus reducing the memory consumption and enabling high-resolution depth map inference. Finally, an uncertainty loss is designed to construct the uncertainty map, which can further boost the performance. Experimental results show that UANet outperforms the previous state-of-the-art methods on three benchmarks of DTU benchmark dataset, Tanks and Temples dataset, and BlendedMVS dataset. Besides, the runtime and memory requirements validate the effectiveness of UANet.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose
    Liu, Miaomiao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4748 - 4760
  • [32] Overview of 3D Reconstruction Methods Based on Multi-view
    Li, Mengxin
    Zheng, Dai
    Zhang, Rui
    Yin, Jiadi
    Tian, Xiangqian
    2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,
  • [33] 3D Reconstruction for Multi-view Objects
    Yu, Jun
    Yin, Wenbin
    Hu, Zhiyi
    Liu, Yabin
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [34] Multi-view 3D Reconstruction with Transformers
    Wang, Dan
    Cui, Xinrui
    Chen, Xun
    Zou, Zhengxia
    Shi, Tianyang
    Salcudean, Septimiu
    Wang, Z. Jane
    Ward, Rabab
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
  • [35] Uncertainty-Aware Multi-view Learning for Prostate Cancer Grading with DWI
    Dong, Zhicheng
    Yue, Xiaodong
    Chen, Yufei
    Zhou, Xujing
    Liang, Jiye
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 739 - 748
  • [36] Engineering Monitoring and Change Detection for Multi-View Stereo 3D Reconstruction Technology
    Chang T.-R.
    Lee L.-H.
    Journal of the Chinese Institute of Civil and Hydraulic Engineering, 2019, 31 (04): : 337 - 350
  • [37] Prior depth-based multi-view stereo network for online 3D model reconstruction
    Song, Soohwan
    Truong, Khang Giang
    Kim, Daekyum
    Jo, Sungho
    PATTERN RECOGNITION, 2023, 136
  • [38] GARNet: Global-aware multi-view 3D reconstruction network and the cost-performance tradeoff
    Zhu, Zhenwei
    Yang, Liying
    Lin, Xuxin
    Yang, Lin
    Liang, Yanyan
    PATTERN RECOGNITION, 2023, 142
  • [39] GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
    Wu, Jiang
    Li, Rui
    Xu, Haofei
    Zhao, Wenxun
    Zhu, Yu
    Sun, Jinqiu
    Zhang, Yanning
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 20207 - 20216
  • [40] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
    Yang, Bo
    Wang, Sen
    Markham, Andrew
    Trigoni, Niki
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (01) : 53 - 73