Multi-View Stereo with Learnable Cost Metric

被引:0
|
作者
Yang, Guidong [1 ]
Zhou, Xunkuai [1 ,2 ]
Gao, Chuanxiang [1 ]
Zhao, Benyun [1 ]
Zhang, Jihan [1 ]
Chen, Yizhou [1 ]
Chen, Xi [1 ]
Chen, Ben M. [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Shatin, Hong Kong, Peoples R China
[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai, Peoples R China
关键词
depth estimation; cost volume aggregation; multi-view stereo; 3D reconstruction; UAV;
D O I
10.1109/IROS55552.2023.10341606
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present LCM-MVSNet, a novel multi-view stereo (MVS) network with learnable cost metric (LCM) for more accurate and complete depth estimation and dense point cloud reconstruction. To adapt to the scene variation and improve the reconstruction quality in non-Lambertian lowtextured scenes, we propose LCM to adaptively aggregate multiview matching similarity into the 3D cost volume by leveraging sparse points hints. The proposed LCM benefits the MVS approaches in four folds, including depth estimation enhancement, reconstruction quality improvement, memory footprint reduction, and computational burden alleviation, allowing the depth inference for high-resolution images to achieve more accurate and complete reconstruction. Moreover, we improve the depth estimation by enhancing the propagation of shallow features via a bottom-up path and strengthen the end-to-end supervision by adapting the focal loss to reduce ambiguity caused by sample imbalance. Extensive experiments on two benchmark datasets show that our network achieves state-of-the-art performance on the DTU dataset and exhibits strong generalization ability with a competitive performance on the Tanks and Temples benchmark. Furthermore, we deploy our LCM-MVSNet into the real-world application for large-scale 3D reconstruction based on multi-view aerial images collected by self-developed UAV, demonstrating the robustness and scalability of our method. More detailed results are available in the Appendix(1).
引用
收藏
页码:3017 / 3024
页数:8
相关论文
共 50 条
  • [1] Learnable Cost Metric-Based Multi-View Stereo for Point Cloud Reconstruction
    Yang, Guidong
    Zhou, Xunkuai
    Gao, Chuanxiang
    Chen, Xi
    Chen, Ben M.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (09) : 11519 - 11528
  • [2] MULTI-VIEW IMAGE FEATURE CORRELATION GUIDED COST AGGREGATION FOR MULTI-VIEW STEREO
    Lai, Yawen
    Qiu, Ke
    Wang, Ronggang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [3] Multi-View Guided Multi-View Stereo
    Poggi, Matteo
    Conti, Andrea
    Mattoccia, Stefano
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8391 - 8398
  • [4] CostFormer: Cost Transformer for Cost Aggregation in Multi-view Stereo
    Chen, Weitao
    Xu, Hongbin
    Zhou, Zhipeng
    Liu, Yang
    Sun, Baigui
    Kang, Wenxiong
    Xie, Xuansong
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 599 - 608
  • [5] Efficient Multi-view Stereo by Iterative Dynamic Cost Volume
    Wang, Shaoqian
    Li, Bo
    Dai, Yuchao
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8645 - 8654
  • [6] Learnable Graph Filter for Multi-view Clustering
    Zhou, Peng
    Du, Liang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3089 - 3098
  • [7] Refractive Multi-view Stereo
    Cassidy, Matthew
    Melou, Jean
    Queau, Yvain
    Lauze, Francois
    Durou, Jean-Denis
    [J]. 2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 384 - 393
  • [8] Polarimetric Multi-View Stereo
    Cui, Zhaopeng
    Gu, Jinwei
    Shi, Boxin
    Tan, Ping
    Kautz, Jan
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 369 - 378
  • [9] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    [J]. FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [10] MULTI-VIEW METRIC LEARNING FOR MULTI-VIEW VIDEO SUMMARIZATION
    Wang, Linbo
    Fang, Xianyong
    Guo, Yanwen
    Fu, Yanwei
    [J]. 2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 179 - 182