Learnable Cost Metric-Based Multi-View Stereo for Point Cloud Reconstruction

被引:2
|
作者
Yang, Guidong [1 ]
Zhou, Xunkuai [1 ]
Gao, Chuanxiang [1 ]
Chen, Xi [1 ]
Chen, Ben M. [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Shatin, Hong Kong, Peoples R China
关键词
Defect inspection; depth estimation; diagnosis and monitoring; intelligent system; multi-view stereo (MVS); reconstruction; unmanned aerial vehicle (UAV);
D O I
10.1109/TIE.2023.3337697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
3-D reconstruction is essential to defect localization. This article proposes LCM-MVSNet, a novel multi-view stereo (MVS) network with learnable cost metric (LCM) for more accurate and complete dense point cloud reconstruction. To adapt to the scene variation and improve the reconstruction quality in non-Lambertian low-textured scenes, we propose LCM to adaptively aggregate multi-view matching similarity into the 3-D cost volume by leveraging sparse point hints. The proposed LCM benefits the MVS approaches in four folds, including depth estimation enhancement, reconstruction quality improvement, memory footprint reduction, and computational burden alleviation, allowing the depth inference for high-resolution images to achieve more accurate and complete reconstruction. In addition, we improve the depth estimation by enhancing the shallow feature propagation via a bottom-up pathway and strengthen the end-to-end supervision by adapting the focal loss to reduce ambiguity caused by sample imbalance. Extensive experiments on three benchmark datasets show that our method achieves state-of-the-art performance on the DTU and BlendedMVS dataset, and exhibits strong generalization ability with a competitive performance on the Tanks and Temples benchmark. Furthermore, we deploy our LCM-MVSNet into our UAV-based infrastructure defect inspection framework for infrastructure reconstruction and defect localization, demonstrating the effectiveness and efficiency of our method. More experiment results can be found in the Appendix.
引用
收藏
页码:11519 / 11528
页数:10
相关论文
共 50 条
  • [31] Multi-view Stereo Reconstruction via Homogeneous Spatial Expansion
    Li Y.
    Li Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2018, 30 (01): : 124 - 137
  • [32] Multi-View Stereo Reconstruction with High Dynamic Range Texture
    Lu, Feng
    Ji, Xiangyang
    Dai, Qionghai
    Er, Guihua
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 412 - +
  • [33] A surface-growing approach to multi-view, stereo reconstruction
    Habbecke, Martin
    Kobbelt, Leif
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1720 - +
  • [34] Multi-view stereo reconstruction of dense shape and complex appearance
    Jin, HL
    Soatto, S
    Yezzi, AJ
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (03) : 175 - 189
  • [35] Multi-View Stereo Reconstruction of Dense Shape and Complex Appearance
    Hailin Jin
    Stefano Soatto
    Anthony J. Yezzi
    International Journal of Computer Vision, 2005, 63 : 175 - 189
  • [36] Hole-filling algorithm in multi-view stereo reconstruction
    Wu, Xiaojun
    Wen, Fei
    Wen, Peizhi
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2012, 24 (12): : 1606 - 1613
  • [37] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [38] POEM: Reconstructing Hand in a Point Embedded Multi-view Stereo
    Yang, Lixin
    Xu, Jian
    Zhong, Licheng
    Zhan, Xinyu
    Wang, Zhicheng
    Wu, Kejian
    Lu, Cewu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21108 - 21117
  • [39] Efficient Multi-view Stereo by Iterative Dynamic Cost Volume
    Wang, Shaoqian
    Li, Bo
    Dai, Yuchao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8645 - 8654
  • [40] Conditional Single-view Shape Generation for Multi-view Stereo Reconstruction
    Wei, Yi
    Liu, Shaohui
    Zhao, Wang
    Lu, Jiwen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9643 - 9652