Learnable Cost Metric-Based Multi-View Stereo for Point Cloud Reconstruction

被引:2
|
作者
Yang, Guidong [1 ]
Zhou, Xunkuai [1 ]
Gao, Chuanxiang [1 ]
Chen, Xi [1 ]
Chen, Ben M. [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Shatin, Hong Kong, Peoples R China
关键词
Defect inspection; depth estimation; diagnosis and monitoring; intelligent system; multi-view stereo (MVS); reconstruction; unmanned aerial vehicle (UAV);
D O I
10.1109/TIE.2023.3337697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
3-D reconstruction is essential to defect localization. This article proposes LCM-MVSNet, a novel multi-view stereo (MVS) network with learnable cost metric (LCM) for more accurate and complete dense point cloud reconstruction. To adapt to the scene variation and improve the reconstruction quality in non-Lambertian low-textured scenes, we propose LCM to adaptively aggregate multi-view matching similarity into the 3-D cost volume by leveraging sparse point hints. The proposed LCM benefits the MVS approaches in four folds, including depth estimation enhancement, reconstruction quality improvement, memory footprint reduction, and computational burden alleviation, allowing the depth inference for high-resolution images to achieve more accurate and complete reconstruction. In addition, we improve the depth estimation by enhancing the shallow feature propagation via a bottom-up pathway and strengthen the end-to-end supervision by adapting the focal loss to reduce ambiguity caused by sample imbalance. Extensive experiments on three benchmark datasets show that our method achieves state-of-the-art performance on the DTU and BlendedMVS dataset, and exhibits strong generalization ability with a competitive performance on the Tanks and Temples benchmark. Furthermore, we deploy our LCM-MVSNet into our UAV-based infrastructure defect inspection framework for infrastructure reconstruction and defect localization, demonstrating the effectiveness and efficiency of our method. More experiment results can be found in the Appendix.
引用
下载
收藏
页码:11519 / 11528
页数:10
相关论文
共 50 条
  • [1] Multi-View Stereo with Learnable Cost Metric
    Yang, Guidong
    Zhou, Xunkuai
    Gao, Chuanxiang
    Zhao, Benyun
    Zhang, Jihan
    Chen, Yizhou
    Chen, Xi
    Chen, Ben M.
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3017 - 3024
  • [2] Accurate stereo 3D point cloud generation suitable for multi-view stereo reconstruction
    Kordelas, Georgios A.
    Daras, Petros
    Klavdianos, Patrycia
    Izquierdo, Ebroul
    Zhang, Qianni
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 307 - 310
  • [3] Exploring the Point Feature Relation on Point Cloud for Multi-View Stereo
    Zhao, Rong
    Han, Xie
    Guo, Xindong
    Kuang, Liqun
    Yang, Xiaowen
    Sun, Fusheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6747 - 6763
  • [4] GPU-based multi-view stereo reconstruction
    Wang, Bo Ling
    Jiang, Yan Feng
    Peng, Zhen
    Yu, Sheng Chen
    INFORMATION TECHNOLOGY AND COMPUTER APPLICATION ENGINEERING, 2014, : 451 - 454
  • [5] Automatic segmentation of plant point cloud from Multi-view stereo
    Guo, Jingwei
    Li, Dawei
    Xu, Lihong
    PROCEEDINGS OF THE 2016 INTERNATIONAL FORUM ON MECHANICAL, CONTROL AND AUTOMATION (IFMCA 2016), 2017, 113 : 487 - 493
  • [6] Point-Based Multi-View Stereo Network
    Chen, Rui
    Han, Songfang
    Xu, Jing
    Su, Hao
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1538 - 1547
  • [7] Assisted multi-view stereo reconstruction
    Dellepiane, Matteo
    Cavarretta, Emanuele
    Cignoni, Paolo
    Scopigno, Roberto
    2013 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2013), 2013, : 318 - 325
  • [8] Multi-view stereo network with point attention
    Zhao, Rong
    Gu, Zhuoer
    Han, Xie
    He, Ligang
    Sun, Fusheng
    Jiao, Shichao
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26622 - 26636
  • [9] Multi-view stereo network with point attention
    Rong Zhao
    Zhuoer Gu
    Xie Han
    Ligang He
    Fusheng Sun
    Shichao Jiao
    Applied Intelligence, 2023, 53 : 26622 - 26636
  • [10] USING POINT CORRESPONDENCES WITHOUT PROJECTIVE DEFORMATION FOR MULTI-VIEW STEREO RECONSTRUCTION
    Auclair, Adrien
    Vincent, Nicole
    Cohen, Laurent D.
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 193 - 196