Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo

Cited by: 11
Authors
Wang, Yuesong [1]
Zeng, Zhaojie [1]
Guan, Tao [1]
Yang, Wei [1]
Chen, Zhuo [1]
Liu, Wenkai [1]
Xu, Luoyuan [1]
Luo, Yawei [2]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Zhejiang Univ, Sch Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
Funding
National Key Research and Development Program of China;
DOI
10.1109/CVPR52729.2023.00162
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, deep learning-based approaches have shown great strength in multi-view stereo because of their outstanding ability to extract robust visual features. However, to obtain satisfactory results in large-scale textureless regions, most learning-based methods need to build a cost volume and enlarge the receptive field enormously, which leads to prohibitive memory consumption. To be both memory-friendly and textureless-resilient, we transplant the spirit of deformable convolution from deep learning into the traditional PatchMatch-based method. Specifically, for each pixel with matching ambiguity (termed an unreliable pixel), we adaptively deform the patch centered on it to extend the receptive field until it covers enough correlative reliable pixels (those without matching ambiguity) that serve as anchors. When performing PatchMatch, constrained by the anchor pixels, the matching cost of an unreliable pixel is guaranteed to reach the global minimum at the correct depth, which significantly increases the robustness of multi-view stereo. To detect more anchor pixels and thus ensure better adaptive patch deformation, we propose to evaluate the matching ambiguity of a pixel by checking the convergence of its estimated depth as optimization proceeds. As a result, our method achieves state-of-the-art performance on ETH3D and Tanks and Temples while preserving low memory consumption.
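The two ideas in the abstract (flagging a pixel as reliable when its depth estimate converges, then widening an unreliable pixel's support until it covers enough reliable anchors) can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not specify the convergence criterion or how the patch is deformed, so the relative-spread test, the square growing window used here in place of true patch deformation, and the names `reliability_mask`, `find_anchors`, `rel_tol`, `min_anchors`, and `max_radius` are all assumptions made for illustration.

```python
import numpy as np


def reliability_mask(depth_history, rel_tol=0.01):
    """Flag pixels whose depth estimate has converged (assumed criterion).

    depth_history: array of shape (T, H, W) holding the depth map after each
    of T optimization iterations. A pixel counts as reliable when the spread
    of its depth over those iterations is at most rel_tol times its final
    depth. This threshold rule is a hypothetical stand-in for the paper's test.
    """
    depth_history = np.asarray(depth_history, dtype=float)
    spread = depth_history.max(axis=0) - depth_history.min(axis=0)
    final = np.abs(depth_history[-1])
    return spread <= rel_tol * np.maximum(final, 1e-12)


def find_anchors(reliable, y, x, min_anchors=4, max_radius=50):
    """Collect reliable anchor pixels around an unreliable pixel (y, x).

    Simplification: instead of deforming the patch, grow a square window
    until it contains at least min_anchors reliable pixels. Returns an
    (N, 2) integer array of (row, col) coordinates, or an empty array if
    max_radius is reached first.
    """
    h, w = reliable.shape
    for r in range(1, max_radius + 1):
        y0, y1 = max(0, y - r), min(h, y + r + 1)
        x0, x1 = max(0, x - r), min(w, x + r + 1)
        ys, xs = np.nonzero(reliable[y0:y1, x0:x1])
        if len(ys) >= min_anchors:
            # Shift window-local indices back to image coordinates.
            return np.stack([ys + y0, xs + x0], axis=1)
    return np.empty((0, 2), dtype=int)


# Toy example: a 5x5 depth map whose inner 3x3 block keeps oscillating.
history = np.full((3, 5, 5), 2.0)
history[:, 1:4, 1:4] = np.array([1.0, 3.0, 2.0])[:, None, None]
mask = reliability_mask(history)
anchors = find_anchors(mask, 2, 2)
```

In this toy case the oscillating inner block is marked unreliable, and the window around the center pixel has to grow to radius 2 before it reaches the stable border pixels that can act as anchors.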
Pages: 1621-1630
Page count: 10