Self-Supervised Multi-view Stereo via Adjacent Geometry Guided Volume Completion

Cited by: 7
Authors
Xu, Luoyuan [1]
Guan, Tao [1]
Wang, Yuesong [1]
Luo, Yawei [2]
Chen, Zhuo [1]
Liu, Wenkai [1]
Yang, Wei [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Zhejiang Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Self-supervised learning; Multi-view stereo; Adjacent geometry guided inference; Cost volume completion;
DOI
10.1145/3503161.3547926
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Existing self-supervised multi-view stereo (MVS) approaches largely rely on photometric consistency for geometry inference, and hence suffer from low-texture or non-Lambertian appearances. In this paper, we observe that adjacent geometry shares certain commonality that can help to infer the correct geometry of challenging or low-confidence regions. Yet exploiting this property in a non-supervised MVS approach remains challenging owing to the lack of training data and the need to ensure consistency between views. To address these issues, we propose a novel geometry inference training scheme that selectively masks regions with rich textures, where geometry can be well recovered and used as a supervisory signal, and then trains a deliberately designed cost volume completion network to learn how to recover the geometry of the masked regions. During inference, we instead mask the low-confidence regions and use the cost volume completion network for geometry correction. To deal with the different depth hypotheses of the cost volume pyramid, we design a three-branch volume inference structure for the completion network. Further, by considering a plane as a special geometry, we first identify planar regions from pseudo labels and then correct the low-confidence pixels with high-confidence labels through plane normal consistency. Extensive experiments on DTU and Tanks & Temples demonstrate the effectiveness of the proposed framework and its state-of-the-art performance.
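Illustrative sketch (not from the paper): the abstract describes masking rich-texture regions during training and low-confidence regions during inference before a completion network fills in the cost volume. The NumPy snippet below shows only that masking step under simplifying assumptions. The function names, the gradient-magnitude texture measure, the thresholds, and zero-filling of masked entries are all hypothetical choices made for illustration; the paper's actual selection rule and network are not reproduced here.

```python
import numpy as np


def texture_score(gray: np.ndarray) -> np.ndarray:
    """Per-pixel gradient magnitude, used here as a crude texture measure (assumption)."""
    gy, gx = np.gradient(gray.astype(np.float64))
    return np.hypot(gx, gy)


def training_mask(gray: np.ndarray, texture_thresh: float) -> np.ndarray:
    """True at rich-texture pixels, which would be hidden from the network in training."""
    return texture_score(gray) > texture_thresh


def inference_mask(confidence: np.ndarray, conf_thresh: float) -> np.ndarray:
    """True at low-confidence pixels, which would be hidden at inference time."""
    return confidence < conf_thresh


def mask_cost_volume(cost_volume: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Zero the masked pixels across all depth hypotheses.

    cost_volume has shape (D, H, W); mask has shape (H, W).
    Zero-filling is an assumption made for this sketch.
    """
    return np.where(mask[None, :, :], 0.0, cost_volume)


if __name__ == "__main__":
    # A 4x4 image with a vertical step edge between columns 1 and 2.
    gray = np.zeros((4, 4))
    gray[:, 2:] = 10.0
    print(training_mask(gray, texture_thresh=1.0))

    confidence = np.array([[0.9, 0.2], [0.5, 0.1]])
    low_conf = inference_mask(confidence, conf_thresh=0.5)
    volume = np.ones((3, 2, 2))  # 3 depth hypotheses
    print(mask_cost_volume(volume, low_conf))
```

In this sketch the same `mask_cost_volume` helper serves both phases; only the mask source changes, which mirrors the training/inference switch the abstract describes.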
Pages: 2202-2210
Page count: 9
Related papers
50 in total
  • [21] Self-Supervised Graph Convolutional Network for Multi-View Clustering
    Xia, Wei
    Wang, Qianqian
    Gao, Quanxue
    Zhang, Xiangdong
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3182 - 3192
  • [22] Self-supervised Spatial Reasoning on Multi-View Line Drawings
    Xiang, Siyuan
    Yang, Anbang
    Xue, Yanfei
    Yang, Yaoqing
    Feng, Chen
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12735 - 12744
  • [23] Self-Supervised Multi-View Person Association and its Applications
    Vo, Minh
    Yumer, Ersin
    Sunkavalli, Kalyan
    Hadap, Sunil
    Sheikh, Yaser
    Narasimhan, Srinivasa G.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) : 2794 - 2808
  • [24] Self-supervised Multi-view Clustering for Unsupervised Image Segmentation
    Fang, Tiyu
    Liang, Zhen
    Shao, Xiuli
    Dong, Zihao
    Li, Jinping
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 113 - 125
  • [25] Multi-view Self-supervised Disentanglement for General Image Denoising
    Chen, Hao
    Qu, Chenyuan
    Zhang, Yu
    Chen, Chen
    Jiao, Jianbo
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12247 - 12257
  • [26] MVEB: Self-Supervised Learning With Multi-View Entropy Bottleneck
    Wen, Liangjian
    Wang, Xiasi
    Liu, Jianzhuang
    Xu, Zenglin
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6097 - 6108
  • [27] On the robustness of self-supervised representations for multi-view object classification
    Torpey, David
    Klein, Richard
    [J]. PATTERN RECOGNITION LETTERS, 2022, 161 : 82 - 89
  • [28] Large-scale aerial scene perception based on self-supervised multi-view stereo via cycled generative adversarial network
    Tong K.W.
    Shi Z.
    Zhu G.
    Duan Y.
    Hou Y.
    Wu E.Q.
    Zhu L.
    [J]. Information Fusion, 2024, 109
  • [29] Multi-view and multi-augmentation for self-supervised visual representation learning
    Tran, Van Nhiem
    Huang, Chi-En
    Liu, Shen-Hsuan
    Aslam, Muhammad Saqlain
    Yang, Kai-Lin
    Li, Yung-Hui
    Wang, Jia-Ching
    [J]. APPLIED INTELLIGENCE, 2024, 54 (01) : 629 - 656
  • [30] Multi-view and multi-augmentation for self-supervised visual representation learning
    Van Nhiem Tran
    Chi-En Huang
    Shen-Hsuan Liu
    Muhammad Saqlain Aslam
    Kai-Lin Yang
    Yung-Hui Li
    Jia-Ching Wang
    [J]. Applied Intelligence, 2024, 54 : 629 - 656