Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

被引:39
|
作者
Zhang, Jiawei [1 ]
Wang, Xiang [1 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Huang, Lei [2 ]
Chen, Yimin [1 ]
Gu, Lin [3 ,4 ]
Zhou, Jun [5 ]
Harada, Tatsuya [3 ,4 ]
Hancock, Edwin R. [1 ,6 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Jiangxi Res Inst, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, SKLSDE, Beijing, Peoples R China
[3] RIKEN AIP, Tokyo, Japan
[4] Univ Tokyo, Tokyo, Japan
[5] Griffith Univ, Nathan, Qld, Australia
[6] Univ York, York, N Yorkshire, England
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.01266
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks. The code is available online(1).
引用
收藏
页码:12991 / 13001
页数:11
相关论文
共 40 条
  • [1] CLIP4STEREO: REVISITING DOMAIN GENERALIZED STEREO MATCHING VIA CLIP
    Ma, Chihao
    Zeng, Pengcheng
    Zhai, Jucai
    Liu, Yang
    Zhao, Yong
    Wang, Xinan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 106 - 110
  • [2] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization
    Bartolomei, Luca
    Poggi, Matteo
    Conti, Andrea
    Tosi, Fabio
    Mattoccia, Stefano
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1360 - 1370
  • [3] Stereo matching from monocular images using feature consistency
    Lu, Zhongjian
    Chen, An
    Gao, Hongxia
    Zhang, Langwen
    Zhang, Congyu
    Yang, Yang
    IET IMAGE PROCESSING, 2024, 18 (10) : 2540 - 2552
  • [4] MFANet: Multi-feature Aggregation Network for Domain Generalized Stereo Matching
    Yang, Jinlong
    Wang, Gang
    Wu, Cheng
    Chen, Dong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 241 - 252
  • [5] IMFA-Stereo: Domain Generalized Stereo Matching via Iterative Multimodal Feature Aggregation Cost Volume
    Wang, Gang
    Yang, Jinlong
    Wu, Cheng
    Chen, Dong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 118 - 130
  • [6] Learning Representations from Foundation Models for Domain Generalized Stereo Matching
    Zhang, Yongjian
    Wang, Longguang
    Li, Kunhong
    Wang, Yun
    Guo, Yulan
    COMPUTER VISION - ECCV 2024, PT XLII, 2025, 15100 : 146 - 162
  • [7] Masked representation learning for domain generalized stereo matching
    Rao, Zhibo
    Xiong, Bangshu
    He, Mingyi
    Dai, Yuchao
    He, Renjie
    Shen, Zhelun
    Li, Xing
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5435 - 5444
  • [8] GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature
    Liu, Biyang
    Yu, Huimin
    Qi, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13002 - 13011
  • [9] Domain Generalized Stereo Matching via Hierarchical Visual Transformation
    Chang, Tianyu
    Yang, Xun
    Zhang, Tianzhu
    Wang, Meng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9559 - 9568
  • [10] UNDERWATER STEREO MATCHING VIA UNSUPERVISED APPEARANCE AND FEATURE ADAPTATION NETWORKS
    Zhong, Wei
    Yuan, Yazhi
    Ye, Xinchen
    Zheng, Dian
    Xu, Rui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2295 - 2299