Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

被引:39
|
作者
Zhang, Jiawei [1 ]
Wang, Xiang [1 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Huang, Lei [2 ]
Chen, Yimin [1 ]
Gu, Lin [3 ,4 ]
Zhou, Jun [5 ]
Harada, Tatsuya [3 ,4 ]
Hancock, Edwin R. [1 ,6 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Jiangxi Res Inst, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, SKLSDE, Beijing, Peoples R China
[3] RIKEN AIP, Tokyo, Japan
[4] Univ Tokyo, Tokyo, Japan
[5] Griffith Univ, Nathan, Qld, Australia
[6] Univ York, York, N Yorkshire, England
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.01266
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks. The code is available online(1).
引用
收藏
页码:12991 / 13001
页数:11
相关论文
共 40 条
  • [21] ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks
    Chuah, WeiQin
    Tennakoon, Ruwan
    Hoseinnezhad, Reza
    Bab-Hadiashar, Alireza
    Suter, David
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13012 - 13022
  • [22] Robust novel view synthesis from multi-view feature stereo matching priors
    Wang, Jianxin
    Shao, Haijian
    Deng, Xing
    Lian, Shuheng
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [23] Generalized Stereo Matching Method Based on Iterative Optimization of Hierarchical Graph Structure Consistency Cost for Urban 3D Reconstruction
    Yang, Shuting
    Chen, Hao
    Chen, Wen
    REMOTE SENSING, 2023, 15 (09)
  • [24] Gradient consistency strategy cooperative meta-feature learning for mixed domain generalized machine fault diagnosis
    Xie, Shushuai
    Cheng, Wei
    Xing, Ji
    Chen, Xuefeng
    Nie, Zelin
    Huang, Qian
    Zhang, Rongyong
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [25] Domain-Generalized EEG Classification With Category-Oriented Feature Decorrelation and Cross-View Consistency Learning
    Liang, Shuang
    Xuan, Changsheng
    Hang, Wenlong
    Lei, Baiying
    Wang, Jun
    Qin, Jing
    Choi, Kup-Sze
    Zhang, Yu
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 3285 - 3296
  • [26] Revisiting the generalized virial theorem and its applications from the perspective of contact and cosymplectic geometry
    Carinena, Jose F.
    Munoz-Lecanda, Miguel C.
    INTERNATIONAL JOURNAL OF GEOMETRIC METHODS IN MODERN PHYSICS, 2025, 22 (02)
  • [27] Semantic Feature-Based Test Selection for Deep Neural Networks: A Frequency Domain Perspective
    Jiang, Zhouxian
    Li, Honghui
    Tian, Xuctao
    Wang, Rui
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (04) : 1499 - 1522
  • [28] A multi-domain feature fusion epilepsy seizure detection method based on spike matching and PLV functional networks
    Fan, Qikai
    Jiang, Lurong
    El Gohary, Amira
    Dong, Fang
    Wu, Duanpo
    Jiang, Tiejia
    Wang, Chen
    Liu, Junbiao
    JOURNAL OF NEURAL ENGINEERING, 2025, 22 (01)
  • [29] Revisiting Topological Properties of Protein-Protein Interaction Networks from the Perspective of Dataset Evolution
    Shao, Mingyu
    Zhou, Shuigeng
    Guan, Jihong
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [30] REVISITING THE "PURE" OIL-EXCHANGE CO-MOVEMENT FROM A TIME-DOMAIN PERSPECTIVE
    Ma, Zhe
    Yang, Lu
    SINGAPORE ECONOMIC REVIEW, 2024, 69 (01): : 183 - 202