Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

被引:39
|
作者
Zhang, Jiawei [1 ]
Wang, Xiang [1 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Huang, Lei [2 ]
Chen, Yimin [1 ]
Gu, Lin [3 ,4 ]
Zhou, Jun [5 ]
Harada, Tatsuya [3 ,4 ]
Hancock, Edwin R. [1 ,6 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Jiangxi Res Inst, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, SKLSDE, Beijing, Peoples R China
[3] RIKEN AIP, Tokyo, Japan
[4] Univ Tokyo, Tokyo, Japan
[5] Griffith Univ, Nathan, Qld, Australia
[6] Univ York, York, N Yorkshire, England
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.01266
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks. The code is available online(1).
引用
收藏
页码:12991 / 13001
页数:11
相关论文
共 40 条
  • [31] Towards Generalized UAV Object Detection: A Novel Perspective from Frequency Domain Disentanglement
    Wang, Kunyu
    Fu, Xueyang
    Ge, Chengjie
    Cao, Chengzhi
    Zha, Zheng-Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (11) : 5410 - 5438
  • [32] Re-deriving data consistency condition in radon domain from perspective of Lie transformation group
    汤少杰
    TONG Kuan
    DUAN Jiayu
    LI Yang
    XU Qiong
    WU Junfeng
    QIAO Zhiwei
    CHEN Cheng
    CONG Zhoujian
    WANG Linghang
    牟轩沁
    中国体视学与图像分析, 2024, 29 (04) : 311 - 319
  • [33] Medical feature matching and model extraction from MRI/CT based on the Invariant Generalized Hough/Radon Transform
    Hlindzich, D.
    Maenner, R.
    4TH EUROPEAN CONFERENCE OF THE INTERNATIONAL FEDERATION FOR MEDICAL AND BIOLOGICAL ENGINEERING, 2009, 22 (1-3): : 608 - 612
  • [34] Revisiting topological properties and models of protein-protein interaction networks from the perspective of dataset evolution
    Shao, Mingyu
    Zhou, Shuigeng
    Guan, Jihong
    IET SYSTEMS BIOLOGY, 2015, 9 (04) : 113 - 119
  • [35] Generalized combined nonlinear adaptive filters: From the perspective of diffusion adaptation over networks
    Lu, Wenxia
    Zhang, Lijun
    Chen, Jie
    Chen, Jingdong
    SIGNAL PROCESSING, 2020, 172
  • [36] Flow field modeling of airfoil based on convolutional neural networks from transform domain perspective
    Hu, Jiawei
    Zhang, Weiwei
    AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 136
  • [37] Generalized Sparse Convolutional Neural Networks for Semantic Segmentation of Point Clouds Derived from Tri-Stereo Satellite Imagery
    Bachhofner, Stefan
    Loghin, Ana-Maria
    Otepka, Johannes
    Pfeifer, Norbert
    Hornacek, Michael
    Siposova, Andrea
    Schmidinger, Niklas
    Hornik, Kurt
    Schiller, Nikolaus
    Kaehler, Olaf
    Hochreiter, Ronald
    REMOTE SENSING, 2020, 12 (08)
  • [38] The Spatiotemporal Matching Relationship between Metro Networks and Urban Population from an Evolutionary Perspective: Passive Adaptation or Active Guidance?
    Lei, Kexin
    Hou, Quanhua
    Duan, Yaqiong
    Xi, Yafei
    Chen, Su
    Miao, Yitong
    Tong, Haiyan
    Hu, Ziye
    LAND, 2024, 13 (08)
  • [39] RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving
    Li, Peixuan
    Su, Shun
    Zhao, Huaici
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1930 - 1939
  • [40] Deep feature-domain matching for cardiac-related component separation from a chest electrical impedance tomography image series: proof-of-concept study
    Zhang, Ke
    Li, Maokun
    Liang, Haiqing
    Wang, Juan
    Yang, Fan
    Xu, Shenheng
    Abubakar, Aria
    PHYSIOLOGICAL MEASUREMENT, 2022, 43 (12)