Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective

被引:0
|
作者
Wang, Yun [1 ,2 ]
Wang, Longguang [3 ]
Li, Kunhong [4 ]
Zhang, Yongjian [4 ]
Wu, Dapeng Oliver [5 ]
Guo, Yulan [4 ]
机构
[1] Sun Yat Sen Univ SYSU, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Aviat Univ Air Force, Coll Elect Sci & Technol, Changchun 130022, Peoples R China
[4] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[5] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo matching; depth estimation; disparity classification; cost volume; NETWORK; DEPTH;
D O I
10.1109/TIP.2024.3484251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cost aggregation plays a critical role in existing stereo matching methods. In this paper, we revisit cost aggregation in stereo matching from disparity classification and propose a generic yet efficient Disparity Context Aggregation (DCA) module to improve the performance of CNN-based methods. Our approach is based on an insight that a coarse disparity class prior is beneficial to disparity regression. To obtain such a prior, we first classify pixels in an image into several disparity classes and treat pixels within the same class as homogeneous regions. We then generate homogeneous region representations and incorporate these representations into the cost volume to suppress irrelevant information while enhancing the matching ability for cost aggregation. With the help of homogeneous region representations, efficient and informative cost aggregation can be achieved with only a shallow 3D CNN. Our DCA module is fully-differentiable and well-compatible with different network architectures, which can be seamlessly plugged into existing networks to improve performance with small additional overheads. It is demonstrated that our DCA module can effectively exploit disparity class priors to improve the performance of cost aggregation. Based on our DCA, we design a highly accurate network named DCANet, which achieves state-of-the-art performance on several benchmarks.
引用
收藏
页码:6425 / 6438
页数:14
相关论文
共 50 条
  • [21] Sparse Cost Volume for Efficient Stereo Matching
    Lu, Chuanhua
    Uchiyama, Hideaki
    Thomas, Diego
    Shimada, Atsushi
    Taniguchi, Rin-ichiro
    REMOTE SENSING, 2018, 10 (11)
  • [22] Guided aggregation and disparity refinement for real-time stereo matching
    Yang, Jinlong
    Wu, Cheng
    Wang, Gang
    Chen, Dong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4467 - 4477
  • [23] Cross-Scale Cost Aggregation for Stereo Matching
    Zhang, Kang
    Fang, Yuqiang
    Min, Dongbo
    Sun, Lifeng
    Yang, Shiqiang
    Yan, Shuicheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (05) : 965 - 976
  • [24] Local Stereo Matching with Adaptive and Rapid Cost Aggregation
    Li, Li
    Zhang, Cai-Ming
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 185 - +
  • [25] Cross-Scale Cost Aggregation for Stereo Matching
    Zhang, Kang
    Fang, Yuqiang
    Min, Dongbo
    Sun, Lifeng
    Yang, Shiqiang
    Yan, Shuicheng
    Tian, Qi
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1590 - 1597
  • [26] Asymmetric cost aggregation network for efficient stereo matching
    Wu, Zhong
    Zhu, Hong
    He, Lili
    Wang, Dong
    Shi, Jing
    Wu, Wenhuan
    IET IMAGE PROCESSING, 2023, 17 (08) : 2450 - 2466
  • [27] Fusion of Gray Scale Cost Aggregation for Stereo Matching
    融合灰色尺度的代价聚合的立体匹配
    Yang, Hong-Yu (bchxjbc@163.com), 2018, Chinese Academy of Sciences (29):
  • [28] Cost aggregation and occlusion handling with WLS in stereo matching
    Min, Dongbo
    Sohn, Kwanghoon
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (08) : 1431 - 1442
  • [29] SVCV: segmentation volume combined with cost volume for stereo matching
    Zhu, Hongmei
    Yin, Jihao
    Yuan, Ding
    IET COMPUTER VISION, 2017, 11 (08) : 733 - 743
  • [30] Accurate Image-Guided Stereo Matching With Efficient Matching Cost and Disparity Refinement
    Zhan, Yunlong
    Gu, Yuzhang
    Huang, Kui
    Zhang, Cheng
    Hu, Keli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (09) : 1632 - 1645