Rethinking 3D cost aggregation in stereo matching

被引:6
|
作者
Gan, Wanshui [1 ,3 ]
Wu, Wenhao [2 ]
Chen, Shifeng [1 ]
Zhao, Yuxiang [1 ]
Wong, Pak Kin [3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Baidu Inc, Dept Comp vis Technol VIS, Beijing, Peoples R China
[3] Univ Macau, Dept Electromech Engn, Macau, Peoples R China
关键词
Stereo matching; Disparity estimation; Shift operation; 3D Convolution;
D O I
10.1016/j.patrec.2023.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the stereo matching task, the 3D convolution network can effectively aggregate the cost volume with the strong representation ability to model the spatial and depth dimensions but with the disadvantage of a high computational cost. In this letter, we revisit the 3D convolution network and its common variant, and then propose the Depth Shift Module (DSM) to model the cost volume in the depth dimension which could imitate the 3D convolution function with the computational complexity of the 2D convolution. The proposed DSM is easy to extend to present 3D cost aggregation methods in stereo matching with less inference time, lower computational complexity, and minor precision loss. Moreover, a novel compact but efficient stereo matching framework named HybridNet is proposed. This framework can hybridize the 2D convolution layer with the proposed DSM to effectively aggregate the cost volume. The proposed HybridNet achieves a better trade-off between the performance, computational complexity, and model size (e.g., 30% less than the size of AANet and 25% less than the size of PSMNet) in public open-source datasets (e.g., Scene Flow and KITTI Stereo 2015). The relevant code is available at https://github.com/ GANWANSHUI/HybridNet .(c) 2023 Published by Elsevier B.V.
引用
收藏
页码:75 / 81
页数:7
相关论文
共 50 条
  • [31] An effective stereo matching algorithm with Optimal Path Cost Aggregation
    Mozerov, Mikhail
    PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 617 - 626
  • [32] Deep self-guided cost aggregation for stereo matching
    Williem
    Park, In Kyu
    PATTERN RECOGNITION LETTERS, 2018, 112 : 168 - 175
  • [33] Loop-tree Method for Cost Aggregation in Stereo Matching
    Zhang, Qian
    Chen, Jun
    SECOND INTERNATIONAL CONFERENCE ON OPTICS AND IMAGE PROCESSING (ICOIP 2022), 2022, 12328
  • [34] Stereo Matching by Adaptive Weighting Selection Based Cost Aggregation
    Xu, Lingfeng
    Au, Oscar C.
    Sun, Wenxiu
    Fang, Lu
    Tang, Ketan
    Li, Jiali
    Guo, Yuanfang
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 1420 - 1423
  • [35] HIERARCHICAL AND MULTI-LEVEL COST AGGREGATION FOR STEREO MATCHING
    Guo, Wei
    Zhu, Ziyu
    Xia, Fukun
    Sun, Jiarui
    Zhao, Yong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2863 - 2867
  • [36] Cost Aggregation with Guided Image Filter and Superpixel for Stereo Matching
    Baek, Eu-Tteum
    Ho, Yo-Sung
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [37] 3D LUNAR CRATERS DETECTION BASED ON STEREO MATCHING
    Zhu, Hongmei
    Yin, Jihao
    Yuan, Ding
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 2333 - 2336
  • [38] A Stereo Matching based 3D Building Reconstruction Algorithm
    Cao, Yunyun
    Da, Feipeng
    Sui, Yihuan
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 201 - 205
  • [39] 3D shape recovery with registration assisted stereo matching
    Lin, Huei-Yung
    Liang, Sung-Chung
    Wu, Jing-Ren
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 596 - +
  • [40] A Stereo Matching based 3D Face Reconstruction Algorithm
    Fu, Youcheng
    Da, Feipeng
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 256 - 261