Rethinking 3D cost aggregation in stereo matching

被引:6
|
作者
Gan, Wanshui [1 ,3 ]
Wu, Wenhao [2 ]
Chen, Shifeng [1 ]
Zhao, Yuxiang [1 ]
Wong, Pak Kin [3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Baidu Inc, Dept Comp vis Technol VIS, Beijing, Peoples R China
[3] Univ Macau, Dept Electromech Engn, Macau, Peoples R China
关键词
Stereo matching; Disparity estimation; Shift operation; 3D Convolution;
D O I
10.1016/j.patrec.2023.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the stereo matching task, the 3D convolution network can effectively aggregate the cost volume with the strong representation ability to model the spatial and depth dimensions but with the disadvantage of a high computational cost. In this letter, we revisit the 3D convolution network and its common variant, and then propose the Depth Shift Module (DSM) to model the cost volume in the depth dimension which could imitate the 3D convolution function with the computational complexity of the 2D convolution. The proposed DSM is easy to extend to present 3D cost aggregation methods in stereo matching with less inference time, lower computational complexity, and minor precision loss. Moreover, a novel compact but efficient stereo matching framework named HybridNet is proposed. This framework can hybridize the 2D convolution layer with the proposed DSM to effectively aggregate the cost volume. The proposed HybridNet achieves a better trade-off between the performance, computational complexity, and model size (e.g., 30% less than the size of AANet and 25% less than the size of PSMNet) in public open-source datasets (e.g., Scene Flow and KITTI Stereo 2015). The relevant code is available at https://github.com/ GANWANSHUI/HybridNet .(c) 2023 Published by Elsevier B.V.
引用
收藏
页码:75 / 81
页数:7
相关论文
共 50 条
  • [41] Improvement of stereo matching algorithm for 3D surface reconstruction
    Hamzah, Rostam Affendi
    Kadmin, A. Fauzan
    Hamid, M. Saad
    Ghani, S. Fakhar A.
    Ibrahim, Haidi
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 65 : 165 - 172
  • [42] Adaptive stereo matching for 3D digitalization of toothless jaws
    Busch, M.
    Ruge, R.
    Kordass, B.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2007, 2 : S406 - S409
  • [43] 3D Shape Estimation Based on Sparsity in Stereo Matching
    Hirose, Naoto
    Yasunobe, Tatsuki
    Kawanaka, Akira
    ADVANCES IN VISUAL COMPUTING, PT II, 2013, 8034 : 562 - 571
  • [44] Correction Compensation and Adaptive Cost Aggregation for Deep Laparoscopic Stereo Matching
    Zhang, Jian
    Yang, Bo
    Zhao, Xuanchi
    Shi, Yi
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [45] Dense Feature Learning and Compact Cost Aggregation for Deep Stereo Matching
    Yin, Chenyang
    Zhi, Henghui
    Li, Huibin
    IEEE ACCESS, 2022, 10 : 100999 - 101010
  • [46] Dense Feature Learning and Compact Cost Aggregation for Deep Stereo Matching
    Yin, Chenyang
    Zhi, Henghui
    Li, Huibin
    IEEE Access, 2022, 10 : 100999 - 101010
  • [47] ITERATIVE COLOR-DEPTH MST COST AGGREGATION FOR STEREO MATCHING
    Yao, Peng
    Zhang, Hua
    Xue, Yanbing
    Zhou, Mian
    Xu, Guangping
    Gao, Zan
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [48] Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective
    Wang, Yun
    Wang, Longguang
    Li, Kunhong
    Zhang, Yongjian
    Wu, Dapeng Oliver
    Guo, Yulan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6425 - 6438
  • [49] COST AGGREGATION WITH ANISOTROPIC DIFFUSION IN FEATURE SPACE FOR HYBRID STEREO MATCHING
    Ham, Bumsub
    Min, Dongbo
    Sohn, Kwanghoon
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [50] Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching
    Li, Pengxiang
    Yao, Chengtang
    Jia, Yunde
    Wu, Yuwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 134 - 147