Rethinking 3D cost aggregation in stereo matching

被引:6
|
作者
Gan, Wanshui [1 ,3 ]
Wu, Wenhao [2 ]
Chen, Shifeng [1 ]
Zhao, Yuxiang [1 ]
Wong, Pak Kin [3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Baidu Inc, Dept Comp vis Technol VIS, Beijing, Peoples R China
[3] Univ Macau, Dept Electromech Engn, Macau, Peoples R China
关键词
Stereo matching; Disparity estimation; Shift operation; 3D Convolution;
D O I
10.1016/j.patrec.2023.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the stereo matching task, the 3D convolution network can effectively aggregate the cost volume with the strong representation ability to model the spatial and depth dimensions but with the disadvantage of a high computational cost. In this letter, we revisit the 3D convolution network and its common variant, and then propose the Depth Shift Module (DSM) to model the cost volume in the depth dimension which could imitate the 3D convolution function with the computational complexity of the 2D convolution. The proposed DSM is easy to extend to present 3D cost aggregation methods in stereo matching with less inference time, lower computational complexity, and minor precision loss. Moreover, a novel compact but efficient stereo matching framework named HybridNet is proposed. This framework can hybridize the 2D convolution layer with the proposed DSM to effectively aggregate the cost volume. The proposed HybridNet achieves a better trade-off between the performance, computational complexity, and model size (e.g., 30% less than the size of AANet and 25% less than the size of PSMNet) in public open-source datasets (e.g., Scene Flow and KITTI Stereo 2015). The relevant code is available at https://github.com/ GANWANSHUI/HybridNet .(c) 2023 Published by Elsevier B.V.
引用
收藏
页码:75 / 81
页数:7
相关论文
共 50 条
  • [21] 3D Reconstruction Cost Function Algorithm Based on Stereo Matching in the Background of Digital Museums
    Peng, Peng
    Han, Jun
    IEEE ACCESS, 2023, 11 : 123705 - 123716
  • [22] A Miniature Binocular Endoscope with Local Feature Matching and Stereo Matching for 3D Measurement and 3D Reconstruction
    Wang, Di
    Liu, Hua
    Cheng, Xiang
    SENSORS, 2018, 18 (07)
  • [23] Segment-Tree based Cost Aggregation for Stereo Matching
    Mei, Xing
    Sun, Xun
    Dong, Weiming
    Wang, Haitao
    Zhang, Xiaopeng
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 313 - 320
  • [24] A Non-Local Cost Aggregation Method for Stereo Matching
    Yang, Qingxiong
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1402 - 1409
  • [25] Accelerating Cost Aggregation for Real-Time Stereo Matching
    Fang, Jianbin
    Varbanescu, Ana Lucia
    Shen, Jie
    Sips, Henk
    Saygili, Gorkem
    van der Maaten, Laurens
    PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 472 - 481
  • [26] Simplified High-Performance Cost Aggregation for Stereo Matching
    Zhu, Chengtao
    Chang, Yau-Zen
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [27] Binocular stereo matching algorithm based on MST cost aggregation
    Zhang, Jian
    Zhang, Yan
    Wang, Cong
    Yu, Huilong
    Qin, Cui
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 3215 - 3226
  • [28] Fast hierarchical cost volume aggregation for stereo-matching
    Smirnov, Sergey
    Gotchev, Atanas
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 498 - 501
  • [29] Joint Histogram-Based Cost Aggregation for Stereo Matching
    Min, Dongbo
    Lu, Jiangbo
    Do, Minh N.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (10) : 2539 - 2545
  • [30] Spatial-Tree Filter for Cost Aggregation in Stereo Matching
    Jin, Yusheng
    Zhao, Hong
    Bu, Penghui
    IET IMAGE PROCESSING, 2021, 15 (10) : 2135 - 2145