LMNet: A learnable multi-scale cost volume for stereo matching

Cited by: 0
Authors
Liu, Jiatao [1]
Zhang, Yaping [2]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China
Keywords
Deep learning; Stereo matching; Ill-posed regions; Learnable cost volume; Multi-scale cost volumes
DOI
10.1016/j.image.2024.117169
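Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract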
Computing disparities through stereo matching is an important step in many machine vision tasks in robotics and similar applications. Stereo matching with deep neural networks requires constructing a matching cost volume. However, occluded, non-textured, and reflective regions are ill-posed and cannot be matched directly. Previous studies have typically computed matching costs directly from single-scale feature maps, which makes it difficult to predict disparity in ill-posed regions. We therefore propose a learnable multi-scale matching cost calculation method (LMNet) to improve the accuracy of stereo matching. The learned matching cost can reasonably estimate the disparity of regions that are conventionally difficult to match. Because the receptive field of the convolution kernels is limited, multi-level 3D dilated convolutions over multi-scale features are introduced when constructing the cost volumes. Experimental results show that the proposed method achieves significant improvement in ill-posed regions. Compared with the classical architecture GwcNet, the End-Point-Error (EPE) of the proposed method on the Scene Flow dataset is reduced by 16.46%. The number of parameters and the amount of computation are also reduced by 8.71% and 20.05%, respectively. The model code and pre-trained parameters are available at: https://github.com/jt-liu/LMNet.
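For orientation, the "direct calculation" on single-scale feature maps that the abstract contrasts with can be sketched as a plain correlation cost volume, together with the EPE metric it reports. This is only an illustrative NumPy sketch, not LMNet's learned multi-scale volume: the function names `correlation_cost_volume` and `end_point_error`, the random toy features, and the fixed 3-pixel shift are all assumptions made for the example.

```python
import numpy as np


def correlation_cost_volume(left, right, max_disp):
    """Direct single-scale correlation volume (hypothetical helper).

    left, right: feature maps of shape (C, H, W).
    Returns an array of shape (max_disp, H, W) whose entry [d, y, x] is the
    channel-wise mean of left[:, y, x] * right[:, y, x - d]; positions with
    x - d < 0 have no match candidate and stay at zero.
    """
    _, h, w = left.shape
    volume = np.zeros((max_disp, h, w), dtype=left.dtype)
    for d in range(max_disp):
        if d == 0:
            volume[d] = (left * right).mean(axis=0)
        else:
            volume[d, :, d:] = (left[:, :, d:] * right[:, :, :-d]).mean(axis=0)
    return volume


def end_point_error(pred, gt, valid=None):
    """Mean absolute disparity error over the valid pixels."""
    if valid is None:
        valid = np.ones(gt.shape, dtype=bool)
    return float(np.abs(pred - gt)[valid].mean())


# Toy scene (assumption): the right features are the left features shifted
# by a constant disparity of 3 pixels, so left[x] matches right[x - 3].
rng = np.random.default_rng(0)
left = rng.standard_normal((256, 4, 32)).astype(np.float32)
true_disp = 3
right = np.zeros_like(left)
right[:, :, :-true_disp] = left[:, :, true_disp:]

volume = correlation_cost_volume(left, right, max_disp=16)
pred = volume.argmax(axis=0).astype(np.float32)  # winner-take-all disparity

gt = np.full(pred.shape, true_disp, dtype=np.float32)
valid = np.zeros(pred.shape, dtype=bool)
valid[:, true_disp:] = True  # the first columns have no counterpart in the right view
print("EPE on valid pixels:", end_point_error(pred, gt, valid))
```

The masked-out columns are a small stand-in for the ill-posed case: where the true match is missing, a direct correlation has nothing meaningful to score, which is the gap a learned cost is meant to address.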
Pages: 10
Related papers
50 in total
  • [31] MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching
    Rao, Zhibo
    He, Mingyi
    Dai, Yuchao
    Zhu, Zhidong
    Li, Bo
    He, Renjie
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 578 - 583
  • [32] Multi-scale inputs and context-aware aggregation network for stereo matching
    Shi, Liqing
    Xiong, Taiping
    Cui, Gengshen
    Pan, Minghua
    Cheng, Nuo
    Wu, Xiangjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75171 - 75194
  • [33] Lightweight multi-scale convolutional neural network for real time stereo matching
    Xue, Yanbing
    Zhang, Doudou
    Li, Leida
    Li, Shiyin
    Wang, Yuxin
    IMAGE AND VISION COMPUTING, 2022, 124
  • [34] Stereo Matching Algorithm Based on Improved Census Transform and Multi-Scale Space
    Liu, Jian-Guo
    Yu, Li
    Liu, Si-Jian
    Wang, Shuai-Shuai
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2017, 45 (12) : 43 - 49
  • [35] CGFNet: 3D Convolution Guided and Multi-scale Volume Fusion Network for fast and robust stereo matching
    Wang, Qingyu
    Xing, Hao
    Ying, Yibin
    Zhou, Mingchuan
    PATTERN RECOGNITION LETTERS, 2023, 173 : 38 - 44
  • [36] Stereo Matching With Multiscale Hybrid Cost Volume
    Li, Minhua
    Chang, Qingling
    Wang, Yuhan
    Liu, Xinglin
    Xu, Shiting
    Cui, Yan
    IEEE ACCESS, 2022, 10 : 100128 - 100136
  • [37] Superpixel Cost Volume Excitation for Stereo Matching
    Liu, Shanglong
    Qi, Lin
    Dong, Junyu
    Gu, Wenxiang
    Xu, Liyi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 18 - 31
  • [38] Sparse Cost Volume for Efficient Stereo Matching
    Lu, Chuanhua
    Uchiyama, Hideaki
    Thomas, Diego
    Shimada, Atsushi
    Taniguchi, Rin-ichiro
    REMOTE SENSING, 2018, 10 (11)
  • [39] SVCV: segmentation volume combined with cost volume for stereo matching
    Zhu, Hongmei
    Yin, Jihao
    Yuan, Ding
    IET COMPUTER VISION, 2017, 11 (08) : 733 - 743
  • [40] Multi-Scale Keypoint Matching
    Lotfian, Sina
    Foroosh, Hassan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5168 - 5175