LMNet: A learnable multi-scale cost volume for stereo matching

被引:0
|
作者
Liu, Jiatao [1 ]
Zhang, Yaping [2 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China
关键词
Deep learning; Stereo matching; Ill-posed regions; Learnable cost volume; Multi-scale cost volumes;
D O I
10.1016/j.image.2024.117169
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Calculating disparities through stereo matching is an important step in a variety of machine vision tasks used for robotics and similar applications. The use of deep neural networks for stereo matching requires the construction of a matching cost volume. However, the occluded, non-textured, and reflective regions are ill- posed, which cannot be directly matched. In previous studies, a direct calculation has typically been used to measure matching costs for single-scale feature maps, which makes it difficult to predict disparity for ill-posed regions. Thus, we propose a learnable multi-scale matching cost calculation method (LMNet) to improve the accuracy of stereo matching. This learned matching cost can reasonably estimate the disparity of the regions that are conventionally difficult to match. Multi-level 3D dilation convolutions for multi-scale features are introduced during constructing cost volumes because the receptive field of the convolution kernels is limited. The experimental results show that the proposed method achieves significant improvement in ill-posed regions. Compared with the classical architecture GwcNet, End-Point-Error (EPE) of the proposed method on the Scene Flow dataset is reduced by 16.46%. The number of parameters and required calculations are also reduced by 8.71% and 20.05%, respectively. The proposed model code and pre-training parameters are available at: https://github.com/jt-liu/LMNet.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Multi-Scale Context Attention Network for Stereo Matching
    Sang, Haiwei
    Wang, Quanhong
    Zhao, Yong
    IEEE ACCESS, 2019, 7 : 15152 - 15161
  • [12] Multi-Scale Dense Attention Network for Stereo Matching
    Chang, Yuhui
    Xu, Jiangtao
    Gao, Zhiyuan
    ELECTRONICS, 2020, 9 (11) : 1 - 12
  • [13] MPANET: MULTI-SCALE PYRAMID AGGREGATION NETWORK FOR STEREO MATCHING
    Zhu, Ziyu
    Guo, Wei
    Chen, Wei
    Li, Qiuping
    Zhao, Yong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2773 - 2777
  • [14] Multi-scale Slanted O(1) Stereo Matching Algorithm
    Wang, Hongyu
    Gao, Shengyu
    Wang, Teng
    Wang, Yang
    Lou, Xin
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [15] Multi-scale graph neural network for global stereo matching
    Wang, Xiaofeng
    Yu, Jun
    Sun, Zhiheng
    Sun, Jiameng
    Su, Yingying
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [16] Multi-Scale Binocular Stereo Matching Based on Semantic Association
    Zheng, Jin
    Jiang, Botao
    Peng, Wei
    Zhang, Qiaohui
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (04) : 1010 - 1022
  • [17] Multi-Scale Binocular Stereo Matching Based on Semantic Association
    Jin ZHENG
    Botao JIANG
    Wei PENG
    Qiaohui ZHANG
    Chinese Journal of Electronics, 2024, 33 (04) : 1010 - 1022
  • [18] Improved stereo matching algorithm based on multi-scale fusion
    Chen, Xing
    Zhang, Wenhai
    Hou, Yu
    Yang, Lin
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2021, 39 (04): : 876 - 882
  • [19] Image Stereo Matching Based on Multi-scale Plane set
    Jiang, Xin-hui
    Yu, Shao-jun
    Jiang, Xing
    ADVANCES IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY, 2013, 709 : 527 - 533
  • [20] Multi-scale Edge Extraction Based Stereo Matching Algorithm
    Lian, Jing
    Li, Linhui
    Shen, Xiaoyong
    Hao, Xianpeng
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 4162 - +