LMNet: A learnable multi-scale cost volume for stereo matching

被引:0
|
作者
Liu, Jiatao [1 ]
Zhang, Yaping [2 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China
关键词
Deep learning; Stereo matching; Ill-posed regions; Learnable cost volume; Multi-scale cost volumes;
D O I
10.1016/j.image.2024.117169
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Calculating disparities through stereo matching is an important step in a variety of machine vision tasks used for robotics and similar applications. The use of deep neural networks for stereo matching requires the construction of a matching cost volume. However, the occluded, non-textured, and reflective regions are ill- posed, which cannot be directly matched. In previous studies, a direct calculation has typically been used to measure matching costs for single-scale feature maps, which makes it difficult to predict disparity for ill-posed regions. Thus, we propose a learnable multi-scale matching cost calculation method (LMNet) to improve the accuracy of stereo matching. This learned matching cost can reasonably estimate the disparity of the regions that are conventionally difficult to match. Multi-level 3D dilation convolutions for multi-scale features are introduced during constructing cost volumes because the receptive field of the convolution kernels is limited. The experimental results show that the proposed method achieves significant improvement in ill-posed regions. Compared with the classical architecture GwcNet, End-Point-Error (EPE) of the proposed method on the Scene Flow dataset is reduced by 16.46%. The number of parameters and required calculations are also reduced by 8.71% and 20.05%, respectively. The proposed model code and pre-training parameters are available at: https://github.com/jt-liu/LMNet.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Multi-Dimensional Attention on Cost Volume for Stereo Matching
    Zhou Jiale
    Huang, Wenqin
    Liao, Qingmin
    Lu, Zongqing
    Liu, Xiaoqian
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [22] Multi-View Stereo with Learnable Cost Metric
    Yang, Guidong
    Zhou, Xunkuai
    Gao, Chuanxiang
    Zhao, Benyun
    Zhang, Jihan
    Chen, Yizhou
    Chen, Xi
    Chen, Ben M.
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3017 - 3024
  • [23] Parameterized Cost Volume for Stereo Matching
    Zeng, Jiaxi
    Yao, Chengtang
    Yu, Lidong
    Wu, Yuwei
    Jia, Yunde
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18301 - 18311
  • [24] Novel Stereo Matching Method on Multi-scale Harris Corner Points
    Fan Tiesheng
    Niu Bing
    Wang Qingsong
    Wang Tao
    Qu Dapeng
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 167 - 170
  • [25] Multi-scale Cross-form Pyramid Network for Stereo Matching
    Zhu, Zhidong
    He, Mingyi
    Dai, Yuchao
    Rao, Zhibo
    Li, Bo
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 1789 - 1794
  • [26] Self-adaptive Multi-scale Aggregation Network for Stereo Matching
    Li, Pengfei
    Ye, Shuiqiang
    Zhang, Jiaquan
    Wang Xinan
    Dai, Qifei
    Yu, Zhengzhong
    Li, Fuchi
    Zhao, Yong
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3794 - 3800
  • [27] Cascaded Multi-scale and Multi-dimension Convolutional Neural Network for Stereo Matching
    Lu, Haihua
    Xu, Hai
    Zhang, Li
    Ma, Yanbo
    Zhao, Yong
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [28] Stereo matching based on multi-scale fusion and multi-type support regions
    Li, Haibin
    Gao, Yakun
    Huang, Ziyue
    Zhang, Yakun
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2019, 36 (09) : 1523 - 1533
  • [29] Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching
    Gu, Xiaodong
    Fan, Zhiwen
    Zhu, Siyu
    Dai, Zuozhuo
    Tan, Feitong
    Tan, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2492 - 2501
  • [30] Multi-Scale Fusion Stereo Matching Algorithm Based on Adaptive Texture Region
    Chen, Yi
    Yu, Jiyan
    Yu, Hongsen
    Computer Engineering and Applications, 2023, 59 (18) : 198 - 206