LMNet: A learnable multi-scale cost volume for stereo matching

Cited by: 0

Authors:

Liu, Jiatao [1]

Zhang, Yaping [2]

Affiliations:

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China

[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2024 / Vol. 128

Keywords:

Deep learning; Stereo matching; Ill-posed regions; Learnable cost volume; Multi-scale cost volumes

DOI:

10.1016/j.image.2024.117169

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Calculating disparities through stereo matching is an important step in a variety of machine vision tasks used for robotics and similar applications. The use of deep neural networks for stereo matching requires the construction of a matching cost volume. However, occluded, non-textured, and reflective regions are ill-posed and cannot be matched directly. Previous studies have typically computed matching costs directly on single-scale feature maps, which makes it difficult to predict disparity in ill-posed regions. We therefore propose a learnable multi-scale matching cost calculation method (LMNet) to improve the accuracy of stereo matching. The learned matching cost can reasonably estimate the disparity of regions that are conventionally difficult to match. Because the receptive field of the convolution kernels is limited, multi-level 3D dilated convolutions over multi-scale features are introduced when constructing the cost volumes. Experimental results show that the proposed method achieves significant improvement in ill-posed regions. Compared with the classical architecture GwcNet, the End-Point-Error (EPE) of the proposed method on the Scene Flow dataset is reduced by 16.46%. The number of parameters and the required computation are also reduced by 8.71% and 20.05%, respectively. The model code and pre-trained parameters are available at: https://github.com/jt-liu/LMNet.
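
The abstract reports results as an End-Point-Error (EPE) and as relative reductions against a baseline. As a minimal sketch, assuming the common definition of EPE as the mean absolute difference between predicted and ground-truth disparity over valid pixels (the abstract does not state the exact evaluation protocol), the two quantities can be computed as follows. The function names `end_point_error` and `relative_reduction` are hypothetical and are not taken from the LMNet repository.

```python
def end_point_error(pred, gt, valid=None):
    """Mean absolute disparity error over valid pixels.

    pred, gt: flat sequences of disparities (assumed layout, one value per pixel).
    valid: optional flat sequence of booleans marking pixels with ground truth.
    """
    if valid is None:
        valid = [True] * len(gt)
    errors = [abs(p - g) for p, g, v in zip(pred, gt, valid) if v]
    if not errors:
        raise ValueError("no valid pixels to evaluate")
    return sum(errors) / len(errors)


def relative_reduction(baseline, proposed):
    """Percentage by which `proposed` is lower than `baseline`."""
    return 100.0 * (baseline - proposed) / baseline
```

A relative reduction is measured against the baseline value, so a figure such as 16.46% describes the drop in EPE as a fraction of the baseline's EPE, not an absolute difference in pixels.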

Pages: 10
