A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement

Cited by: 4

Authors:

Yang, Xiaowei [1,2]

Zhao, Yong [3]

Feng, Zhiguo [1,7]

Sang, Haiwei [4]

Zhang, Zhenbo [1]

Zhang, Guiying [5]

He, Lin [1,6]

Affiliations:

[1] Guizhou Univ, Sch Mech Engn, Guiyang, Peoples R China

[2] Guizhou Acad Agr Sci, Guizhou Tea Res Inst, Guiyang, Peoples R China

[3] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen, Peoples R China

[4] Guizhou Educ Univ, Sch Math & Big Data, Guiyang, Peoples R China

[5] Guangzhou Med Univ, Qingyuan Peoples Hosp, Affiliated Hosp 6, Sch Biomed Engn, Guangzhou, Peoples R China

[6] Liupanshui Normal Coll, Sch Min & Civil Engn, Liupanshui, Peoples R China

[7] Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R China

Source:

IET IMAGE PROCESSING | 2023 / Vol. 17 / Issue 06

Keywords:

computer vision; image processing; stereo image processing; ATTENTION NETWORK; DEPTH; NET;

DOI:

10.1049/ipr2.12756

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, convolutional-neural-network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state-of-the-art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end-to-end light-weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi-scale feature fusion for feature extraction, a 3D U-Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi-resolution information is adaptively incorporated into the cost volume via cross-scale connections. Further, a left-right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state-of-the-art stereo matching methods.
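The abstract mentions a left-right consistency check but does not say how LWNet implements it. As a rough illustration only, the generic form of such a check can be sketched in NumPy: a left-image pixel at column x with disparity d should land on a right-image pixel at column x - d whose own disparity is close to d. The function name `left_right_consistency_mask` and the `threshold` parameter are hypothetical and are not taken from the paper.

```python
import numpy as np


def left_right_consistency_mask(disp_left, disp_right, threshold=1.0):
    """Return a boolean mask that is True where the left disparity agrees
    with the right disparity at the matched pixel.

    Generic check, not the paper's implementation; `threshold` (in pixels)
    is an assumed tolerance.
    """
    h, w = disp_left.shape
    # Row and column index of every pixel in the left image.
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")

    # A left pixel at column x with disparity d matches column x - d on the right.
    x_right = np.rint(xs - disp_left).astype(int)

    # Pixels whose match falls outside the right image cannot be verified.
    in_bounds = (x_right >= 0) & (x_right < w)
    x_clamped = np.clip(x_right, 0, w - 1)

    # Right-image disparity sampled at the matched location.
    disp_right_at_match = disp_right[ys, x_clamped]

    consistent = np.abs(disp_left - disp_right_at_match) <= threshold
    return in_bounds & consistent


if __name__ == "__main__":
    left = np.full((4, 8), 2.0)
    right = np.full((4, 8), 2.0)
    left[1, 5] = 5.0  # an inconsistent estimate
    mask = left_right_consistency_mask(left, right)
    print(mask.astype(int))
```

Pixels that fail a check like this are typically occluded or mismatched, which makes the mask a plausible cue for a refinement stage. How LWNet actually consumes the result is not described in this record.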


Pages: 1797-1811

Number of pages: 15

