A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement

被引:4
|
作者
Yang, Xiaowei [1 ,2 ]
Zhao, Yong [3 ]
Feng, Zhiguo [1 ,7 ]
Sang, Haiwei [4 ]
Zhang, Zhenbo [1 ]
Zhang, Guiying [5 ]
He, Lin [1 ,6 ]
机构
[1] Guizhou Univ, Sch Mech Engn, Guiyang, Peoples R China
[2] Guizhou Acad Agr Sci, Guizhou Tea Res Inst, Guiyang, Peoples R China
[3] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen, Peoples R China
[4] Guizhou Educ Univ, Sch Math & Big Data, Guiyang, Peoples R China
[5] Guangzhou Med Univ, Qingyuan Peoples Hosp, Affiliated Hosp 6, Sch Biomed Engn, Guangzhou, Peoples R China
[6] Liupanshui Normal Coll, Sch Min & Civil Engn, Liupanshui, Peoples R China
[7] Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R China
关键词
computer vision; image processing; stereo image processing; ATTENTION NETWORK; DEPTH; NET;
D O I
10.1049/ipr2.12756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, convolutional-neural-network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state-of-the-art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end-to-end light-weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi-scale feature fusion for feature extraction, a 3D U-Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi-resolution information is adaptively incorporated into the cost volume via cross-scale connections. Further, a left-right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state-of-the-art stereo matching methods.
引用
收藏
页码:1797 / 1811
页数:15
相关论文
共 50 条
  • [1] MULTI-SCALE CASCADE DISPARITY REFINEMENT STEREO NETWORK
    Jia, Xiaogang
    Chen, Wei
    Liang, Zhengfa
    Luo, Xin
    Wu, Mingfei
    Tan, Yusong
    Huang, Libo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2110 - 2114
  • [2] Improved stereo matching algorithm based on multi-scale fusion
    Chen, Xing
    Zhang, Wenhai
    Hou, Yu
    Yang, Lin
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2021, 39 (04): : 876 - 882
  • [3] Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network
    Liu, Zhenguo
    Li, Zhao
    Ao, Wengang
    Zhang, Shaoshuang
    Liu, Wenlong
    He, Yizhi
    ELECTRONICS, 2023, 12 (07)
  • [4] A Light-weight stereo matching network for an embedded vision system
    Kang, Jo-In
    Lee, Seong-Won
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1234 - 1237
  • [5] LADet:A Light-weight and Adaptive Network for Multi-scale Object Detection
    Zhou, Jiaming
    Tian, Yuqiao
    Li, Weicheng
    Wang, Rui
    Luan, Zhongzhi
    Qian, Depei
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 912 - 923
  • [6] Multi-Scale Dense Attention Network for Stereo Matching
    Chang, Yuhui
    Xu, Jiangtao
    Gao, Zhiyuan
    ELECTRONICS, 2020, 9 (11) : 1 - 12
  • [7] Multi-Scale Context Attention Network for Stereo Matching
    Sang, Haiwei
    Wang, Quanhong
    Zhao, Yong
    IEEE ACCESS, 2019, 7 : 15152 - 15161
  • [8] Stereo matching based on multi-scale fusion and multi-type support regions
    Li, Haibin
    Gao, Yakun
    Huang, Ziyue
    Zhang, Yakun
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2019, 36 (09) : 1523 - 1533
  • [9] Multi-Scale Fusion Stereo Matching Algorithm Based on Adaptive Texture Region
    Chen, Yi
    Yu, Jiyan
    Yu, Hongsen
    Computer Engineering and Applications, 2023, 59 (18) : 198 - 206
  • [10] CGFNet: 3D Convolution Guided and Multi-scale Volume Fusion Network for fast and robust stereo matching
    Wang, Qingyu
    Xing, Hao
    Ying, Yibin
    Zhou, Mingchuan
    PATTERN RECOGNITION LETTERS, 2023, 173 : 38 - 44