A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement

被引:4
|
作者
Yang, Xiaowei [1 ,2 ]
Zhao, Yong [3 ]
Feng, Zhiguo [1 ,7 ]
Sang, Haiwei [4 ]
Zhang, Zhenbo [1 ]
Zhang, Guiying [5 ]
He, Lin [1 ,6 ]
机构
[1] Guizhou Univ, Sch Mech Engn, Guiyang, Peoples R China
[2] Guizhou Acad Agr Sci, Guizhou Tea Res Inst, Guiyang, Peoples R China
[3] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen, Peoples R China
[4] Guizhou Educ Univ, Sch Math & Big Data, Guiyang, Peoples R China
[5] Guangzhou Med Univ, Qingyuan Peoples Hosp, Affiliated Hosp 6, Sch Biomed Engn, Guangzhou, Peoples R China
[6] Liupanshui Normal Coll, Sch Min & Civil Engn, Liupanshui, Peoples R China
[7] Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R China
关键词
computer vision; image processing; stereo image processing; ATTENTION NETWORK; DEPTH; NET;
D O I
10.1049/ipr2.12756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, convolutional-neural-network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state-of-the-art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end-to-end light-weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi-scale feature fusion for feature extraction, a 3D U-Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi-resolution information is adaptively incorporated into the cost volume via cross-scale connections. Further, a left-right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state-of-the-art stereo matching methods.
引用
收藏
页码:1797 / 1811
页数:15
相关论文
共 50 条
  • [21] Multi-Scale Binocular Stereo Matching Based on Semantic Association
    Jin ZHENG
    Botao JIANG
    Wei PENG
    Qiaohui ZHANG
    Chinese Journal of Electronics, 2024, 33 (04) : 1010 - 1022
  • [22] Multi-Scale Binocular Stereo Matching Based on Semantic Association
    Zheng, Jin
    Jiang, Botao
    Peng, Wei
    Zhang, Qiaohui
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (04) : 1010 - 1022
  • [23] Image Stereo Matching Based on Multi-scale Plane set
    Jiang, Xin-hui
    Yu, Shao-jun
    Jiang, Xing
    ADVANCES IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY, 2013, 709 : 527 - 533
  • [24] Multi-scale Edge Extraction Based Stereo Matching Algorithm
    Lian, Jing
    Li, Linhui
    Shen, Xiaoyong
    Hao, Xianpeng
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 4162 - +
  • [25] Stereo Matching with Multi-scale Based on Anisotropic Match Cost
    Liu, Huaiguang
    Cai, Yu
    Zhou, Shiyang
    Yang, Jintang
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (24):
  • [26] MA-Stereo: Real-Time Stereo Matching via Multi-Scale Attention Fusion and Spatial Error-Aware Refinement
    Gao, Wei
    Cai, Yongjie
    Akoudad, Youssef
    Yang, Yang
    Chen, Jie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9954 - 9961
  • [27] Low Light Image Enhancement Based on Multi-Scale Network Fusion
    Liu, Xuan
    Zhang, Chenfeng
    Wang, Yingzhi
    Ding, Kai
    Han, Tailin
    Liu, Hong
    Tian, Yu
    Xu, Bo
    Ju, Mingchi
    IEEE ACCESS, 2022, 10 : 127853 - 127862
  • [28] A Light-Weight Deep-Learning Model with Multi-Scale Features for Steel Surface Defect Classification
    Liu, Yang
    Yuan, Yachao
    Balta, Cristhian
    Liu, Jing
    MATERIALS, 2020, 13 (20) : 1 - 13
  • [29] Cascaded Multi-scale and Multi-dimension Convolutional Neural Network for Stereo Matching
    Lu, Haihua
    Xu, Hai
    Zhang, Li
    Ma, Yanbo
    Zhao, Yong
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [30] Coarse-to-Fine Stereo Matching Network Based on Multi-Scale Structural Information Filtrating
    Bi, Yuanwei
    Li, Chuanbiao
    Zheng, Qiang
    Wang, Guohui
    Xu, Shidong
    Wang, Weiyuan
    IEEE ACCESS, 2023, 11 : 83692 - 83702