Improved stereo matching framework with embedded multilevel attention

Cited by: 2
Authors
Li, Bohan [1]
Du, Juan [1]
Okae, James [2]
Affiliations
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
[2] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Peoples R China
Keywords
stereo matching; multilevel attention; global coherence; disparity optimization; deep learning
DOI
10.1117/1.JEI.31.3.033037
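Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract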
The recent advent of deep convolutional neural networks (CNNs) in stereo matching has led to significant improvements. However, current CNN methods still struggle to incorporate hierarchical context information with global dependencies, and their feature representations lack the discriminative ability needed to resolve matching ambiguities in ill-conditioned regions. To address these problems, we propose an improved stereo matching framework that joins a stereo backbone network and an embedded independent multilevel attention subnetwork in an end-to-end trainable pipeline. The stereo backbone network applies residual atrous spatial pyramid pooling integrated with channelwise attention to capture richer multiscale contextual information and selectively enhance discriminative features. This is followed by unary feature concatenation to construct a cost volume for disparity prediction. To further improve performance, the embedded multilevel attention subnetwork learns globally coherent contextual information to generate three attention streams, which are used to boost the unary feature representations with spatial encoding, enhance the quality of the cost volume, and refine the disparity map, respectively. We show that appending the proposed multilevel attention subnetwork to the stereo backbone network produces significant improvements in matching accuracy. The experimental results on Scene Flow and KITTI 2012/2015 demonstrate that our method achieves competitive performance in stereo matching. © 2022 SPIE and IS&T
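The step the abstract calls unary feature concatenation to construct a cost volume can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `concat_cost_volume`, the `(C, H, W)` feature layout, the `(2C, D, H, W)` output layout, and the zero padding of columns that have no valid match are all assumptions, chosen to follow the common concatenation-volume convention in which left pixel `x` is paired with right pixel `x - d` at disparity `d`.

```python
import numpy as np


def concat_cost_volume(left_feat, right_feat, max_disp):
    """Build a concatenation cost volume (hypothetical helper, illustrative only).

    left_feat, right_feat: unary feature maps of shape (C, H, W).
    Returns an array of shape (2C, max_disp, H, W).
    """
    c, h, w = left_feat.shape
    # Columns with no valid match at a given disparity stay zero (assumed padding).
    volume = np.zeros((2 * c, max_disp, h, w), dtype=left_feat.dtype)
    for d in range(max_disp):
        if d == 0:
            volume[:c, d] = left_feat
            volume[c:, d] = right_feat
        else:
            # Left pixel x is paired with right pixel x - d.
            volume[:c, d, :, d:] = left_feat[:, :, d:]
            volume[c:, d, :, d:] = right_feat[:, :, :-d]
    return volume


# Usage: a left view that equals the right view shifted by 2 pixels.
rng = np.random.default_rng(0)
right = rng.standard_normal((4, 3, 10))
left = np.zeros_like(right)
left[:, :, 2:] = right[:, :, :-2]
vol = concat_cost_volume(left, right, max_disp=4)
```

In this toy input the two halves of the volume coincide at disparity 2, which is the kind of signal a later aggregation network would learn to detect. In the framework described above, an attention stream would additionally reweight such a volume; that part is not sketched here.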
Pages: 17