Improved stereo matching framework with embedded multilevel attention

被引:2
|
作者
Li, Bohan [1 ]
Du, Juan [1 ]
Okae, James [2 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
[2] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Peoples R China
关键词
stereo matching; multilevel attention; global coherence; disparity optimization; deep learning;
D O I
10.1117/1.JEI.31.3.033037
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The recent advent of deep convolutional neural networks (CNNs) in stereo matching has led to significant improvements. However, current CNN methods still face challenges in incorporating hierarchical context information with global dependencies and lacking the discriminative ability of feature representation to resolve matching ambiguities in ill-conditioned regions. To address the aforementioned problems, we propose an improved stereo matching framework that joins a stereo backbone network and an embedded independent multilevel attention subnetwork in an end-to-end trainable pipeline. The stereo backbone network applies a residual atrous spatial pyramid pooling integrated with channelwise attention to capture richer multiscale contextual information and selectively enhance discriminative features. This is followed by unary feature concatenation to construct cost volume for disparity prediction. To further improve performance, the embedded multilevel attention subnetwork learns global coherent contextual information to generate three attention streams, which are used to boost the unary feature representations with spatial encoding, enhance the quality of cost volume, and refine the disparity map, respectively. We show that appending the proposed multilevel attention subnetwork to the stereo backbone network produces significant improvements in matching accuracy. The experimental results on Scene Flow and KITTI 2012/2015 demonstrate that our method can achieve competitive performance in stereo matching. (C) 2022 SPIE and IS&T
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Attention Aggregation Encoder-Decoder Network Framework for Stereo Matching
    Zhang, Yaru
    Li, Yaqian
    Kong, Yating
    Liu, Bin
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 760 - 764
  • [2] Attention Stereo Matching Network
    Zhang, Doudou
    Cai, Jing
    Xue, Yanbing
    Gao, Zan
    Zhang, Hua
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4973 - 4980
  • [3] Parallax attention stereo matching network based on the improved group-wise correlation stereo network
    Yu, Xuefei
    Gu, Jinan
    Huang, Zedong
    Zhang, Zhijie
    PLOS ONE, 2022, 17 (02):
  • [4] Multiple attention networks for stereo matching
    Longyuan Guo
    Houyu Duan
    Wuwei Zhou
    Multimedia Tools and Applications, 2021, 80 : 28583 - 28601
  • [5] Multiple attention networks for stereo matching
    Guo, Longyuan
    Duan, Houyu
    Zhou, Wuwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 28583 - 28601
  • [6] A fast multilevel method for matching stereo images
    Hariti, M
    Ruichek, Y
    Koukam, A
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 203 - 206
  • [7] RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching
    Lipson, Lahav
    Teed, Zachary
    Deng, Jia
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 218 - 227
  • [8] A progressive framework for dense stereo matching
    Jia B.
    Liu S.
    Du Z.
    Pattern Recognition and Image Analysis, 2016, 26 (2) : 294 - 301
  • [9] A global matching framework for stereo computation
    Tao, H
    Sawhney, HS
    Kumar, R
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, 2001, : 532 - 539
  • [10] An Improved Filtering for Fast Stereo Matching
    Huang, Xiaoming
    Cui, Guoqin
    Zhang, Yundong
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2448 - 2452