Multi-scale inputs and context-aware aggregation network for stereo matching

被引:1
|
作者
Shi, Liqing [1 ,2 ,3 ]
Xiong, Taiping [1 ,2 ]
Cui, Gengshen [2 ]
Pan, Minghua [2 ]
Cheng, Nuo [1 ,2 ]
Wu, Xiangjie [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[3] Guilin Univ Elect Technol, Nanning Res Inst, Nanning 530000, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-scale feature fusion; Context-aware capability; 3D squeeze-and-excitation; Stereo matching; Binocular vision;
D O I
10.1007/s11042-024-18492-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the significant progress made in deep learning-based stereo matching, the accuracy of these methods significantly decreases when faced with challenges such as occlusions, reflections, textureless areas, and scale variations. In this paper, we propose MSCANet, a novel stereo matching network that integrates multi-scale inputs and context-aware aggregation ability. MSCANet effectively integrates rich multi-scale feature information and exhibits context-aware capability, thereby enabling it to achieve superior performance. Firstly, a multi-scale aware fusion module is designed to efficiently incorporate more comprehensive global context features at different scales, which allows the model to enhance its ability to generalize across images of varying scales. Secondly, a novel V-shaped encoder/decoder module is developed to effectively exploit the rich feature information. In the encoding stage, a 3D squeeze-and-excitation block is introduced to facilitate adaptively recalibration of learned feature maps. This block effectively suppresses irrelevant features while enhancing useful features, which improved efficiency and accuracy in disparity prediction. Additionally, a 3D context-aware decode block is designed to effectively utilize global context features to restore the original image structure during the decoding stage. Moreover, the high-level feature maps can be employed to augment low-level feature maps by incorporating more detailed information to avoid the side effects caused by the loss of information during the encoding process. Extensive ablation experiments and comparative experiments were conducted on Scene Flow dataset, KITTI2012 and KITTI2015 datasets to validate the effectiveness of each proposed module. The experimental results demonstrate MSCANet achieves competitive performance and offers a more straightforward and efficient model design, as well as faster inference speed.
引用
收藏
页码:75171 / 75194
页数:24
相关论文
共 50 条
  • [41] Robust Scale-Aware Stereo Matching Network
    Okae J.
    Li B.
    Du J.
    Hu Y.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (02): : 244 - 253
  • [42] Multi-Scale Context-Aware Correlation Filter Tracking Algorithm Based on Channel Reliability
    Yin Mingfeng
    Bo Yuming
    Zhu Jianliang
    Wu Panlong
    ACTA OPTICA SINICA, 2019, 39 (05)
  • [43] SI-NET: MULTI-SCALE CONTEXT-AWARE CONVOLUTIONAL BLOCK FOR SPEAKER VERIFICATION
    Li, Zhuo
    Fang, Ce
    Xiao, Runqiu
    Wang, Wenchao
    Yan, Yonghong
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 220 - 227
  • [44] Context-Aware Multi-View Summarization Network for Image-Text Matching
    Qu, Leigang
    Liu, Meng
    Cao, Da
    Nie, Liqiang
    Tian, Qi
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1047 - 1055
  • [45] Multi-Scale Structure Perception and Global Context-Aware Method for Small-Scale Pedestrian Detection
    Gao, Hao
    Huang, Shucheng
    Li, Mingxing
    Li, Tian
    IEEE ACCESS, 2024, 12 : 76392 - 76403
  • [46] A multi-scale context-aware and batch-independent lightweight network for green tide extraction from SAR images
    Xu, Mingming
    Zhu, Xiaofang
    Liu, Yanfen
    Liu, Shanwei
    Sheng, Hui
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (13) : 4474 - 4499
  • [47] A Confidence-Aware Cascade Network for Multi-Scale Stereo Matching of Very-High-Resolution Remote Sensing Images
    Tao, Rongshu
    Xiang, Yuming
    You, Hongjian
    REMOTE SENSING, 2022, 14 (07)
  • [48] Deep Multi-Scale Context Aware Feature Aggregation for Curved Scene Text Detection
    Dai, Pengwen
    Zhang, Hua
    Cao, Xiaochun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (08) : 1969 - 1984
  • [49] CONVOLUTIONAL NEURAL NETWORK USING MULTI-SCALE INFORMATION FOR STEREO MATCHING COST COMPUTATION
    Chen, Jiahui
    Yuan, Chun
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3424 - 3428
  • [50] Global context-aware feature modulation networks for unified multi-scale super-resolution
    Zhang, Dacheng
    Lei, Weimin
    Zhang, Wei
    Chen, Xinyi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)