FPattNet: A Multi-Scale Feature Fusion Network with Occlusion Awareness for Depth Estimation of Light Field Images

Cited by: 4
Authors
Xiao, Min [1]
Lv, Chen [1]
Liu, Xiaomin [1]
Affiliations
[1] Zhengzhou Univ, Sch Phys & Microelect, Zhengzhou 450001, Peoples R China
Keywords
light field; depth estimation; deep learning; occlusion handling
DOI
10.3390/s23177480
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
A light field camera captures light arriving from many directions within a scene, which allows the scene to be reconstructed. A light field image therefore inherently contains depth information about the scene, and depth estimation from light field images has become a popular research topic. This paper proposes an occlusion-aware depth estimation network for light field images. Because a light field image contains many views from different viewpoints, identifying the combinations of views that contribute most to the depth estimation of the center view is critical to improving accuracy. Current methods typically rely on a fixed set of views, such as the vertical, horizontal, and diagonal views, which may not be optimal for all scenes. To address this limitation, we propose a novel approach that considers all available views during depth estimation and uses an attention mechanism to assign a weight to each view dynamically. By feeding all views into the network and applying the attention mechanism, we enable the model to adaptively determine the most informative views for each scene, thus achieving more accurate depth estimation. Furthermore, we introduce a multi-scale feature fusion strategy that aggregates contextual information and expands the receptive field, improving the network's performance in challenging scenarios such as textureless and occluded regions.
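The abstract describes weighting every view dynamically with an attention mechanism but does not give the layer design. The following is only a minimal NumPy sketch of that general idea, not the authors' architecture: each view's feature map is pooled to a descriptor, scored, and the scores are turned into weights with a softmax. The function names `softmax` and `view_attention`, the scoring vector `w_score`, the 9x9 angular grid (81 views), and all tensor sizes are assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / np.sum(e, axis=axis, keepdims=True)


def view_attention(view_features, w_score):
    """Weight per-view feature maps with softmax attention (illustrative only).

    view_features: array of shape (V, C, H, W), one feature map per view.
    w_score: hypothetical scoring vector of shape (C,); in a real network
             this would be learned, here it is simply passed in.
    Returns (weights, fused) with shapes (V,) and (C, H, W).
    """
    # Global average pooling: one C-dimensional descriptor per view.
    descriptors = view_features.mean(axis=(2, 3))  # (V, C)
    # One scalar score per view.
    scores = descriptors @ w_score  # (V,)
    # Softmax turns scores into positive weights that sum to 1.
    weights = softmax(scores)  # (V,)
    # Weighted sum over the view axis.
    fused = np.tensordot(weights, view_features, axes=(0, 0))  # (C, H, W)
    return weights, fused


# Assumed sizes: 81 views (9x9 grid), 8 channels, 16x16 feature maps.
rng = np.random.default_rng(0)
views = rng.standard_normal((81, 8, 16, 16))
w = rng.standard_normal(8)
weights, fused = view_attention(views, w)
print(weights.shape, fused.shape)
```

Because the weights are recomputed from the input features, a different scene yields a different weighting of the views, which is the contrast with a fixed horizontal, vertical, and diagonal subset that the abstract draws.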
Pages: 19