FPattNet: A Multi-Scale Feature Fusion Network with Occlusion Awareness for Depth Estimation of Light Field Images

Cited by: 4
Authors
Xiao, Min [1]
Lv, Chen [1]
Liu, Xiaomin [1]
Affiliations
[1] Zhengzhou Univ, Sch Phys & Microelect, Zhengzhou 450001, Peoples R China
Keywords
light field; depth estimation; deep learning; occlusion handling
DOI
10.3390/s23177480
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
A light field camera captures light arriving from many directions within a scene, which allows the scene to be reconstructed. A light field image therefore inherently encodes the depth of the scene, and depth estimation from light field images has become a popular research topic. This paper proposes an occlusion-aware depth estimation network for light field images. Because a light field image contains many views captured from different viewpoints, identifying the combinations of views that contribute most to the depth estimate of the center view is critical to improving accuracy. Current methods typically rely on a fixed set of views, such as those along the vertical, horizontal, and diagonal directions, which may not be optimal for all scenes. To address this limitation, we propose an approach that considers all available views during depth estimation and uses an attention mechanism to assign a weight to each view dynamically. By feeding all views into the network and applying the attention mechanism, we enable the model to adaptively determine the most informative views for each scene, thus achieving more accurate depth estimation. Furthermore, we introduce a multi-scale feature fusion strategy that aggregates contextual information and expands the receptive field, improving the network's performance in challenging scenarios such as textureless and occluded regions.
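The view-weighting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' network: it only shows the general pattern of scoring every view, normalizing the scores with a softmax, and fusing per-view features by the resulting weights. The function names, the single projection vector `score_weights`, the 9×9 view grid, and the 4-channel toy features are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def view_attention_fuse(view_features, score_weights):
    """Fuse per-view feature vectors using softmax attention weights.

    view_features: list of V feature vectors, each of length C.
    score_weights: length-C vector standing in for a learned projection
                   (hypothetical; a real network would learn this).
    Returns (weights, fused), where weights has length V and fused has length C.
    """
    # One scalar score per view: dot product of its features with the projection.
    scores = [
        sum(f * w for f, w in zip(feat, score_weights))
        for feat in view_features
    ]
    weights = softmax(scores)

    # Weighted sum over views, computed channel by channel.
    num_channels = len(view_features[0])
    fused = [
        sum(weights[v] * view_features[v][c] for v in range(len(view_features)))
        for c in range(num_channels)
    ]
    return weights, fused


# Toy input: 81 views (an assumed 9x9 angular grid), 4 channels per view.
views = [[math.sin(0.1 * v + c) for c in range(4)] for v in range(81)]
proj = [0.5, -0.25, 1.0, 0.1]
weights, fused = view_attention_fuse(views, proj)
```

Because the weights come from a softmax, every view keeps a non-zero contribution and the weights sum to one; views with higher scores simply dominate the fused feature, in contrast to a fixed vertical/horizontal/diagonal selection.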
Pages: 19