Unsupervised disparity estimation from light field using plug-and-play weighted warping loss

被引:5
|
作者
Iwatsuki, Taisei [1 ]
Takahashi, Keita [1 ]
Fujii, Toshiaki [1 ]
机构
[1] Nagoya Univ, Grad Sch Engn, Furo Cho,Chikusa Ku, Nagoya 4648603, Japan
关键词
Light field; Disparity estimation; CNN; Unsupervised learning; STEREO; DEPTH;
D O I
10.1016/j.image.2022.116764
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigated disparity estimation from a light field using a convolutional neural network (CNN). Most of the methods implemented a supervised learning framework, where the predicted disparity map was compared directly to the corresponding ground-truth disparity map in the training stage. However, light field data accompanied with ground-truth disparity maps were insufficient and rarely available for real-world scenes. The lack of training data resulted in limited generality of the methods trained with them. To tackle this problem, we took a simple Figure-and-play approach to remake a supervised method into an unsupervised (self-supervised) one. We replaced the loss function of the original method with one that does not depend on the ground-truth disparity maps. More specifically, our loss function is designed to indirectly evaluate the accuracy of the disparity map by using warping errors among the input light field views. We designed pixel-wise weights to properly evaluate the warping errors in the presence of occlusions, and an edge loss to encourage edge alignment between the image and the disparity map. As a result of this unsupervised learning framework, our method can use more abundant training datasets (even those without ground-truth disparity maps) than the original supervised method. Our method was evaluated on computer-generated scenes (4D Light Field Benchmark) and real-world scenes captured by Lytro Illum cameras. Our method achieved the state-ofthe-art performance as an unsupervised method on the benchmark. We also demonstrated that our method can estimate disparity maps more accurately than the original supervised method for various real-world scenes.
引用
收藏
页数:8
相关论文
共 40 条
  • [31] Depth Estimation from Light Field Geometry Using Convolutional Neural Networks
    Han, Lei
    Huang, Xiaohua
    Shi, Zhan
    Zheng, Shengnan
    SENSORS, 2021, 21 (18)
  • [32] ESTIMATION OF ENERGY LOSS DUE TO LIGHT POLLUTION USING SATELLITE AND FIELD MEASUREMENT DATA: EXAMPLE OF ERZINCAN CITY
    Yilmaz, Abdulvahap
    ENVIRONMENTAL ENGINEERING AND MANAGEMENT JOURNAL, 2024, 23 (12):
  • [33] Toward Real-World Light Field Depth Estimation: A Noise-Aware Paradigm Using Multi-Stereo Disparity Integration
    Mo, Yu
    Yang, Jungang
    Xiao, Chao
    An, Wei
    IEEE ACCESS, 2019, 7 : 94391 - 94399
  • [34] Unsupervised estimation of left ventricular displacement from MR tagged images using Markov random field edge priors
    Yan, LT
    Denney, TS
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 689 - 693
  • [35] Zero-Shot Depth Estimation From Light Field Using A Convolutional Neural Network
    Peng, Jiayong
    Xiong, Zhiwei
    Wang, Yicheng
    Zhang, Yueyi
    Liu, Dong
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2020, 6 : 682 - 696
  • [36] Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence
    Tao, Michael W.
    Srinivasan, Pratul P.
    Hadap, Sunil
    Rusinkiewicz, Szymon
    Malik, Jitendra
    Ramamoorthi, Ravi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) : 546 - 560
  • [37] Depth Estimation From Light Field Using Graph-Based Structure-Aware Analysis
    Zhang, Yuchen
    Dai, Wenrui
    Xu, Mingxing
    Zou, Junni
    Zhang, Xiaopeng
    Xiong, Hongkai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4269 - 4283
  • [38] LIGHT-FIELD RECONSTRUCTION AND DEPTH ESTIMATION FROM FOCAL STACK IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
    Huang, Zhengyu
    Fessler, Jeffrey A.
    Norris, Theodore B.
    Chun, Il Yong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8648 - 8652
  • [39] Occlusion Handling in Depth Estimation of a Scene from a Given Light Field Using a Geodesic Distance and the Principle of Symmetry of the View
    Mozerov, M. G.
    Karnaukhov, V. N.
    Kober, V. I.
    Zimina, L. V.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2024, 69 (1-3) : 29 - 37
  • [40] Towards tDCS Digital Twins Using Deep Learning-Based Direct Estimation of Personalized Electrical Field Maps from T1-Weighted MRI
    Stolte, Skylar E.
    Indahlastari, Aprinda
    Albizu, Alejandro
    Woods, Adam J.
    Fang, Ruogu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 465 - 475