Unsupervised disparity estimation from light field using plug-and-play weighted warping loss

被引：5

作者：

Iwatsuki, Taisei ^{[1
]}

Takahashi, Keita ^{[1
]}

Fujii, Toshiaki ^{[1
]}

机构：

[1] Nagoya Univ, Grad Sch Engn, Furo Cho,Chikusa Ku, Nagoya 4648603, Japan

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2022年 / 107卷

关键词：

Light field; Disparity estimation; CNN; Unsupervised learning; STEREO; DEPTH;

D O I：

10.1016/j.image.2022.116764

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We investigated disparity estimation from a light field using a convolutional neural network (CNN). Most of the methods implemented a supervised learning framework, where the predicted disparity map was compared directly to the corresponding ground-truth disparity map in the training stage. However, light field data accompanied with ground-truth disparity maps were insufficient and rarely available for real-world scenes. The lack of training data resulted in limited generality of the methods trained with them. To tackle this problem, we took a simple Figure-and-play approach to remake a supervised method into an unsupervised (self-supervised) one. We replaced the loss function of the original method with one that does not depend on the ground-truth disparity maps. More specifically, our loss function is designed to indirectly evaluate the accuracy of the disparity map by using warping errors among the input light field views. We designed pixel-wise weights to properly evaluate the warping errors in the presence of occlusions, and an edge loss to encourage edge alignment between the image and the disparity map. As a result of this unsupervised learning framework, our method can use more abundant training datasets (even those without ground-truth disparity maps) than the original supervised method. Our method was evaluated on computer-generated scenes (4D Light Field Benchmark) and real-world scenes captured by Lytro Illum cameras. Our method achieved the state-ofthe-art performance as an unsupervised method on the benchmark. We also demonstrated that our method can estimate disparity maps more accurately than the original supervised method for various real-world scenes.

引用

页数：8

共 40 条

[31] Depth Estimation from Light Field Geometry Using Convolutional Neural Networks
Han, Lei
Huang, Xiaohua
Shi, Zhan
Zheng, Shengnan
SENSORS, 2021, 21 (18)
[32] ESTIMATION OF ENERGY LOSS DUE TO LIGHT POLLUTION USING SATELLITE AND FIELD MEASUREMENT DATA: EXAMPLE OF ERZINCAN CITY
Yilmaz, Abdulvahap
ENVIRONMENTAL ENGINEERING AND MANAGEMENT JOURNAL, 2024, 23 (12):
[33] Toward Real-World Light Field Depth Estimation: A Noise-Aware Paradigm Using Multi-Stereo Disparity Integration
Mo, Yu
Yang, Jungang
Xiao, Chao
An, Wei
IEEE ACCESS, 2019, 7 : 94391 - 94399
[34] Unsupervised estimation of left ventricular displacement from MR tagged images using Markov random field edge priors
Yan, LT
Denney, TS
1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 689 - 693
[35] Zero-Shot Depth Estimation From Light Field Using A Convolutional Neural Network
Peng, Jiayong
Xiong, Zhiwei
Wang, Yicheng
Zhang, Yueyi
Liu, Dong
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2020, 6 : 682 - 696
[36] Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence
Tao, Michael W.
Srinivasan, Pratul P.
Hadap, Sunil
Rusinkiewicz, Szymon
Malik, Jitendra
Ramamoorthi, Ravi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) : 546 - 560
[37] Depth Estimation From Light Field Using Graph-Based Structure-Aware Analysis
Zhang, Yuchen
Dai, Wenrui
Xu, Mingxing
Zou, Junni
Zhang, Xiaopeng
Xiong, Hongkai
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4269 - 4283
[38] LIGHT-FIELD RECONSTRUCTION AND DEPTH ESTIMATION FROM FOCAL STACK IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
Huang, Zhengyu
Fessler, Jeffrey A.
Norris, Theodore B.
Chun, Il Yong
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8648 - 8652
[39] Occlusion Handling in Depth Estimation of a Scene from a Given Light Field Using a Geodesic Distance and the Principle of Symmetry of the View
Mozerov, M. G.
Karnaukhov, V. N.
Kober, V. I.
Zimina, L. V.
JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2024, 69 (1-3) : 29 - 37
[40] Towards tDCS Digital Twins Using Deep Learning-Based Direct Estimation of Personalized Electrical Field Maps from T1-Weighted MRI
Stolte, Skylar E.
Indahlastari, Aprinda
Albizu, Alejandro
Woods, Adam J.
Fang, Ruogu
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 465 - 475

← 1 2 3 4 →