View synthesis with multiplane images from computationally generated RGB-D light fields

被引：0

作者：

Yoon, Gang-Joon ^{[1
]}

Jung, Geunho ^{[2
]}

Song, Jinjoo ^{[2
]}

Yoon, Sang Min ^{[2
]}

机构：

[1] Natl Inst Math Sci, 70 Yuseong Daero 1689 Beon Gil, Daejeon 34047, South Korea

[2] Kookmin Univ, Coll Comp Sci, HCI Lab, 77 Jeongneung Ro, Seoul 02707, South Korea

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 132卷

基金：

新加坡国家研究基金会;

关键词：

View synthesis; Multiplane images; Light field images;

D O I：

10.1016/j.engappai.2024.107930

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image based view synthesis using deep neural networks provide novel scene views from a set of captured single or multiple images. Multiplane images (MPI) represent scene content as set of RGB alpha planes within a reference view frustum and render novel views by projecting the content into the target viewpoints. Image based view synthesis with multiple images is very popularly deployed in various areas because it effectively represents geometric uncertainty in ambiguous regions and can convincingly simulate non-Lambertian effects. However, previous image based view synthesis approaches suffer from interpolating and extrapolating information in pixels or ray spaces to generate seamless novel views without occlusion. To effectively improve visual performance for view interpolation and extrapolation, this paper proposes a novel view synthesis with MPI images. From a monocular RGB image, light field images are computationally generated, the proposed depth map guided deep network produces robust MPI using the light field images and their corresponding depth images, and the MPI network embedded with depth attention blocks forces semantic and geometric information to be uniformly distributed and divided among layers. The proposed approach achieves 3.5% and 4.02% improvements in SSIM and PSNR values, comparing to the SOTA approaches. Qualitative analysis on benchmark dataset also verifies the robustness of the proposed approach.

引用

页数：9

共 50 条

[21] Incremental Registration of RGB-D Images
Dryanovski, Ivan
Jaramillo, Carlos
Xiao, Jizhong
2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1685 - 1690
[22] Unsupervised Segmentation of RGB-D Images
Deng, Zhuo
Latecki, Longin Jan
COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 423 - 435
[23] Visual Recognition in RGB Images and Videos by Learning from RGB-D Data
Li, Wen
Chen, Lin
Xu, Dong
Van Gool, Luc
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) : 2030 - 2036
[24] Single-View View Synthesis with Multiplane Images
Tucker, Richard
Snavely, Noah
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 548 - 557
[25] Point Light Source Position Estimation From RGB-D Images by Learning Surface Attributes
Karaoglu, Sezer
Liu, Yang
Gevers, Theo
Smeulders, Arnold W. M.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5149 - 5159
[26] Understanding Everyday Hands in Action from RGB-D Images
Rogez, Gregory
Supancic, James S., III
Ramanan, Deva
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3889 - 3897
[27] Stereo Extrapolation: View synthesis with multiplane images
Salem, Ahmed
Ibrahem, Hatem
Yagoub, Bilel
Kang, Hyun Soo
2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
[28] Fine-Grained Categorization From RGB-D Images
Tan, Yanhao
Rahman, Mohammad Muntasir
Yan, Yanfu
Xue, Jian
Shao, Ling
Lu, Ke
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 917 - 928
[29] Difference-in-level Detection from RGB-D Images
Nonaka, Yusuke
Uchiyama, Hideaki
Saito, Hideo
Yachida, Shoji
Iwamoto, Kota
ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 393 - 406
[30] COMPACT AND ADAPTIVE MULTIPLANE IMAGES FOR VIEW SYNTHESIS
Navarro, Julia
Sabater, Neus
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3403 - 3407

← 1 2 3 4 5 →