View synthesis with multiplane images from computationally generated RGB-D light fields

Cited by: 0
Authors
Yoon, Gang-Joon [1]
Jung, Geunho [2]
Song, Jinjoo [2]
Yoon, Sang Min [2]
Affiliations
[1] Natl Inst Math Sci, 70 Yuseong Daero 1689 Beon Gil, Daejeon 34047, South Korea
[2] Kookmin Univ, Coll Comp Sci, HCI Lab, 77 Jeongneung Ro, Seoul 02707, South Korea
Funding
National Research Foundation, Singapore;
Keywords
View synthesis; Multiplane images; Light field images;
DOI
10.1016/j.engappai.2024.107930
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Image-based view synthesis using deep neural networks provides novel scene views from a single captured image or a set of captured images. Multiplane images (MPIs) represent scene content as a set of RGB-alpha planes within a reference view frustum and render novel views by projecting that content into the target viewpoints. Image-based view synthesis with multiple images is widely deployed in various areas because it effectively represents geometric uncertainty in ambiguous regions and can convincingly simulate non-Lambertian effects. However, previous image-based view synthesis approaches struggle to interpolate and extrapolate information in pixel or ray space well enough to generate seamless novel views without occlusion artifacts. To improve visual performance for view interpolation and extrapolation, this paper proposes a novel view synthesis method based on MPIs. Light field images are first computationally generated from a monocular RGB image. The proposed depth-map-guided deep network then produces a robust MPI from the light field images and their corresponding depth images. The MPI network is embedded with depth attention blocks, which force semantic and geometric information to be uniformly distributed and divided among the layers. The proposed approach achieves improvements of 3.5% in SSIM and 4.02% in PSNR, respectively, compared to state-of-the-art approaches. Qualitative analysis on a benchmark dataset also verifies the robustness of the proposed approach.
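The way an MPI turns a stack of RGB-alpha planes into a single image can be illustrated with the standard "over" compositing operation. The following is a minimal NumPy sketch, not the authors' implementation. It assumes the planes have already been warped into the target view (the per-plane projection step is omitted) and that they are ordered from farthest to nearest. The function name `composite_mpi` and the array layout are hypothetical choices made for illustration.

```python
import numpy as np


def composite_mpi(rgb, alpha):
    """Blend MPI planes back to front with the "over" operator.

    rgb:   array of shape (D, H, W, 3), colors of D planes.
    alpha: array of shape (D, H, W, 1), opacities in [0, 1].
    Assumption: plane 0 is the farthest plane and plane D-1 the nearest,
    and every plane is already warped into the target view.
    """
    out = np.zeros_like(rgb[0])
    for color, a in zip(rgb, alpha):
        # Each nearer plane covers what is behind it in proportion to its alpha.
        out = color * a + out * (1.0 - a)
    return out


# Tiny example: an opaque red far plane behind a 25%-opaque blue near plane.
rgb = np.zeros((2, 1, 1, 3))
rgb[0, ..., 0] = 1.0  # far plane: red
rgb[1, ..., 2] = 1.0  # near plane: blue
alpha = np.ones((2, 1, 1, 1))
alpha[1] = 0.25
pixel = composite_mpi(rgb, alpha)[0, 0]
```

In a full pipeline, each plane would first be reprojected into the target camera, typically with a planar homography, before this blending step. The sketch leaves that part out to keep the compositing logic visible.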
Pages: 9