View synthesis with multiplane images from computationally generated RGB-D light fields

被引:0
|
作者
Yoon, Gang-Joon [1 ]
Jung, Geunho [2 ]
Song, Jinjoo [2 ]
Yoon, Sang Min [2 ]
机构
[1] Natl Inst Math Sci, 70 Yuseong Daero 1689 Beon Gil, Daejeon 34047, South Korea
[2] Kookmin Univ, Coll Comp Sci, HCI Lab, 77 Jeongneung Ro, Seoul 02707, South Korea
基金
新加坡国家研究基金会;
关键词
View synthesis; Multiplane images; Light field images;
D O I
10.1016/j.engappai.2024.107930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image based view synthesis using deep neural networks provide novel scene views from a set of captured single or multiple images. Multiplane images (MPI) represent scene content as set of RGB alpha planes within a reference view frustum and render novel views by projecting the content into the target viewpoints. Image based view synthesis with multiple images is very popularly deployed in various areas because it effectively represents geometric uncertainty in ambiguous regions and can convincingly simulate non-Lambertian effects. However, previous image based view synthesis approaches suffer from interpolating and extrapolating information in pixels or ray spaces to generate seamless novel views without occlusion. To effectively improve visual performance for view interpolation and extrapolation, this paper proposes a novel view synthesis with MPI images. From a monocular RGB image, light field images are computationally generated, the proposed depth map guided deep network produces robust MPI using the light field images and their corresponding depth images, and the MPI network embedded with depth attention blocks forces semantic and geometric information to be uniformly distributed and divided among layers. The proposed approach achieves 3.5% and 4.02% improvements in SSIM and PSNR values, comparing to the SOTA approaches. Qualitative analysis on benchmark dataset also verifies the robustness of the proposed approach.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] GPU ACCELERATED VIEW SYNTHESIS FROM MULTIPLE RGB-D IMAGES
    Park, Anjin
    Kim, Jinwook
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 573 - 576
  • [2] Point-based View Synthesis from RGB-D Images
    Park, Anjin
    Kim, Jinwook
    2014 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2014, : 974 - 975
  • [3] Neural Radiance Fields From Sparse RGB-D Images for High-Quality View Synthesis
    Yuan, Yu-Jie
    Lai, Yu-Kun
    Huang, Yi-Hua
    Kobbelt, Leif
    Gao, Lin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8713 - 8728
  • [4] SYNTHESIS OF LIGHT-FIELD RAWDATA FROM RGB-D IMAGES
    Sun, Chao
    Wu, Yiqun
    Zeng, Bing
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1448 - 1452
  • [5] A UNIFIED DEEP LEARNING APPROACH FOR FOVEATED RENDERING & NOVEL VIEW SYNTHESIS FROM SPARSE RGB-D LIGHT FIELDS
    Thumuluri, Vineet
    Sharma, Mansi
    2020 INTERNATIONAL CONFERENCE ON 3D IMMERSION (IC3D), 2020,
  • [6] Domain adaptation from RGB-D to RGB images
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    SIGNAL PROCESSING, 2017, 131 : 27 - 35
  • [7] Building change detection with RGB-D map generated from UAV images
    Chen, Baohua
    Chen, Zhixiang
    Deng, Lei
    Duan, Yueqi
    Zhou, Jie
    NEUROCOMPUTING, 2016, 208 : 350 - 364
  • [8] Light-Field Raw Data Synthesis From RGB-D Images: Pushing to the Extreme
    Wu, Yiqun
    Liu, Shuaicheng
    Sun, Chao
    Zeng, Bing
    IEEE ACCESS, 2020, 8 : 33391 - 33405
  • [9] Recognizing RGB Images by Learning from RGB-D Data
    Chen, Lin
    Li, Wen
    Xu, Dong
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1418 - 1425
  • [10] VIRTUAL VIEW SYNTHESIS USING RGB-D CAMERAS
    Chien, Chun-Liang
    Lee, Tzu-Chin
    Hang, Hsueh-Ming
    2016 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2016,