Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding

被引:24
|
作者
Verhack, Ruben [1 ,2 ]
Sikora, Thomas [2 ]
Van Wallendael, Glenn [1 ]
Lambert, Peter [1 ]
机构
[1] Univ Ghent, IDLab, IMEC, B-9052 Ghent, Belgium
[2] Tech Univ Berlin, Commun Syst Grp, D-10623 Berlin, Germany
关键词
Kernel; Encoding; Cameras; Image coding; Solid modeling; Image reconstruction; Image resolution; Mixture of experts; light fields; mixture models; sparse representation; bayesian modeling; QUALITY ASSESSMENT; MULTIVIEW;
D O I
10.1109/TMM.2019.2932614
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research in light field (LF) processing has heavily increased over the last decade. This is largely driven by the desire to achieve the same level of immersion and navigational freedom for camera-captured scenes as it is currently available for CGI content. Standardization organizations such as MPEG and JPEG continue to follow conventional coding paradigms in which viewpoints are discretely represented on 2-D regular grids. These grids are then further decorrelated through hybrid DPCM/transform techniques. However, these 2-D regular grids are less suited for high-dimensional data, such as LFs. We propose a novel coding framework for higher-dimensional image modalities, called Steered Mixture-of-Experts (SMoE). Coherent areas in the higher-dimensional space are represented by single higher-dimensional entities, called kernels. These kernels hold spatially localized information about light rays at any angle arriving at a certain region. The global model consists thus of a set of kernels which define a continuous approximation of the underlying plenoptic function. We introduce the theory of SMoE and illustrate its application for 2-D images, 4-D LF images, and 5-D LF video. We also propose an efficient coding strategy to convert the model parameters into a bitstream. Even without provisions for high-frequency information, the proposed method performs comparable to the state of the art for low-to-mid range bitrates with respect to subjective visual quality of 4-D LF images. In case of 5-D LF video, we observe superior decorrelation and coding performance with coding gains of a factor of 4x in bitrate for the same quality. At least equally important is the fact that our method inherently has desired functionality for LF rendering which is lacking in other state-of-the-art techniques: (1) full zero-delay random access, (2) light-weight pixel-parallel view reconstruction, and (3) intrinsic view interpolation and super-resolution.
引用
收藏
页码:579 / 593
页数:15
相关论文
共 50 条
  • [21] Representation and coding of light field data
    Lelescu, D
    Bossen, F
    [J]. GRAPHICAL MODELS, 2004, 66 (04) : 203 - 225
  • [22] Special issue on representation and coding of images and video II
    Ngan, KN
    Panchanathan, S
    Sikora, T
    Sun, MT
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (01) : 1 - 4
  • [23] CAPTURING LIGHT FIELD TEXTURES FOR VIDEO CODING
    Mantzel, William
    Romberg, Justin
    [J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 1604 - 1607
  • [24] Representation of moving images with skewed planes and its application to the video coding
    Ueda, Y
    Kaneko, M
    Saito, T
    Harashima, H
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 505 - 508
  • [25] Estimating elastic parameters from digital rock images based on multi-task learning with multi-gate mixture-of-experts
    Hou, Zhiyu
    Cao, Danping
    [J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 213
  • [26] Immersive Light Field Video with a Layered Mesh Representation
    Broxton, Michael
    Flynn, John
    Overbeck, Ryan
    Erickson, Daniel
    Hedman, Peter
    Duvall, Matthew
    Dourgarian, Jason
    Busch, Jay
    Whalen, Matt
    Debevec, Paul
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [27] Light Field Coding Using Weighted Binary Images
    Komatsu, Koji
    Isechi, Kohei
    Takahashi, Keita
    Fujii, Toshiaki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (11) : 2110 - 2119
  • [28] Compositing Light field Video Using Multiplane Images
    DuVall, Matthew
    Flynn, John
    Broxton, Michael
    Debevec, Paul
    [J]. SIGGRAPH '19 - ACM SIGGRAPH 2019 POSTERS, 2019,
  • [29] Light Field Image Coding Based on Hybrid Data Representation
    Monteiro, Ricardo J. S.
    Rodrigues, Nuno M. M.
    Faria, Sergio M. M.
    Nunes, Paulo J. L.
    [J]. IEEE ACCESS, 2020, 8 : 115728 - 115744
  • [30] A Practical Light Field Representation and Coding Scheme with an Emphasis on Refocusing
    Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon
    16419, Korea, Republic of
    [J]. IEIE Trans. Smart Process Comput., 2022, 5 (305-315): : 305 - 315