Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding

被引:24
|
作者
Verhack, Ruben [1 ,2 ]
Sikora, Thomas [2 ]
Van Wallendael, Glenn [1 ]
Lambert, Peter [1 ]
机构
[1] Univ Ghent, IDLab, IMEC, B-9052 Ghent, Belgium
[2] Tech Univ Berlin, Commun Syst Grp, D-10623 Berlin, Germany
关键词
Kernel; Encoding; Cameras; Image coding; Solid modeling; Image reconstruction; Image resolution; Mixture of experts; light fields; mixture models; sparse representation; bayesian modeling; QUALITY ASSESSMENT; MULTIVIEW;
D O I
10.1109/TMM.2019.2932614
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research in light field (LF) processing has heavily increased over the last decade. This is largely driven by the desire to achieve the same level of immersion and navigational freedom for camera-captured scenes as it is currently available for CGI content. Standardization organizations such as MPEG and JPEG continue to follow conventional coding paradigms in which viewpoints are discretely represented on 2-D regular grids. These grids are then further decorrelated through hybrid DPCM/transform techniques. However, these 2-D regular grids are less suited for high-dimensional data, such as LFs. We propose a novel coding framework for higher-dimensional image modalities, called Steered Mixture-of-Experts (SMoE). Coherent areas in the higher-dimensional space are represented by single higher-dimensional entities, called kernels. These kernels hold spatially localized information about light rays at any angle arriving at a certain region. The global model consists thus of a set of kernels which define a continuous approximation of the underlying plenoptic function. We introduce the theory of SMoE and illustrate its application for 2-D images, 4-D LF images, and 5-D LF video. We also propose an efficient coding strategy to convert the model parameters into a bitstream. Even without provisions for high-frequency information, the proposed method performs comparable to the state of the art for low-to-mid range bitrates with respect to subjective visual quality of 4-D LF images. In case of 5-D LF video, we observe superior decorrelation and coding performance with coding gains of a factor of 4x in bitrate for the same quality. At least equally important is the fact that our method inherently has desired functionality for LF rendering which is lacking in other state-of-the-art techniques: (1) full zero-delay random access, (2) light-weight pixel-parallel view reconstruction, and (3) intrinsic view interpolation and super-resolution.
引用
收藏
页码:579 / 593
页数:15
相关论文
共 50 条
  • [1] Steered Mixture-of-Experts for Light Field Video Coding
    Avramelos, Vasileios
    Saenen, Ignace
    Verhack, Ruben
    Van Wallendael, Glenn
    Lambert, Peter
    Sikora, Thomas
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLI, 2018, 10752
  • [2] Video Representation and Coding Using a Sparse Steered Mixture-of-Experts Network
    Lange, Lieven
    Verhack, Ruben
    Sikora, Thomas
    [J]. 2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
  • [3] STEERED MIXTURE-OF-EXPERTS FOR LIGHT FIELD CODING, DEPTH ESTIMATION, AND PROCESSING
    Verhack, Ruben
    Sikora, Thomas
    Lange, Lieven
    Jongebloed, Rolf
    Van Wallendael, Glenn
    Lambert, Peter
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1183 - 1188
  • [4] PROGRESSIVE MODELING OF STEERED MIXTURE-OF-EXPERTS FOR LIGHT FIELD VIDEO APPROXIMATION
    Verhack, Ruben
    Van Wallendael, Glenn
    Courteaux, Martijn
    Lambert, Peter
    Sikora, Thomas
    [J]. 2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 268 - 272
  • [5] COLOR PREDICTION IN IMAGE CODING USING STEERED MIXTURE-OF-EXPERTS
    Verhack, Ruben
    Van de Keer, Simon
    Van Wallendael, Glenn
    Sikora, Thomas
    Lambert, Peter
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1288 - 1292
  • [6] UNIVERSAL IMAGE CODING APPROACH USING SPARSE STEERED MIXTURE-OF-EXPERTS REGRESSION
    Verhack, Ruben
    Sikora, Thomas
    Lange, Lieven
    Van Wallendael, Glenn
    Lambert, Peter
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2142 - 2146
  • [7] Highly parallel steered mixture-of-experts rendering at pixel-level for image and light field data
    Avramelos, Vasileios
    Verhack, Ruben
    Saenen, Ignace
    Van Wallendael, Glenn
    Goossens, Bart
    Lambert, Peter
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (04) : 931 - 947
  • [8] Highly parallel steered mixture-of-experts rendering at pixel-level for image and light field data
    Vasileios Avramelos
    Ruben Verhack
    Ignace Saenen
    Glenn Van Wallendael
    Bart Goossens
    Peter Lambert
    [J]. Journal of Real-Time Image Processing, 2020, 17 : 931 - 947
  • [9] Steered Mixture-of-Experts Approximation of Spherical Image Data
    Verhack, Ruben
    Madhu, Nilesh
    Van Wallendael, Glenn
    Lambert, Peter
    Sikora, Thomas
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 256 - 260
  • [10] HARD REAL-TIME, PIXEL-PARALLEL RENDERING OF LIGHT FIELD VIDEOS USING STEERED MIXTURE-OF-EXPERTS
    Saenen, Ignace P.
    Verhack, Ruben
    Avramelos, Vasileios
    Van Wallendael, Glenn
    Lambert, Peter
    [J]. 2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 337 - 341