Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding

被引：24

作者：

Verhack, Ruben ^{[1
,2
]}

Sikora, Thomas ^{[2
]}

Van Wallendael, Glenn ^{[1
]}

Lambert, Peter ^{[1
]}

机构：

[1] Univ Ghent, IDLab, IMEC, B-9052 Ghent, Belgium

[2] Tech Univ Berlin, Commun Syst Grp, D-10623 Berlin, Germany

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2020年 / 22卷 / 03期

关键词：

Kernel; Encoding; Cameras; Image coding; Solid modeling; Image reconstruction; Image resolution; Mixture of experts; light fields; mixture models; sparse representation; bayesian modeling; QUALITY ASSESSMENT; MULTIVIEW;

D O I：

10.1109/TMM.2019.2932614

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Research in light field (LF) processing has heavily increased over the last decade. This is largely driven by the desire to achieve the same level of immersion and navigational freedom for camera-captured scenes as it is currently available for CGI content. Standardization organizations such as MPEG and JPEG continue to follow conventional coding paradigms in which viewpoints are discretely represented on 2-D regular grids. These grids are then further decorrelated through hybrid DPCM/transform techniques. However, these 2-D regular grids are less suited for high-dimensional data, such as LFs. We propose a novel coding framework for higher-dimensional image modalities, called Steered Mixture-of-Experts (SMoE). Coherent areas in the higher-dimensional space are represented by single higher-dimensional entities, called kernels. These kernels hold spatially localized information about light rays at any angle arriving at a certain region. The global model consists thus of a set of kernels which define a continuous approximation of the underlying plenoptic function. We introduce the theory of SMoE and illustrate its application for 2-D images, 4-D LF images, and 5-D LF video. We also propose an efficient coding strategy to convert the model parameters into a bitstream. Even without provisions for high-frequency information, the proposed method performs comparable to the state of the art for low-to-mid range bitrates with respect to subjective visual quality of 4-D LF images. In case of 5-D LF video, we observe superior decorrelation and coding performance with coding gains of a factor of 4x in bitrate for the same quality. At least equally important is the fact that our method inherently has desired functionality for LF rendering which is lacking in other state-of-the-art techniques: (1) full zero-delay random access, (2) light-weight pixel-parallel view reconstruction, and (3) intrinsic view interpolation and super-resolution.

引用

页码：579 / 593

页数：15

共 50 条

[21] Representation and coding of light field data
Lelescu, D
Bossen, F
[J]. GRAPHICAL MODELS, 2004, 66 (04) : 203 - 225
[22] Special issue on representation and coding of images and video II
Ngan, KN
Panchanathan, S
Sikora, T
Sun, MT
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (01) : 1 - 4
[23] CAPTURING LIGHT FIELD TEXTURES FOR VIDEO CODING
Mantzel, William
Romberg, Justin
[J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 1604 - 1607
[24] Representation of moving images with skewed planes and its application to the video coding
Ueda, Y
Kaneko, M
Saito, T
Harashima, H
[J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 505 - 508
[25] Estimating elastic parameters from digital rock images based on multi-task learning with multi-gate mixture-of-experts
Hou, Zhiyu
Cao, Danping
[J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 213
[26] Immersive Light Field Video with a Layered Mesh Representation
Broxton, Michael
Flynn, John
Overbeck, Ryan
Erickson, Daniel
Hedman, Peter
Duvall, Matthew
Dourgarian, Jason
Busch, Jay
Whalen, Matt
Debevec, Paul
[J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
[27] Light Field Coding Using Weighted Binary Images
Komatsu, Koji
Isechi, Kohei
Takahashi, Keita
Fujii, Toshiaki
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (11) : 2110 - 2119
[28] Compositing Light field Video Using Multiplane Images
DuVall, Matthew
Flynn, John
Broxton, Michael
Debevec, Paul
[J]. SIGGRAPH '19 - ACM SIGGRAPH 2019 POSTERS, 2019,
[29] Light Field Image Coding Based on Hybrid Data Representation
Monteiro, Ricardo J. S.
Rodrigues, Nuno M. M.
Faria, Sergio M. M.
Nunes, Paulo J. L.
[J]. IEEE ACCESS, 2020, 8 : 115728 - 115744
[30] A Practical Light Field Representation and Coding Scheme with an Emphasis on Refocusing
Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon
16419, Korea, Republic of
[J]. IEIE Trans. Smart Process Comput., 2022, 5 (305-315): : 305 - 315

← 1 2 3 4 5 →