Sparse Spatio-Temporal Representation With Adaptive Regularized Dictionary Learning for Low Bit-Rate Video Coding

被引:23
|
作者
Xiong, Hongkai [1 ]
Pan, Zhiming [1 ]
Ye, Xinwei [1 ]
Chen, Chang Wen [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
基金
中国国家自然科学基金;
关键词
Atom decomposition; dictionary learning; primitive patch; sparse representation; video coding; BLOCK MOTION COMPENSATION; IMAGE QUALITY ASSESSMENT; SUPERRESOLUTION; ALGORITHM;
D O I
10.1109/TCSVT.2012.2221271
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For promising vision-based video coding on low-quality data, this paper proposes a sparse spatio-temporal representation with adaptive regularized dictionary learning and develops a low bit-rate video coding scheme. In a reversed-complexity Wyner-Ziv coding manner, it selects a subset of key frames to code at original resolution, while the rest are down sampled and reconstructed by a sparse spatio-temporal approximation using key frames as a training dataset. Since primitive patches (geometry) are of low dimensionality and can be well learned from the primitive patches across frames in a scale space, a video frame is divided into three layers: a primitive layer, a nonprimitive coarse layer, and a nonprimitive smooth layer. The multiscale differential feature representations are invertible to facilitate reconstruction with dictionary learning, and the target is formulated as an optimization problem by constructing a sparse representation of 2-D patches and 3-D volumes over adaptive regularized dictionaries, a set of 2-D subdictionary pairs trained from primitive patches, and a 3-D dictionary trained from nonprimitive volumes. Specifically, the nonprimitive layer is constructed as volumes in to order keep it consistent along the motion trajectory, which enables sparse representations over a learned 3-D spatio-temporal dictionary. Through hierarchical bidirectional motion estimation and adaptive overlapped block motion compensation, the 3-D low-frequency and high-frequency dictionary pair is designed by the K-SVD algorithm to update the atoms for optimal sparse representation and convergence. In reconstruction, the lost high-frequency information of the down-sampled frames can be synthesized from the sparse spatio-temporal representation over the adaptive regularized dictionaries. Extensive experiments validate the compression efficiency of the proposed scheme versus H.264/AVC in terms of both objective and subjective comparisons.
引用
收藏
页码:710 / 728
页数:19
相关论文
共 50 条
  • [21] VERY-LOW BIT-RATE VIDEO CODING
    TZOU, KH
    MUSMANN, HG
    AIZAWA, K
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1994, 4 (03) : 213 - 215
  • [22] Very low bit-rate digital video coding
    Scargall, Lee
    Dlay, Satnam
    [J]. Advances in Intelligent Systems and Computer Science, 1999, : 273 - 279
  • [23] A Learning-Based Framework for Low Bit-Rate Image and Video Coding
    Xiong, Hongkai
    Yuan, Zhe
    Xu, Yang
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2009, 2009, 5879 : 232 - 244
  • [24] An adaptive deblocking algorithm for low bit-rate video
    Kocovski, Blagoj
    Kartalov, Tomislav
    Ivanovski, Zoran
    Panovski, Ljupcho
    [J]. 2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 888 - +
  • [25] Perceptual rate control algorithms for low bit-rate video coding
    Chang, SC
    Yang, JF
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2002, 25 (03) : 317 - 325
  • [26] AN ANALYSIS OF OPTIMAL FRAME RATE IN LOW BIT-RATE VIDEO CODING
    TAKISHIMA, Y
    WADA, M
    MURAKAMI, H
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 1993, E76B (11) : 1389 - 1397
  • [27] Variable frame rate for very low bit-rate video coding
    Guaragnella, C
    Di Sciascio, E
    [J]. MELECON 2000: INFORMATION TECHNOLOGY AND ELECTROTECHNOLOGY FOR THE MEDITERRANEAN COUNTRIES, VOLS 1-3, PROCEEDINGS, 2000, : 503 - 506
  • [28] Special issue on very low bit-rate video coding
    Hatori, Y
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 1996, E79B (10) : 1413 - 1414
  • [29] Spatio-temporal rate allocation for hybrid video coding
    Beermann, M
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 : 222 - 230
  • [30] A NEW CODING METHOD FOR LOW BIT-RATE VIDEO SIGNALS
    TU, G
    VANEYCKEN, L
    OOSTERLINCK, A
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING IV, PTS 1-3, 1989, 1199 : 514 - 521