Video Coding with Rate-Distortion Optimized Transform

Cited by: 35
Authors
Zhao, Xin [1]
Zhang, Li [2]
Ma, Siwei [2]
Gao, Wen [2]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Inst Digital Media, Beijing 100871, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Directional transform; H.264/AVC; Karhunen-Loeve transform (KLT); mode-dependent directional transform (MDDT); video coding;
DOI
10.1109/TCSVT.2011.2158363
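Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract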
Block-based discrete cosine transform (DCT) has been successfully adopted into several international image/video coding standards, e.g., MPEG-2 and H.264/AVC, as it achieves a good tradeoff between performance and complexity. Although the DCT theoretically approximates the optimum Karhunen-Loeve transform under first-order Markov conditions, one fixed set of transform basis functions (TBF) cannot handle all cases efficiently because of the non-stationary nature of video content. To further improve the performance of block-based transform coding, this paper presents the design of a rate-distortion optimized transform (RDOT) that contributes to both intraframe and interframe coding. The key difference between RDOT and the conventional DCT is that, in the proposed method, the transform is implemented with multiple TBF candidates obtained from off-line training. With this feature, when coding each residual block, the encoder can select the optimal set of TBF in terms of rate-distortion performance, and better energy compaction is achieved in the transform domain. To obtain an optimum group of candidate TBF, we have developed a two-step iterative optimization technique for the off-line training, in which the TBF candidates are refined at each iteration until the training process converges. Moreover, an analysis of the optimal group of candidate TBF is also presented, along with a detailed description of a practical implementation of the proposed algorithm on the latest VCEG key technical area software platform. Extensive experimental results show that, compared with the conventional DCT-based transform scheme adopted in the state-of-the-art H.264/AVC video coding standard, the proposed method achieves significant improvement in coding performance for both intraframe and interframe coding.
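The per-block selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes 1-D blocks of length 4, uses an orthonormal DCT-II and an identity transform as two stand-in candidates (the paper's candidates come from off-line training), and approximates the rate by the number of nonzero quantized coefficients. The function names, `qstep`, and `lam` are hypothetical, and the cost of signalling the chosen candidate is ignored.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II basis; row k holds the k-th basis function."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def identity_matrix(n):
    """Stand-in second candidate (assumption; not a trained basis)."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def forward(basis, block):
    """Project the block onto each basis function."""
    return [sum(b * v for b, v in zip(row, block)) for row in basis]

def inverse(basis, coeffs):
    """For an orthonormal basis the inverse is the transpose."""
    n = len(coeffs)
    return [sum(basis[k][i] * coeffs[k] for k in range(n)) for i in range(n)]

def rd_cost(basis, block, qstep, lam):
    """Lagrangian cost J = D + lam * R for one candidate basis."""
    levels = [round(c / qstep) for c in forward(basis, block)]
    recon = inverse(basis, [level * qstep for level in levels])
    distortion = sum((a - b) ** 2 for a, b in zip(block, recon))
    rate = sum(1 for level in levels if level != 0)  # crude rate proxy
    return distortion + lam * rate

def select_transform(candidates, block, qstep=1.0, lam=1.0):
    """Return the index of the lowest-cost candidate and all costs."""
    costs = [rd_cost(basis, block, qstep, lam) for basis in candidates]
    best = min(range(len(candidates)), key=lambda i: costs[i])
    return best, costs

candidates = [dct_matrix(4), identity_matrix(4)]
flat_choice, flat_costs = select_transform(candidates, [10, 10, 10, 10])
impulse_choice, impulse_costs = select_transform(candidates, [0, 0, 9, 0])
```

In this toy setting the flat block is cheaper under the DCT, which packs it into one coefficient, while the isolated impulse is cheaper under the identity basis. That illustrates why a single fixed basis cannot suit every residual block.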
Pages: 138-151
Page count: 14
Related papers
50 in total
  • [1] Rate-distortion optimized adaptive transform coding
    Lim, Sung-Chang
    Kim, Dae-Yeon
    Jeong, Seyoon
    Choi, Jin Soo
    Choi, Haechul
    Lee, Yung-Lyul
    [J]. OPTICAL ENGINEERING, 2009, 48 (08)
  • [2] Rate-distortion optimized video coding considering frameskip
    Vetro, A
    Wang, Y
    Sun, HF
    [J]. 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2001, : 534 - 537
  • [3] The optimized method of video coding rate control based on rate-distortion
    Li, Xiaohui
    Wang, Li
    [J]. 27TH INTERNATIONAL CONGRESS ON HIGH SPEED PHOTOGRAPHY AND PHOTONICS, PRTS 1-3, 2007, 6279
  • [4] Rate-Distortion Optimized Video Coding Using Automatic Sprites
    Krutz, Andreas
    Glantz, Alexander
    Frater, Michael
    Sikora, Thomas
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (07) : 1309 - 1321
  • [5] Rate-distortion optimized bit allocation for wireless video coding
    Zhang, W
    Zhou, YH
    [J]. PROCEEDINGS OF THE IEEE 6TH CIRCUITS AND SYSTEMS SYMPOSIUM ON EMERGING TECHNOLOGIES: FRONTIERS OF MOBILE AND WIRELESS COMMUNICATION, VOLS 1 AND 2, 2004, : 265 - 268
  • [6] A Rate-Distortion Optimized Coding Method for Region of Interest in Scalable Video Coding
    Wang, Hongtao
    Zhang, Dong
    Li, Houqiang
    [J]. ADVANCES IN MULTIMEDIA, 2015, 2015
  • [7] Rate-distortion optimized motion estimation for error resilient video coding
    Yang, H
    Rose, K
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 173 - 176
  • [8] Rate-distortion optimized video coding with stopping rules: Quality and complexity
    Moecke, M
    Seara, R
    [J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 753 - 756
  • [9] Rate-distortion optimized rate control in 3D subband video coding
    School of Telecommunication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
    [J]. Beijing Youdian Daxue Xuebao, 2007, 3 (79-82+87):
  • [10] Rate-Distortion Optimal Wavelet Packet Transform for Low Bit Rate Video Coding
    Zong, Xiaofei
    Men, Aidong
    Yang, Bo
    [J]. IST: 2009 IEEE INTERNATIONAL WORKSHOP ON IMAGING SYSTEMS AND TECHNIQUES, 2009, : 359 - 363