Long-term prediction for hierarchical-B-picture-based coding of video with repeated shots

被引:0
|
作者
Xu-guang Zuo
Lu Yu
机构
[1] Zhejiang University,Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN), Institute of Information and Communication Engineering
关键词
High Efficiency Video Coding (HEVC); Long-term temporal correlation; Long-term prediction; Hierarchical B-picture structure; TN919.8;
D O I
暂无
中图分类号
学科分类号
摘要
The latest video coding standard High Efficiency Video Coding (HEVC) can achieve much higher coding efficiency than previous video coding standards. Particularly, by exploiting the hierarchical B-picture prediction structure, temporal redundancy among neighbor frames is eliminated remarkably well. In practice, videos available to consumers usually contain many repeated shots, such as TV series, movies, and talk shows. According to our observations, when these videos are encoded by HEVC with the hierarchical B-picture structure, the temporal correlation in each shot is well exploited. However, the long-term correlation between repeated shots has not been used. We propose a long-term prediction (LTP) scheme to use the long-term temporal correlation between correlated shots in a video. The long-term reference (LTR) frames of a source video are chosen by clustering similar shots and extracting the representative frames, and a modified hierarchical B-picture coding structure based on an LTR frame is introduced to support long-term temporal prediction. An adaptive quantization method is further designed for LTR frames to improve the overall video coding efficiency. Experimental results show that up to 22.86% coding gain can be achieved using the new coding scheme.
引用
收藏
页码:459 / 470
页数:11
相关论文
共 50 条
  • [21] Learning to Generate Long-term Future via Hierarchical Prediction
    Villegas, Ruben
    Yang, Jimei
    Zou, Yuliang
    Sohn, Sungryull
    Lin, Xunyu
    Lee, Honglak
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [22] Rate control of hierarchical B prediction structure for multi-view video coding
    Lei, Jianjun
    Feng, Kun
    Wu, Meimin
    Li, Shuai
    Hou, Chunping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (01) : 825 - 842
  • [23] Rate control of hierarchical B prediction structure for multi-view video coding
    Jianjun Lei
    Kun Feng
    Meimin Wu
    Shuai Li
    Chunping Hou
    Multimedia Tools and Applications, 2014, 72 : 825 - 842
  • [24] Scene-library-based video coding scheme exploiting long-term temporal correlation
    Zuo, Xuguang
    Yu, Lu
    Yu, Hualong
    Mao, Jue
    Zhao, Yin
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (04)
  • [25] Improved Temporal scalable video coding based on low-delay hierarchical dual reference P-picture prediction structure
    Tian, Gang
    Hu, RuiMin
    Liu, Qiong
    Wang, ZhongYuan
    2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 433 - +
  • [26] Long-Term Background Redundancy Reduction for Earth Observatory Video Coding
    Wang, Xu
    Hu, Ruimin
    Wang, Zhongyuan
    Xiao, Jing
    Satoh, Shin'ichi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4309 - 4320
  • [27] Improved video coding using long-term global motion compensation
    Smolic, A
    Vatis, Y
    Schwarz, H
    Kauff, P
    Gölz, U
    Wiegand, T
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2004, PTS 1 AND 2, 2004, 5308 : 343 - 354
  • [28] Hierarchical-P Reference Picture Selection Based Error Resilient Video Coding Framework for High Efficiency Video Coding Transmission Applications
    Maung, Htoo Maung
    Aramvith, Supavadee
    Miyanaga, Yoshikazu
    ELECTRONICS, 2019, 8 (03):
  • [29] PhyLoNet: Physically-Constrained Long-Term Video Prediction
    Ben Zikri, Nir
    Sharf, Andrei
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 570 - 587
  • [30] The Bidirectional-Based SRMC for Hierarchical B Frame in Scalable Video Coding
    Lee, Yu-Xuan
    Liu, Hsing-Chuang
    Tsai, Tsung-Han
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2, 2008, : 908 - 911