Long-term prediction for hierarchical-B-picture-based coding of video with repeated shots

被引:3
|
作者
Zuo, Xu-guang [1 ]
Yu, Lu [1 ]
机构
[1] Zhejiang Univ, Zhejiang Prov Key Lab Informat Proc Commun & Netw, Inst Informat & Commun Engn, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
High Efficiency Video Coding (HEVC); Long-term temporal correlation; Long-term prediction; Hierarchical B-picture structure; REFERENCE FRAME SELECTION;
D O I
10.1631/FITEE.1601552
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The latest video coding standard High Efficiency Video Coding (HEVC) can achieve much higher coding efficiency than previous video coding standards. Particularly, by exploiting the hierarchical B-picture prediction structure, temporal redundancy among neighbor frames is eliminated remarkably well. In practice, videos available to consumers usually contain many repeated shots, such as TV series, movies, and talk shows. According to our observations, when these videos are encoded by HEVC with the hierarchical B-picture structure, the temporal correlation in each shot is well exploited. However, the long-term correlation between repeated shots has not been used. We propose a long-term prediction (LTP) scheme to use the long-term temporal correlation between correlated shots in a video. The long-term reference (LTR) frames of a source video are chosen by clustering similar shots and extracting the representative frames, and a modified hierarchical B-picture coding structure based on an LTR frame is introduced to support long-term temporal prediction. An adaptive quantization method is further designed for LTR frames to improve the overall video coding efficiency. Experimental results show that up to 22.86% coding gain can be achieved using the new coding scheme.
引用
收藏
页码:459 / 470
页数:12
相关论文
共 50 条
  • [41] A Highly-Reliable Buffer Strategy Based on Long-Term Throughput Prediction for Mobile Video Streaming
    Kanai, Kenji
    Hidenori, Konishi
    Ishizu, Yuya
    Katto, Jiro
    2015 12TH ANNUAL IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, 2015, : 677 - 682
  • [42] Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need
    Chao, Fang-Yi
    Ozcinar, Cagri
    Smolic, Aljosa
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [43] Long-Term Trajectory Prediction Model Based on Transformer
    Tong, Qiang
    Hu, Jinqing
    Chen, Yuli
    Guo, Dongdong
    Liu, Xiulei
    IEEE ACCESS, 2023, 11 : 143695 - 143703
  • [44] A self-attention sequential model for long-term prediction of video streams
    Ge, Yunfeng
    Li, Hongyan
    Shi, Keyi
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (03): : 88 - 102
  • [45] Towards Long-Term Coverage and Video Users Satisfaction Prediction in Cellular Networks
    Pimpinella, Andrea
    Redondi, Alessandro E. C.
    Galimberti, Iacopo
    Foglia, Francesco
    Venturini, Luisa
    PROCEEDINGS OF THE 2019 12TH IFIP WIRELESS AND MOBILE NETWORKING CONFERENCE (WMNC 2019), 2019, : 146 - 153
  • [46] Long-term memory motion-compensated prediction for robust video transmission
    Wiegand, T
    Färber, N
    Stuhlmüller, K
    Girod, B
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 152 - 155
  • [47] An Efficient Inter-Prediction Mode Decision Method for Temporal Scalability Coding With Hierarchical B-Picture Structure
    Lee, Bumshik
    Kim, Munchurl
    IEEE TRANSACTIONS ON BROADCASTING, 2012, 58 (02) : 285 - 290
  • [48] KEMP: Keyframe-Based Hierarchical End-to-End Deep Model for Long-Term Trajectory Prediction
    Lu, Qiujing
    Han, Weiqiao
    Ling, Jeffrey
    Wang, Minfa
    Chen, Haoyu
    Varadarajan, Balakrishnan
    Covington, Paul
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
  • [49] LSAN: Modeling Long-term Dependencies and Short-term Correlations with Hierarchical Attention for Risk Prediction
    Ye, Muchao
    Luo, Junyu
    Xiao, Cao
    Ma, Fenglong
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1753 - 1762
  • [50] HPRNN: A HIERARCHICAL SEQUENCE PREDICTION MODEL FOR LONG-TERM WEATHER RADAR ECHO EXTRAPOLATION
    Jing, Jinrui
    Li, Qian
    Peng, Xuan
    Ma, Qiang
    Tang, Shaoen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4142 - 4146