The Multi-Scale Deep Decoder for the Standard HEVC Bitstreams

被引:11
|
作者
Wang, Tingting
Xiao, Wenhui
Chen, Mingjin
Chao, Hongyang [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
关键词
MOTION ESTIMATION; PREDICTION;
D O I
10.1109/DCC.2018.00028
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As we all know, there is strong multi-scale similarity among video frames. However, almost none of the current video coding standards takes this similarity into consideration. There exist two major problems when utilizing the multi-scale information at encoder-end: one is the extra motion models and the overheads brought by new motion parameters; the other is the extreme increment of the encoding algorithms' complexity. Is it possible to employ the multi-scale similarity only at the decoder-end to improve the decoded videos' quality, i.e., to further boost the coding efficiency? This paper mainly studies how to answer this question by proposing a novel Multi-Scale Deep Decoder (MSDD) for HEVC. Benefiting from the efficiency of deep learning technology (Convolutional Neural Network and Long Short-Term Memory network), MSDD achieves a higher coding efficiency only at the decoder-end without changing any encoding algorithms. Extensive experiments validate the feasibility and effectiveness of MSDD. MSDD leads to on averagely 6.5%, 8.0%, 6.4%, and 6.7% BD-rate reduction compared to HEVC anchor, for AI, LP, LB and RA coding configurations respectively. Especially for the videos with multi-scale similarity, the proposed approach obviously improves the coding efficiency indeed.
引用
收藏
页码:197 / 206
页数:10
相关论文
共 50 条
  • [31] MULTI-SCALE ENHANCED DEEP NETWORK FOR ROAD DETECTION
    Lu, Xiaoyan
    Zhong, Yanfei
    Zhao, Ji
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3947 - 3950
  • [32] Multi-scale digital holographic reconstruction with deep learning
    Wang, Huaying
    Li, Qiwen
    Wang, Shuo
    Men, Gaofu
    APPLIED OPTICS, 2025, 64 (07)
  • [33] Multi-scale Deep Learning for Gesture Detection and Localization
    Neverova, Natalia
    Wolf, Christian
    Taylor, Graham W.
    Nebout, Florian
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 474 - 490
  • [34] Understanding Mobility via Deep Multi-Scale Learning
    Zhang, Rui
    Xie, Peng
    Wang, Chen
    Liu, Gaoyang
    Wan, Shaohua
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 487 - 494
  • [35] Multi-scale digital soil mapping with deep learning
    Behrens, Thorsten
    Schmidt, Karsten
    MacMillan, Robert A.
    Rossel, Raphael A. Viscarra
    SCIENTIFIC REPORTS, 2018, 8
  • [36] Multi-scale volumes for deep object detection and localization
    Ohn-Bar, Eshed
    Trivedi, Mohan Manubhai
    PATTERN RECOGNITION, 2017, 61 : 557 - 572
  • [37] Multi-scale Deep Representation Learning for Face Detection
    Han, Jifei
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [38] Multi-scale Pyramid Pooling for Deep Convolutional Representation
    Yoo, Donggeun
    Park, Sunggyun
    Lee, Joon-Young
    Kweon, In So
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [39] Multi-Scale Deep Subspace Clustering With Discriminative Learning
    Wang, Jiao
    Wu, Bin
    Ren, Zhenwen
    Zhou, Yunhui
    IEEE ACCESS, 2022, 10 : 91283 - 91293
  • [40] MULTI-SCALE DEEP RESIDUAL LEARNING FOR CLOUD REMOVAL
    Yang, Qiaoqiao
    Wang, Guangxing
    Zhao, Yaxuan
    Zhang, Xiaoyu
    Dong, Guoshuai
    Ren, Peng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 4967 - 4970