The Multi-Scale Deep Decoder for the Standard HEVC Bitstreams

被引:11
|
作者
Wang, Tingting
Xiao, Wenhui
Chen, Mingjin
Chao, Hongyang [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
关键词
MOTION ESTIMATION; PREDICTION;
D O I
10.1109/DCC.2018.00028
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As we all know, there is strong multi-scale similarity among video frames. However, almost none of the current video coding standards takes this similarity into consideration. There exist two major problems when utilizing the multi-scale information at encoder-end: one is the extra motion models and the overheads brought by new motion parameters; the other is the extreme increment of the encoding algorithms' complexity. Is it possible to employ the multi-scale similarity only at the decoder-end to improve the decoded videos' quality, i.e., to further boost the coding efficiency? This paper mainly studies how to answer this question by proposing a novel Multi-Scale Deep Decoder (MSDD) for HEVC. Benefiting from the efficiency of deep learning technology (Convolutional Neural Network and Long Short-Term Memory network), MSDD achieves a higher coding efficiency only at the decoder-end without changing any encoding algorithms. Extensive experiments validate the feasibility and effectiveness of MSDD. MSDD leads to on averagely 6.5%, 8.0%, 6.4%, and 6.7% BD-rate reduction compared to HEVC anchor, for AI, LP, LB and RA coding configurations respectively. Especially for the videos with multi-scale similarity, the proposed approach obviously improves the coding efficiency indeed.
引用
下载
收藏
页码:197 / 206
页数:10
相关论文
共 50 条
  • [21] Unsupervised deep homography with multi-scale global attention
    Hu, Wei
    He, Chu
    Lin, Mingyuan
    Zhou, Haoyu
    IET IMAGE PROCESSING, 2023, 17 (10) : 2937 - 2948
  • [22] Deep Learning for Multi-scale Object Detection: A Survey
    Chen K.-Q.
    Zhu Z.-L.
    Deng X.-M.
    Ma C.-X.
    Wang H.-A.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): : 1201 - 1227
  • [23] DEEP MULTI-SCALE ARCHITECTURES FOR MONOCULAR DEPTH ESTIMATION
    Moukari, M.
    Picard, S.
    Simon, L.
    Jurie, F.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2940 - 2944
  • [24] MULTI-SCALE DEEP NETWORKS FOR IMAGE COMPRESSED SENSING
    Shi, Wuzhen
    Jiang, Feng
    Liu, Shaohui
    Zhao, Debin
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 46 - 50
  • [25] A MULTI-SCALE DENSELY DEEP LEARNING METHOD FOR PANSHARPENING
    Xiang, Zhikang
    Xiao, Liang
    Liu, Pengfei
    Zhang, Yufei
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2786 - 2789
  • [26] RGBD deep multi-scale network for background subtraction
    Ihssane Houhou
    Athmane Zitouni
    Yassine Ruichek
    Salah Eddine Bekhouche
    Mohamed Kas
    Abdelmalik Taleb-Ahmed
    International Journal of Multimedia Information Retrieval, 2022, 11 : 395 - 407
  • [27] Rethinking Multi-Scale Representations in Deep Deraining Transformer
    Chen, Hongming
    Chen, Xiang
    Lu, Jiyang
    Li, Yufeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1046 - 1053
  • [28] Multi-Scale Deep Representation Aggregation for Vein Recognition
    Pan, Zaiyu
    Wang, Jun
    Wang, Guoqing
    Zhu, Jihong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 1 - 15
  • [29] Multi-scale Deep Learning for Gesture Detection and Localization
    Neverova, Natalia
    Wolf, Christian
    Taylor, Graham W.
    Nebout, Florian
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 474 - 490
  • [30] MULTI-SCALE ENHANCED DEEP NETWORK FOR ROAD DETECTION
    Lu, Xiaoyan
    Zhong, Yanfei
    Zhao, Ji
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3947 - 3950