The Multi-Scale Deep Decoder for the Standard HEVC Bitstreams

被引:11
|
作者
Wang, Tingting
Xiao, Wenhui
Chen, Mingjin
Chao, Hongyang [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
关键词
MOTION ESTIMATION; PREDICTION;
D O I
10.1109/DCC.2018.00028
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As we all know, there is strong multi-scale similarity among video frames. However, almost none of the current video coding standards takes this similarity into consideration. There exist two major problems when utilizing the multi-scale information at encoder-end: one is the extra motion models and the overheads brought by new motion parameters; the other is the extreme increment of the encoding algorithms' complexity. Is it possible to employ the multi-scale similarity only at the decoder-end to improve the decoded videos' quality, i.e., to further boost the coding efficiency? This paper mainly studies how to answer this question by proposing a novel Multi-Scale Deep Decoder (MSDD) for HEVC. Benefiting from the efficiency of deep learning technology (Convolutional Neural Network and Long Short-Term Memory network), MSDD achieves a higher coding efficiency only at the decoder-end without changing any encoding algorithms. Extensive experiments validate the feasibility and effectiveness of MSDD. MSDD leads to on averagely 6.5%, 8.0%, 6.4%, and 6.7% BD-rate reduction compared to HEVC anchor, for AI, LP, LB and RA coding configurations respectively. Especially for the videos with multi-scale similarity, the proposed approach obviously improves the coding efficiency indeed.
引用
下载
收藏
页码:197 / 206
页数:10
相关论文
共 50 条
  • [1] The Interpretable Fast Multi-Scale Deep Decoder for the Standard HEVC Bitstreams
    Xiao, Wenhui
    He, Huiguo
    Wang, Tingting
    Chao, Hongyang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1680 - 1691
  • [2] Multi-scale deep encoder-decoder network for salient object detection
    Ren, Qinghua
    Hu, Renjie
    NEUROCOMPUTING, 2018, 316 : 95 - 104
  • [3] Multi-scale Transformer with Decoder for Image Quality Assessment
    Zhang, Shuai
    Liu, Yutao
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 220 - 231
  • [4] SAR IMAGES ENHANCEMENT VIA DEEP MULTI-SCALE ENCODER-DECODER NEURAL NETWORK
    Yang, Xiaqing
    Zhou, Yuanyuan
    Wang, Chen
    Shi, Jun
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3368 - 3371
  • [5] Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks
    Sun, Mengyuan
    Guo, Runhua
    Zhu, Jinhui
    Fan, Wenhui
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 869 - 874
  • [6] Multi-scale Deep Nearest Neighbors
    Chauhan, Abhijeet
    Davoudi, Omid
    Komeili, Majid
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] Multi-Scale Deep Compressive Imaging
    Canh, Thuong Nguyen
    Jeon, Byeungwoo
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2021, 7 : 86 - 97
  • [8] Steganalysis for HEVC video based on multi-scale residual convolution network
    Zhang M.
    Li Z.
    Liu J.
    Zhang Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (11): : 2226 - 2233
  • [9] A Multi-scale Edge Detection Method Based on Encoder-Decoder
    Tian, An-Lin
    Lei, Wei-Min
    Zhang, Peng
    Zhang, Wei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (07): : 936 - 943
  • [10] MUSTER: A Multi-Scale Transformer-Based Decoder for Semantic Segmentation
    Xu, Jing
    Shi, Wentao
    Gao, Pan
    Li, Qizhu
    Wang, Zhengwei
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,