The Multi-Scale Deep Decoder for the Standard HEVC Bitstreams

被引：11

作者：

Wang, Tingting

Xiao, Wenhui

Chen, Mingjin

Chao, Hongyang ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China

来源：

2018 DATA COMPRESSION CONFERENCE (DCC 2018) | 2018年

关键词：

MOTION ESTIMATION; PREDICTION;

D O I：

10.1109/DCC.2018.00028

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

As we all know, there is strong multi-scale similarity among video frames. However, almost none of the current video coding standards takes this similarity into consideration. There exist two major problems when utilizing the multi-scale information at encoder-end: one is the extra motion models and the overheads brought by new motion parameters; the other is the extreme increment of the encoding algorithms' complexity. Is it possible to employ the multi-scale similarity only at the decoder-end to improve the decoded videos' quality, i.e., to further boost the coding efficiency? This paper mainly studies how to answer this question by proposing a novel Multi-Scale Deep Decoder (MSDD) for HEVC. Benefiting from the efficiency of deep learning technology (Convolutional Neural Network and Long Short-Term Memory network), MSDD achieves a higher coding efficiency only at the decoder-end without changing any encoding algorithms. Extensive experiments validate the feasibility and effectiveness of MSDD. MSDD leads to on averagely 6.5%, 8.0%, 6.4%, and 6.7% BD-rate reduction compared to HEVC anchor, for AI, LP, LB and RA coding configurations respectively. Especially for the videos with multi-scale similarity, the proposed approach obviously improves the coding efficiency indeed.

引用

下载

页码：197 / 206

页数：10

共 50 条

[21] Unsupervised deep homography with multi-scale global attention
Hu, Wei
He, Chu
Lin, Mingyuan
Zhou, Haoyu
IET IMAGE PROCESSING, 2023, 17 (10) : 2937 - 2948
[22] Deep Learning for Multi-scale Object Detection: A Survey
Chen K.-Q.
Zhu Z.-L.
Deng X.-M.
Ma C.-X.
Wang H.-A.
Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): : 1201 - 1227
[23] DEEP MULTI-SCALE ARCHITECTURES FOR MONOCULAR DEPTH ESTIMATION
Moukari, M.
Picard, S.
Simon, L.
Jurie, F.
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2940 - 2944
[24] MULTI-SCALE DEEP NETWORKS FOR IMAGE COMPRESSED SENSING
Shi, Wuzhen
Jiang, Feng
Liu, Shaohui
Zhao, Debin
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 46 - 50
[25] A MULTI-SCALE DENSELY DEEP LEARNING METHOD FOR PANSHARPENING
Xiang, Zhikang
Xiao, Liang
Liu, Pengfei
Zhang, Yufei
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2786 - 2789
[26] RGBD deep multi-scale network for background subtraction
Ihssane Houhou
Athmane Zitouni
Yassine Ruichek
Salah Eddine Bekhouche
Mohamed Kas
Abdelmalik Taleb-Ahmed
International Journal of Multimedia Information Retrieval, 2022, 11 : 395 - 407
[27] Rethinking Multi-Scale Representations in Deep Deraining Transformer
Chen, Hongming
Chen, Xiang
Lu, Jiyang
Li, Yufeng
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1046 - 1053
[28] Multi-Scale Deep Representation Aggregation for Vein Recognition
Pan, Zaiyu
Wang, Jun
Wang, Guoqing
Zhu, Jihong
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 1 - 15
[29] Multi-scale Deep Learning for Gesture Detection and Localization
Neverova, Natalia
Wolf, Christian
Taylor, Graham W.
Nebout, Florian
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 474 - 490
[30] MULTI-SCALE ENHANCED DEEP NETWORK FOR ROAD DETECTION
Lu, Xiaoyan
Zhong, Yanfei
Zhao, Ji
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3947 - 3950

← 1 2 3 4 5 →