High Efficiency Deep-learning Based Video Compression

Cited by: 1

Authors:

Tang, Lv [1]

Zhang, Xinfeng [1]

Affiliations:

[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China

Source:

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024, Vol. 20, No. 8

Keywords:

Deep-learning-based video compression; attention mechanism; multi-scale feature extraction; channel selection; recurrent neural network; PERCEPTUAL IMAGE COMPRESSION; AUTO-ENCODER

DOI:

10.1145/3661311

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Although deep learning techniques have achieved significant improvements in image compression, their advantages have not been fully explored in video compression, so the performance of deep-learning-based video compression (DLVC) remains clearly inferior to that of the hybrid video coding framework. In this article, we propose a novel network to improve the performance of DLVC from its most important modules, including [...] second-order attention and multi-scale feature extraction module to fully remove the warping artifacts from multi-scale feature space and pixel space, which can help reduce the distortion in the following process. In RC, we propose a channel selection mechanism that gradually drops redundant information while preserving informative channels for better rate-distortion performance. Finally, in FR, we introduce a residual multi-scale recurrent network that improves the quality of the current reconstructed frame by progressively exploiting the temporal context between it and several previously reconstructed frames. Extensive experiments on three widely used video compression datasets (HEVC, UVG, and MCL-JCV) demonstrate the superiority of the proposed approach over state-of-the-art methods.

Pages: 23
