High Efficiency Deep-learning Based Video Compression

Cited by: 1
Authors
Tang, Lv [1]
Zhang, Xinfeng [1]
Affiliations
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
Keywords
Deep-learning-based video compression; attention mechanism; multi-scale feature extraction; channel selection; recurrent neural network; perceptual image compression; auto-encoder
DOI
10.1145/3661311
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although deep learning has significantly improved image compression, its advantages have not been fully explored in video compression, so the performance of deep-learning-based video compression (DLVC) remains clearly inferior to that of the hybrid video coding framework. In this article, we propose a novel network that improves the performance of DLVC in its most important modules, including [...] second-order attention and multi-scale feature extraction module to fully remove the warping artifacts from the multi-scale feature space and the pixel space, which helps reduce distortion in the subsequent processing. In RC, we propose a channel selection mechanism that gradually drops redundant information while preserving informative channels for better rate-distortion performance. Finally, in FR, we introduce a residual multi-scale recurrent network that improves the quality of the current reconstructed frame by progressively exploiting the temporal context between it and several previously reconstructed frames. Extensive experiments on three widely used video compression datasets (HEVC, UVG, and MCL-JCV) demonstrate the superiority of the proposed approach over state-of-the-art methods.
Pages: 23