High Efficiency Deep-learning Based Video Compression

Cited by: 1
Authors
Tang, Lv [1]
Zhang, Xinfeng [1]
Affiliations
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
Keywords
Deep-learning-based video compression; attention mechanism; multi-scale feature extraction; channel selection; recurrent neural network; PERCEPTUAL IMAGE COMPRESSION; AUTO-ENCODER
DOI
10.1145/3661311
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although deep learning has significantly improved image compression, its advantages have not been fully explored in video compression, so the performance of deep-learning-based video compression (DLVC) remains clearly inferior to that of the hybrid video coding framework. In this article, we propose a novel network that improves DLVC performance through its most important modules, including [...] second-order attention and multi-scale feature extraction module to fully remove the warping artifacts from the multi-scale feature space and pixel space, which helps reduce distortion in subsequent processing. In RC, we propose a channel selection mechanism that gradually drops redundant information while preserving informative channels for better rate-distortion performance. Finally, in FR, we introduce a residual multi-scale recurrent network that improves the quality of the current reconstructed frame by progressively exploiting the temporal context between it and several previously reconstructed frames. Extensive experiments on three widely used video compression datasets (HEVC, UVG, and MCL-JCV) demonstrate the superiority of the proposed approach over state-of-the-art methods.
Pages: 23