Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引:0
|
作者
Yang, Gongning [1 ]
Wei, Xiaojie [1 ]
Lin, Hongbin [1 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China
关键词
Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;
D O I
10.1109/LSP.2024.3443516
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.
引用
收藏
页码:2125 / 2129
页数:5
相关论文
共 50 条
  • [31] An efficient multi-layer reference frame motion estimation for video coding
    Paramkusam, Adapa Venkata
    Reddy, Vustikayala Siva Kumar
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (04) : 645 - 661
  • [32] Multi-scale Guided Image and Video Fusion: A Fast and Efficient Approach
    Bavirisetti, Durga Prasad
    Xiao, Gang
    Zhao, Junhao
    Dhuli, Ravindra
    Liu, Gang
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (12) : 5576 - 5605
  • [33] Efficient Multi-scale Plane Extraction Based RGBD Video Segmentation
    Liu, Hong
    Wang, Jun
    Wang, Xiangdong
    Qian, Yueliang
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 614 - 625
  • [34] Multi-scale Guided Image and Video Fusion: A Fast and Efficient Approach
    Durga Prasad Bavirisetti
    Gang Xiao
    Junhao Zhao
    Ravindra Dhuli
    Gang Liu
    Circuits, Systems, and Signal Processing, 2019, 38 : 5576 - 5605
  • [35] MCTracker: Satellite video multi-object tracking considering inter-frame motion correlation and multi-scale cascaded feature enhancement
    Wang, Bin
    Sui, Haigang
    Ma, Guorui
    Zhou, Yuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 214 : 82 - 103
  • [36] Multi-scale image compression and reconstruction algorithm for structural health monitoring system
    Shen, Wei
    Tian, Xi
    Zeng, Dongyang
    Zhang, Yang
    ENGINEERING STRUCTURES, 2024, 315
  • [37] An Efficient Multi-Scale Modelling Approach for ssDNA Motion in Fluid Flow
    M. Benke
    E. Shapiro
    D. Drikakis
    Journal of Bionic Engineering, 2008, 5 (04) : 299 - 307
  • [38] An Efficient Multi-Scale Modelling Approach for ssDNA Motion in Fluid Flow
    Benke, M.
    Shapiro, E.
    Drikakis, D.
    JOURNAL OF BIONIC ENGINEERING, 2008, 5 (04) : 299 - 307
  • [39] An Efficient Multi-Scale Modelling Approach for ssDNA Motion in Fluid Flow
    M. Benke
    E. Shapiro
    D. Drikakis
    Journal of Bionic Engineering, 2008, 5 : 299 - 307
  • [40] Spatially adaptive wavelet transform for video coding with multi-scale motion compensation
    Mrak, Marta
    Izquierdo, Ebroul
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 881 - 884