Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

Cited by: 0
Authors
Yang, Gongning [1]
Wei, Xiaojie [1]
Lin, Hongbin [1]
Affiliations
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China
Keywords
Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;
DOI
10.1109/LSP.2024.3443516
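Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes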
0808 ; 0809 ;
Abstract
As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.
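The abstract does not say how the feature scaling technique works. As a generic illustration only, and not MFVC's actual design, the sketch below shows the common idea of multiplying features by a scale factor before rounding: a larger factor gives finer quantization, so the rate goes up and the distortion goes down. The toy feature values, the function names, and the use of empirical entropy as a stand-in for bitrate are all assumptions made for this example.

```python
import math
from collections import Counter


def quantize(features, scale):
    """Multiply by the scale factor, then round to integer symbols."""
    return [round(f * scale) for f in features]


def dequantize(symbols, scale):
    """Undo the scaling on the decoder side."""
    return [s / scale for s in symbols]


def empirical_entropy_bits(symbols):
    """Empirical entropy in bits per symbol, used here as a rough rate proxy."""
    counts = Counter(symbols)
    n = len(symbols)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Hypothetical 1-D "features"; a real codec would use learned latent tensors.
features = [3.0 * math.sin(0.37 * i) for i in range(256)]

# Sweep the scale factor and record (scale, rate proxy, distortion).
results = []
for scale in (0.5, 1.0, 2.0, 4.0):
    symbols = quantize(features, scale)
    recon = dequantize(symbols, scale)
    results.append((scale, empirical_entropy_bits(symbols), mse(features, recon)))

for scale, rate, dist in results:
    print(f"scale={scale}: ~{rate:.2f} bits/symbol, MSE={dist:.4f}")
```

Because the scale factor can take any positive value, sweeping it moves the operating point continuously along the rate-distortion trade-off, which is one plausible reading of "smooth rate adjustment" in the abstract.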
Pages: 2125-2129
Page count: 5