Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引：0

作者：

Yang, Gongning ^{[1
]}

Wei, Xiaojie ^{[1
]}

Lin, Hongbin ^{[1
]}

机构：

[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;

D O I：

10.1109/LSP.2024.3443516

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.

引用

页码：2125 / 2129

页数：5

共 50 条

[1] Multi-Scale Warping for Video Frame Interpolation
Choi, Whan
Koh, Yeong Jun
Kim, Chang-Su
IEEE ACCESS, 2021, 9 : 150470 - 150479
[2] Multi-scale Clustering of Frame-to-Frame Correspondences for Motion Segmentation
Dragon, Ralf
Rosenhahn, Bodo
Ostermann, Joern
COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 : 445 - 458
[3] Multi-scale Intermediate Flow Estimation for Video Frame Interpolation
Fan, Zehua
Zhu, Feng
Li, Lei
Tan, Xiaoyang
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 893 - 900
[4] Video frame interpolation via spatial multi-scale modelling
Qu, Zhe
Liu, Weijing
Cui, Lizhen
Yang, Xiaohui
IET COMPUTER VISION, 2024, 18 (04) : 458 - 472
[5] Multi-Scale Video Inverse Tone Mapping with Deformable Alignment
Zou, Jiaqi
Mei, Ke
Sun, Songlin
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 9 - 12
[6] An Efficient Multi-Scale Attention Feature Fusion Network for 4K Video Frame Interpolation
Ning, Xin
Li, Yuhang
Feng, Ziwei
Liu, Jinhua
Ding, Youdong
ELECTRONICS, 2024, 13 (06)
[7] Multi-scale digital holographic reconstruction with deep learning
Wang, Huaying
Li, Qiwen
Wang, Shuo
Men, Gaofu
APPLIED OPTICS, 2025, 64 (07)
[8] A Multi-Scale Position Feature Transform Network for Video Frame Interpolation
Cheng, Xianhang
Chen, Zhenzhong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3968 - 3981
[9] Deep frame interpolation for video compression
Begaint, Jean
Galpin, Franck
Guillotel, Philippe
Guillemot, Christine
2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 556 - 556
[10] Video Frame Interpolation via Multi-scale Expandable Deformable Convolution
Zhang, Dengyong
Huang, Pu
Ding, Xiangling
Li, Feng
Yang, Gaobo
PROCEEDINGS OF THE 2023 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2023, 2023, : 19 - 28

← 1 2 3 4 5 →