Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引：0

作者：

Yang, Gongning ^{[1
]}

Wei, Xiaojie ^{[1
]}

Lin, Hongbin ^{[1
]}

机构：

[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;

D O I：

10.1109/LSP.2024.3443516

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.

引用

页码：2125 / 2129

页数：5

共 50 条

[41] Neural Multi-scale Image Compression
Nakanishi, Ken M.
Maeda, Shin-ichi
Miyato, Takeru
Okanohara, Daisuke
COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 : 718 - 732
[42] Event Recognition in Unconstrained Video using Multi-Scale Deep Spatial Features
Umer, Saiyed
Ghorai, Mrinmoy
Mohanta, Partha Pratim
2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 286 - 291
[43] Multi-frame motion-compensated video compression for the digital set top box
Girod, B
Flierl, M
2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2002, : 1 - 4
[44] PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos
Jamadandi, Adarsh
Kotturshettar, Sunidhi
Mudenagudi, Uma
ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
[45] A Fast Approach for Video Deblurring using Multi-Scale Deep Neural Network
Gupta, Rahul Kumar
Upla, Kishor
2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
[46] Enhanced Motion Compensation for Deep Video Compression
Guo, Haifeng
Kwong, Sam
Jia, Chuanmin
Wang, Shiqi
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 673 - 677
[47] Multi-scale registration algorithm for alignment of meshes
Vadde, S
Kamarthi, SV
Gupta, SM
INTELLIGENT MANUFACTURING, 2004, 5263 : 113 - 119
[48] Multi-scale Alignment and Positioning System - MAPS
Fesperman, Ronnie
Ozturk, Ozkan
Hocken, Robert
Ruben, Shalom
Tsao, Tsu-Chin
Phipps, James
Lemmons, Tiffany
Brien, John
Caskey, Greg
PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2012, 36 (04): : 517 - 537
[49] An Efficient Video Coding System With an Adaptive Overfitted Multi-Scale Attention Network
He, Gang
Wu, Chang
Xu, Li
Li, Lei
Xu, Ziyao
Xie, Weiying
Li, Yunsong
IEEE ACCESS, 2021, 9 : 64022 - 64032
[50] Multi-Scale Reconstruction of Crop Canopy
Lu, Shenglian
Guo, Xinyu
Zhao, Chunjiang
Qian, Tingting
Wen, Weiliang
Du, Jianjun
2012 IEEE FOURTH INTERNATIONAL SYMPOSIUM ON PLANT GROWTH MODELING, SIMULATION, VISUALIZATION AND APPLICATIONS (PMA), 2012, : 262 - 269

← 1 2 3 4 5 →