Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引:0
|
作者
Yang, Gongning [1 ]
Wei, Xiaojie [1 ]
Lin, Hongbin [1 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China
关键词
Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;
D O I
10.1109/LSP.2024.3443516
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.
引用
收藏
页码:2125 / 2129
页数:5
相关论文
共 50 条
  • [41] Neural Multi-scale Image Compression
    Nakanishi, Ken M.
    Maeda, Shin-ichi
    Miyato, Takeru
    Okanohara, Daisuke
    COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 : 718 - 732
  • [42] Event Recognition in Unconstrained Video using Multi-Scale Deep Spatial Features
    Umer, Saiyed
    Ghorai, Mrinmoy
    Mohanta, Partha Pratim
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 286 - 291
  • [43] Multi-frame motion-compensated video compression for the digital set top box
    Girod, B
    Flierl, M
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2002, : 1 - 4
  • [44] PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos
    Jamadandi, Adarsh
    Kotturshettar, Sunidhi
    Mudenagudi, Uma
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [45] A Fast Approach for Video Deblurring using Multi-Scale Deep Neural Network
    Gupta, Rahul Kumar
    Upla, Kishor
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [46] Enhanced Motion Compensation for Deep Video Compression
    Guo, Haifeng
    Kwong, Sam
    Jia, Chuanmin
    Wang, Shiqi
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 673 - 677
  • [47] Multi-scale registration algorithm for alignment of meshes
    Vadde, S
    Kamarthi, SV
    Gupta, SM
    INTELLIGENT MANUFACTURING, 2004, 5263 : 113 - 119
  • [48] Multi-scale Alignment and Positioning System - MAPS
    Fesperman, Ronnie
    Ozturk, Ozkan
    Hocken, Robert
    Ruben, Shalom
    Tsao, Tsu-Chin
    Phipps, James
    Lemmons, Tiffany
    Brien, John
    Caskey, Greg
    PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2012, 36 (04): : 517 - 537
  • [49] An Efficient Video Coding System With an Adaptive Overfitted Multi-Scale Attention Network
    He, Gang
    Wu, Chang
    Xu, Li
    Li, Lei
    Xu, Ziyao
    Xie, Weiying
    Li, Yunsong
    IEEE ACCESS, 2021, 9 : 64022 - 64032
  • [50] Multi-Scale Reconstruction of Crop Canopy
    Lu, Shenglian
    Guo, Xinyu
    Zhao, Chunjiang
    Qian, Tingting
    Wen, Weiliang
    Du, Jianjun
    2012 IEEE FOURTH INTERNATIONAL SYMPOSIUM ON PLANT GROWTH MODELING, SIMULATION, VISUALIZATION AND APPLICATIONS (PMA), 2012, : 262 - 269