Deep video compression based on Long-range Temporal Context Learning

被引:0
|
作者
Wu, Kejun [1 ]
Li, Zhenxing [1 ]
Yang, You [1 ]
Liu, Qiong [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
关键词
Deep learning; Video compression; Computational photography; Temporal context learning;
D O I
10.1016/j.cviu.2024.104127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video compression allows for efficient storage and transmission of data, benefiting imaging and vision applications, e.g. computational imaging, photography, and displays by delivering high-quality videos. To exploit more informative contexts of video, we propose DVCL, a novel D eep V ideo C ompression based on L ong-range Temporal Context Learning. Aiming at high coding performance, this new compression paradigm makes full use of long-range temporal correlations derived from multiple reference frames to learn richer contexts. Motion vectors (MVs) are estimated to represent the motion relations of videos. By employing MVs, a long-range temporal context learning (LTCL) module is presented to extract context information from multiple reference frames, such that a more accurate and informative temporal contexts can be learned and constructed. The long-range temporal contexts serve as conditions and generate the predicted frames by contextual encoder and decoder. To address the challenge of imbalanced training, we develop a multi-stage training strategy to ensure the whole DVCL framework is trained progressively and stably. Extensive experiments demonstrate the proposed DVCL achieves the highest objective and subjective quality, while maintaining relatively low complexity. Specifically, 25.30% and 45.75% bitrate savings on average can be obtained than x265 codec at the same PSNR and MS-SSIM, respectively.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] High Efficiency Deep-learning Based Video Compression
    Tang, Lv
    Zhang, Xinfeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
  • [32] ECG-based cardiac arrhythmias detection through ensemble learning and fusion of deep spatial-temporal and long-range dependency features
    Din, Sadia
    Qaraqe, Marwa
    Mourad, Omar
    Qaraqe, Khalid
    Serpedin, Erchin
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 150
  • [33] Long-Range Temporal Correlations in Kinetic Roughening
    Xia, Hui
    Tang, Gang
    Lan, Yueheng
    JOURNAL OF STATISTICAL PHYSICS, 2020, 178 (03) : 800 - 813
  • [34] Interval timing by long-range temporal integration
    Simen, Patrick
    Balci, Fuat
    deSouza, Laura
    Cohen, Jonathan D.
    Holmes, Philip
    FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2011, 5
  • [35] Long-Range Temporal Correlations in Kinetic Roughening
    Hui Xia
    Gang Tang
    Yueheng Lan
    Journal of Statistical Physics, 2020, 178 : 800 - 813
  • [36] Fast Path Planning for Long-Range Planetary Roving Based on a Hierarchical Framework and Deep Reinforcement Learning
    Hu, Ruijun
    Zhang, Yulin
    AEROSPACE, 2022, 9 (02)
  • [37] Development of a Long-Range Hydrological Drought Prediction Framework Using Deep Learning
    Mohd Imran Khan
    Rajib Maity
    Water Resources Management, 2024, 38 : 1497 - 1509
  • [38] Online learning of long-range dependencies
    Zucchet, Nicolas
    Meier, Robert
    Schug, Simon
    Mujika, Asier
    Sacramento, Joao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [39] Active Learning With Long-Range Observation
    Lee, Jiho
    Kim, Eunwoo
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1990 - 1994
  • [40] Development of a Long-Range Hydrological Drought Prediction Framework Using Deep Learning
    Khan, Mohd Imran
    Maity, Rajib
    WATER RESOURCES MANAGEMENT, 2024, 38 (04) : 1497 - 1509