Neural texture transfer assisted video coding with adaptive up-sampling

Authors
Yu, Li [1, 2]
Chang, Wenshuai [1, 2]
Quan, Weize [3, 4]
Xiao, Jimin [5]
Yan, Dong-Ming [3, 4]
Gabbouj, Moncef [1, 6]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing 100049, Peoples R China
[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[5] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215028, Peoples R China
[6] Tampere Univ, Dept Comp Sci, Tampere, Finland
Funding
National Natural Science Foundation of China
Keywords
High-efficiency video coding (HEVC); Reference-based super-resolution; Low bitrate; Video compression; Deep learning; Machine learning; LEARNING-BASED SUPERRESOLUTION;
DOI
10.1016/j.image.2022.116754
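Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract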
Deep learning techniques have been extensively investigated as a way to further increase the efficiency of traditional video compression. Deep learning techniques for down/up-sampling-based video coding have been found to be especially effective when bandwidth or storage is limited. Existing works differ mainly in the super-resolution model they use. Some simply apply a single-image super-resolution model, ignoring the rich information in the correlation between video frames, while others exploit that correlation by concatenating features across adjacent frames. The latter approach, however, may fail when the textures are not well aligned. In this paper, we propose to use neural texture transfer, which exploits the semantic correlation between frames and can recover correlated information even when the textures are not aligned. In addition, an adaptive group of pictures (GOP) method is proposed to decide automatically whether a frame should be down-sampled. Experimental results show that the proposed method outperforms standard HEVC and state-of-the-art methods under different compression configurations. Compared to standard HEVC, the BD-rate (PSNR) and BD-rate (SSIM) of the proposed method reach up to -19.1% and -26.5%, respectively.
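For readers unfamiliar with the BD-rate figures quoted in the abstract, the following is a minimal sketch of how a Bjøntegaard delta rate is commonly computed from four rate-distortion points per codec. It is not the authors' evaluation code; the function names and the sample bitrates and PSNR values are hypothetical and chosen only for illustration. The sketch interpolates log-bitrate as a cubic in PSNR for each codec and averages the difference over the overlapping PSNR range. A negative result means the test codec needs fewer bits for the same quality.

```python
import math


def lagrange_eval(xs, ys, x):
    """Evaluate at x the polynomial that interpolates the points (xs, ys)."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def mean_log_rate(psnr, rate, lo, hi):
    """Mean of the cubic log-rate curve over [lo, hi].

    Simpson's rule is exact for cubic polynomials, so three
    evaluations give the exact average of the interpolant.
    """
    log_rate = [math.log(r) for r in rate]
    mid = (lo + hi) / 2.0
    return (
        lagrange_eval(psnr, log_rate, lo)
        + 4.0 * lagrange_eval(psnr, log_rate, mid)
        + lagrange_eval(psnr, log_rate, hi)
    ) / 6.0


def bd_rate(anchor_rate, anchor_psnr, test_rate, test_psnr):
    """Average bitrate change (in percent) of test versus anchor."""
    for seq in (anchor_rate, anchor_psnr, test_rate, test_psnr):
        if len(seq) != 4:
            raise ValueError("this sketch expects exactly four RD points")
    # Compare only over the PSNR interval covered by both curves.
    lo = max(min(anchor_psnr), min(test_psnr))
    hi = min(max(anchor_psnr), max(test_psnr))
    if hi <= lo:
        raise ValueError("PSNR ranges do not overlap")
    diff = mean_log_rate(test_psnr, test_rate, lo, hi) - mean_log_rate(
        anchor_psnr, anchor_rate, lo, hi
    )
    return (math.exp(diff) - 1.0) * 100.0


# Hypothetical RD points (kbps, dB); not taken from the paper.
anchor_rate = [1000.0, 2000.0, 4000.0, 8000.0]
anchor_psnr = [32.0, 35.0, 38.0, 40.0]
test_rate = [0.8 * r for r in anchor_rate]  # same quality at 80% of the bits
test_psnr = list(anchor_psnr)

print(round(bd_rate(anchor_rate, anchor_psnr, test_rate, test_psnr), 2))
```

In this toy input the test codec reaches each PSNR value with 20% fewer bits, so the sketch reports a BD-rate of about -20%. A BD-rate (SSIM) would follow the same procedure with SSIM in place of PSNR as the quality axis.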
Pages: 10