Neural texture transfer assisted video coding with adaptive up-sampling

Cited by: 1

Authors:

Yu, Li ^{[1,2]}

Chang, Wenshuai ^{[1,2]}

Quan, Weize ^{[3,4]}

Xiao, Jimin ^{[5]}

Yan, Dong-Ming ^{[3,4]}

Gabbouj, Moncef ^{[1,6]}

Affiliations:

[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China

[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing 100049, Peoples R China

[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[5] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215028, Peoples R China

[6] Tampere Univ, Dept Comp Sci, Tampere, Finland

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2022 / Vol. 107

Funding:

National Natural Science Foundation of China;

Keywords:

High-efficiency video coding (HEVC); Reference-based super-resolution; Low bitrate; Video compression; Deep learning; Machine learning; LEARNING-BASED SUPERRESOLUTION;

DOI:

10.1016/j.image.2022.116754

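Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code: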

0808 ; 0809 ;

Abstract:

Deep learning techniques have been extensively investigated for the purpose of further increasing the efficiency of traditional video compression. Some deep learning techniques for down/up-sampling-based video coding were found to be especially effective when the bandwidth or storage is limited. Existing works mainly differ in the super-resolution models used. Some works simply use a single image super-resolution model, ignoring the rich information in the correlation between video frames, while others explore the correlation between frames by simply concatenating the features across adjacent frames. This, however, may fail when the textures are not well aligned. In this paper, we propose to utilize neural texture transfer which exploits the semantic correlation between frames and is able to explore the correlated information even when the textures are not aligned. Meanwhile, an adaptive group of pictures (GOP) method is proposed to automatically decide whether a frame should be down-sampled or not. Experimental results show that the proposed method outperforms the standard HEVC and state-of-the-art methods under different compression configurations. When compared to standard HEVC, the BD-rate (PSNR) and BD-rate (SSIM) of the proposed method are up to -19.1% and -26.5%, respectively.
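
For readers unfamiliar with the BD-rate figures quoted in the abstract, the following is a minimal sketch of the commonly used Bjontegaard delta-rate calculation: fit a cubic polynomial to log-bitrate as a function of quality for each codec, integrate both fits over the shared quality range, and convert the mean log-rate gap to a percentage. The function name `bd_rate` and the sample rate/quality points are illustrative assumptions, not taken from the paper, and the abstract does not state which exact variant of the metric the authors used. A negative value means the tested codec needs less bitrate than the anchor at equal quality.

```python
import numpy as np


def bd_rate(rate_anchor, quality_anchor, rate_test, quality_test):
    """Bjontegaard delta rate in percent (test codec relative to anchor).

    Hypothetical helper for illustration; `quality_*` may hold PSNR or SSIM
    values measured at the corresponding bitrates.
    """
    log_ra = np.log10(np.asarray(rate_anchor, dtype=float))
    log_rt = np.log10(np.asarray(rate_test, dtype=float))
    qa = np.asarray(quality_anchor, dtype=float)
    qt = np.asarray(quality_test, dtype=float)

    # Cubic fit of log-rate as a function of quality for each codec.
    fit_a = np.polyfit(qa, log_ra, 3)
    fit_t = np.polyfit(qt, log_rt, 3)

    # Integrate only over the quality range covered by both curves.
    lo = max(qa.min(), qt.min())
    hi = min(qa.max(), qt.max())
    int_a = np.polyint(fit_a)
    int_t = np.polyint(fit_t)
    area_a = np.polyval(int_a, hi) - np.polyval(int_a, lo)
    area_t = np.polyval(int_t, hi) - np.polyval(int_t, lo)

    # Mean log-rate difference, converted back to a percentage.
    avg_log_diff = (area_t - area_a) / (hi - lo)
    return (10.0 ** avg_log_diff - 1.0) * 100.0


# Made-up sample points: four rate/PSNR pairs per codec (kbps, dB).
anchor_rates = [1000.0, 2000.0, 4000.0, 8000.0]
anchor_psnr = [30.0, 33.0, 36.0, 39.0]
test_rates = [r / 2.0 for r in anchor_rates]  # same quality at half the rate
test_psnr = list(anchor_psnr)

saving = bd_rate(anchor_rates, anchor_psnr, test_rates, test_psnr)
same = bd_rate(anchor_rates, anchor_psnr, anchor_rates, anchor_psnr)
```

In this toy setup the test codec reaches every quality level at half the anchor bitrate, so `saving` comes out at about -50, while comparing the anchor with itself gives 0.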

Pages: 10
