Textural Detail Preservation Network for Video Frame Interpolation

被引：0

作者：

Yoon, Kihwan ^{[1
,2
]}

Huh, Jingang ^{[1
]}

Kim, Yong Han ^{[2
]}

Kim, Sungjei ^{[1
]}

Jeong, Jinwoo ^{[1
]}

机构：

[1] Korea Elect Technol Inst KETI, Seongnam Si 13488, Gyeonggi Do, South Korea

[2] Univ Seoul, Sch Elect & Comp Engn, Seoul 02504, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Video frame interpolation; textural detail preservation; perceptual loss; synthesis network; ENHANCEMENT;

D O I：

10.1109/ACCESS.2023.3294964

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The subjective image quality of the Video Frame Interpolation (VFI) result depends on whether image features such as edges, textures and blobs are preserved. With the development of deep learning, various algorithms have been proposed and the objective results of VFI have significantly improved. Moreover, perceptual loss has been used in a method that enhances subjective quality by preserving the features of the image, and as a result, the subjective quality is improved. Despite the quality enhancements achieved in VFI, no analysis has been performed to preserve specific features in the interpolated frames. Therefore, we conducted an analysis to preserve textural detail, such as film grain noise, which can represent the texture of an image, and weak textures, such as droplets or particles. Based on our analysis, we identify the importance of synthesis networks in textural detail preservation and propose an enhanced synthesis network, the Textural Detail Preservation Network (TDPNet). Furthermore, based on our analysis, we propose a Perceptual Training Method (PTM) to address the issue of degraded Peak Signal-to-Noise Ratio (PSNR) when simply applying perceptual loss and to preserve more textural detail. We also propose a Multi-scale Resolution Training Method (MRTM) to address the issue of poor performance when testing datasets with a resolution different from that of the training dataset. The experimental results of the proposed network was outperformed in LPIPS and DISTS on the Vimeo90K, HD, SNU-FILM and UVG datasets compared with the state-of-the-art VFI algorithms, and the subjective results were also outperformed. Furthermore, applying PTM improved PSNR results by an average of 0.293dB compared to simply applying perceptual loss.

引用

页码：71994 / 72006

页数：13

共 50 条

[41] Revisiting Adaptive Convolutions for Video Frame Interpolation
Niklaus, Simon
Mai, Long
Wang, Oliver
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1098 - 1108
[42] Progressive Motion Boosting for Video Frame Interpolation
Xiao, Jing
Xu, Kangmin
Hu, Mengshun
Liao, Liang
Wang, Zheng
Lin, Chia-Wen
Wang, Mi
Satoh, Shin'ichi
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8076 - 8090
[43] Directional Frame Interpolation for MPEG Compressed Video
Zhao, Chang
Gao, Xinwei
Fan, Xiaopeng
Zhao, Debin
VISUAL INFORMATION PROCESSING AND COMMUNICATION III, 2012, 8305
[44] Depth-Aware Video Frame Interpolation
Bao, Wenbo
Lai, Wei-Sheng
Ma, Chao
Zhang, Xiaoyun
Gao, Zhiyong
Yang, Ming-Hsuan
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3698 - 3707
[45] Motion-Aware Video Frame Interpolation
Han, Pengfei
Zhang, Fuhua
Zhao, Bin
Li, Xuelong
NEURAL NETWORKS, 2024, 178
[46] Video Frame Interpolation With Learnable Uncertainty and Decomposition
Yu, Zhiyang
Chen, Xijun
Ren, Shunqing
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2642 - 2646
[47] A SUBJECTIVE QUALITY STUDY FOR VIDEO FRAME INTERPOLATION
Danier, Duolikun
Zhang, Fan
Bull, David
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1361 - 1365
[48] DSF-Net: Dual-Stream Fused Network for Video Frame Interpolation
Zhang, Fuhua
Yang, Chuang
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1122 - 1126
[49] DSF-Net: Dual-Stream Fused Network for Video Frame Interpolation
Zhang, Fuhua
Yang, Chuang
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1122 - 1126
[50] Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation
Luo, Yao
Pan, Jinshan
Tang, Jinhui
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6773 - 6788

← 1 2 3 4 5 →