Textural Detail Preservation Network for Video Frame Interpolation

被引:0
|
作者
Yoon, Kihwan [1 ,2 ]
Huh, Jingang [1 ]
Kim, Yong Han [2 ]
Kim, Sungjei [1 ]
Jeong, Jinwoo [1 ]
机构
[1] Korea Elect Technol Inst KETI, Seongnam Si 13488, Gyeonggi Do, South Korea
[2] Univ Seoul, Sch Elect & Comp Engn, Seoul 02504, South Korea
关键词
Video frame interpolation; textural detail preservation; perceptual loss; synthesis network; ENHANCEMENT;
D O I
10.1109/ACCESS.2023.3294964
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The subjective image quality of the Video Frame Interpolation (VFI) result depends on whether image features such as edges, textures and blobs are preserved. With the development of deep learning, various algorithms have been proposed and the objective results of VFI have significantly improved. Moreover, perceptual loss has been used in a method that enhances subjective quality by preserving the features of the image, and as a result, the subjective quality is improved. Despite the quality enhancements achieved in VFI, no analysis has been performed to preserve specific features in the interpolated frames. Therefore, we conducted an analysis to preserve textural detail, such as film grain noise, which can represent the texture of an image, and weak textures, such as droplets or particles. Based on our analysis, we identify the importance of synthesis networks in textural detail preservation and propose an enhanced synthesis network, the Textural Detail Preservation Network (TDPNet). Furthermore, based on our analysis, we propose a Perceptual Training Method (PTM) to address the issue of degraded Peak Signal-to-Noise Ratio (PSNR) when simply applying perceptual loss and to preserve more textural detail. We also propose a Multi-scale Resolution Training Method (MRTM) to address the issue of poor performance when testing datasets with a resolution different from that of the training dataset. The experimental results of the proposed network was outperformed in LPIPS and DISTS on the Vimeo90K, HD, SNU-FILM and UVG datasets compared with the state-of-the-art VFI algorithms, and the subjective results were also outperformed. Furthermore, applying PTM improved PSNR results by an average of 0.293dB compared to simply applying perceptual loss.
引用
收藏
页码:71994 / 72006
页数:13
相关论文
共 50 条
  • [21] XVFI: eXtreme Video Frame Interpolation
    Sim, Hyeonjun
    Oh, Jihyong
    Kim, Munchurl
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14469 - 14478
  • [22] Video Frame Interpolation: A Comprehensive Survey
    Dong, Jiong
    Ota, Kaoru
    Dong, Mianxiong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [23] Deep frame interpolation for video compression
    Begaint, Jean
    Galpin, Franck
    Guillotel, Philippe
    Guillemot, Christine
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 556 - 556
  • [24] Video Frame Interpolation with Flow Transformer
    Gao, Pan
    Tian, Haoyue
    Qin, Jie
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1933 - 1942
  • [25] A CONCATENATED MODEL FOR VIDEO FRAME INTERPOLATION
    Chen, Ying
    Smith, Mark J. T.
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 565 - 569
  • [26] Deep Bayesian Video Frame Interpolation
    Yu, Zhiyang
    Zhang, Yu
    Xiang, Xujie
    Zou, Dongqing
    Chen, Xijun
    Ren, Jimmy S.
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 144 - 160
  • [27] Frame sequential interpolation for discrete level-of-detail rendering
    Scherzer, Daniel
    Wimmer, Michael
    COMPUTER GRAPHICS FORUM, 2008, 27 (04) : 1175 - 1181
  • [28] Optimizing Video Prediction via Video Frame Interpolation
    Wu, Yue
    Wen, Qiang
    Chen, Qifeng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17793 - 17802
  • [29] A Multi-Scale Position Feature Transform Network for Video Frame Interpolation
    Cheng, Xianhang
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3968 - 3981
  • [30] Edge-Aware Network for Flow-Based Video Frame Interpolation
    Zhao, Bin
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1401 - 1408