UPR-Net: A Unified Pyramid Recurrent Network for Video Frame Interpolation

被引:0
|
作者
Jin, Xin [1 ]
Wu, Longhai [1 ]
Chen, Jie [1 ]
Chen, Youxin [1 ]
Koo, Jayoon [2 ]
Hahm, Cheul-Hee [2 ]
Chen, Zhao-Min [3 ]
机构
[1] Samsung Elect China, R&D Ctr, Nanjing, Jiangsu, Peoples R China
[2] Samsung Elect Co, Suwon, Maetan 3-Dong, Gyeonggi Do, South Korea
[3] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Adv Educ Pk Zone, Wenzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Video frame interpolation; Optical flow; Frame synthesis; Pyramid network; Recurrent network;
D O I
10.1007/s11263-024-02164-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Flow-guided synthesis provides a popular framework for video frame interpolation, where optical flow is firstly estimated to warp input frames, and then the intermediate frame is synthesized from warped representations. Within this framework, optical flow is typically estimated from coarse-to-fine by a pyramid network, but the intermediate frame is commonly synthesized in a single pass, missing the opportunity of refining possible imperfect synthesis for high-resolution and large motion cases. While cascading several synthesis networks is a natural idea, it is nontrivial to unify iterative estimation of both optical flow and intermediate frame into a compact, flexible, and general framework. In this paper, we present UPR-Net, a novel Unified Pyramid Recurrent Network for frame interpolation. Cast in a flexible pyramid framework, UPR-Net exploits lightweight recurrent modules for both bi-directional flow estimation and intermediate frame synthesis. At each pyramid level, it leverages estimated bi-directional flow to generate forward-warped representations for frame synthesis; across pyramid levels, it enables iterative refinement for both optical flow and intermediate frame. We show that our iterative synthesis significantly improves the interpolation robustness on large motion cases, and the recurrent module design enables flexible resolution-aware adaptation in testing. When trained on low-resolution data, UPR-Net can achieve excellent performance on both low- and high-resolution benchmarks. Despite being extremely lightweight (1.7M parameters), the base version of UPR-Net competes favorably with many methods that rely on much heavier architectures. Code and trained models are publicly available at: https://github.com/srcn-ivl/UPR-Net.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A Unified Pyramid Recurrent Network for Video Frame Interpolation
    Jin, Xin
    Wu, Longhai
    Chen, Jie
    Chen, Youxin
    Koo, Jayoon
    Hahm, Cheul-hee
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1578 - 1587
  • [2] Video Frame Interpolation and Enhancement via Pyramid Recurrent Framework
    Shen, Wang
    Bao, Wenbo
    Zhai, Guangtao
    Chen, Li
    Min, Xiongkuo
    Gao, Zhiyong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 277 - 292
  • [3] Multi-Frame Pyramid Refinement Network for Video Frame Interpolation
    Zhang, Haoxian
    Wang, Ronggang
    Zhao, Yang
    [J]. IEEE ACCESS, 2019, 7 : 130610 - 130621
  • [4] MVFI-Net: Motion-Aware Video Frame Interpolation Network
    Lin, Xuhu
    Zhao, Lili
    Liu, Xi
    Chen, Jianwen
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 340 - 356
  • [5] DSF-Net: Dual-Stream Fused Network for Video Frame Interpolation
    Zhang, Fuhua
    Yang, Chuang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1122 - 1126
  • [6] DSF-Net: Dual-Stream Fused Network for Video Frame Interpolation
    Zhang, Fuhua
    Yang, Chuang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1122 - 1126
  • [7] PDWN: Pyramid Deformable Warping Network for Video Interpolation
    Chen, Zhiqi
    Wang, Ran
    Liu, Haojie
    Wang, Yao
    [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2021, 2 : 413 - 424
  • [8] Video frame interpolation via residual blocks and feature pyramid networks
    Yang, Xiaohui
    Zhang, Haoran
    Qu, Zhe
    Feng, Zhiquan
    Tian, Jinglan
    [J]. IET IMAGE PROCESSING, 2023, 17 (04) : 1060 - 1070
  • [9] Video Frame Interpolation Using Recurrent Convolutional Layers
    Zhang, Zhifeng
    Song, Li
    Xie, Rong
    Chen, Li
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [10] UNIFIED FRAMEWORK OF FRAME SKIPPING AND INTERPOLATION FOR EFFICIENT VIDEO COMPRESSION
    Lee, Sunyoung
    Lee, Myungjung
    Choi, Kiho
    Jang, Euee S.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 670 - 674