A LIGHTWEIGHT NETWORK MODEL FOR VIDEO FRAME INTERPOLATION USING SPATIAL PYRAMIDS

被引:0
|
作者
Zhuang, Jiankai [1 ]
Qin, Zengchang [1 ,2 ]
Chen, Jialu [1 ]
Wan, Tao [3 ,4 ]
机构
[1] Beihang Univ, Sch ASEE, Intelligent Comp & Machine Learning Lab, Beijing, Peoples R China
[2] AI Res, Shenzhen, Peoples R China
[3] Beihang Univ, Sch BSME, Beijing, Peoples R China
[4] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Beijing, Peoples R China
关键词
Frame interpolation; Optical flow; Pyramid network; Deep learning;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In recent years, deep learning based video frame interpolation methods have shown impressive results in handling occlusion, blur and large motion. However, they are usually very heavy in terms of model size, and they hardly to be employed in i.e. mobile phones or other portable devices with limited computing power. To address the problem, we propose light-weighted Spatial Pyramid Frame Interpolation Network (SPFIN), a hierarchical network in a coarse-to-fine approach to reconstruct frames. At each pyramid level, we apply two light sub-networks to model optical flow and visibility mask instead of commonly used U-Net architecture. The flow and mask are up-sampled and optimized progressively. Finally, the intermediate frame is formed by linearly blending warped frames and masks. Experimental results on two benchmark problems show that our model has the smallest size, but better or comparable performance comparing to existing state-of-the art models.
引用
收藏
页码:543 / 547
页数:5
相关论文
共 50 条
  • [21] Video frame interpolation via spatial multi-scale modelling
    Qu, Zhe
    Liu, Weijing
    Cui, Lizhen
    Yang, Xiaohui
    IET COMPUTER VISION, 2024, 18 (04) : 458 - 472
  • [22] VIDEO FRAME INTERPOLATION VIA LOCAL LIGHTWEIGHT BIDIRECTIONAL ENCODING WITH CHANNEL ATTENTION CASCADE
    Ding, Xiangling
    Huang, Pu
    Zhang, Dengyong
    Zhao, Xianfeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1915 - 1919
  • [23] Video Frame Interpolation Based on Lightweight Convolutional Unit and Three-scale Encoder
    Liu, Weijing
    Yang, Xiaohui
    Feng, Zhiquan
    Xu, Tao
    Guo, Qingbei
    ACM International Conference Proceeding Series, 2023,
  • [24] PhaseNet for Video Frame Interpolation
    Meyer, Simone
    Djelouah, Abdelaziz
    McWilliams, Brian
    Sorkine-Hornung, Alexander
    Gross, Markus
    Schroers, Christopher
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 498 - 507
  • [25] Blurry Video Frame Interpolation
    Shen, Wang
    Bao, Wenbo
    Zhai, Guangtao
    Chen, Li
    Min, Xiongkuo
    Gao, Zhiyong
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5113 - 5122
  • [26] Video Frame Interpolation Transformer
    Shi, Zhihao
    Xu, Xiangyu
    Liu, Xiaohong
    Chen, Jun
    Yang, Ming-Hsuan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17461 - 17470
  • [27] Video Frame Interpolation with Transformer
    Lu, Liying
    Wu, Ruizheng
    Lin, Huaijia
    Lu, Jiangbo
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3522 - 3532
  • [28] A Fast 4K Video Frame Interpolation based on StepWise Optical Flow Computation and Video Spatial Interpolation
    Jeong, Jinwoo
    Hong, Minsoo
    Kim, Je Woo
    Kim, Sungjei
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1140 - 1143
  • [29] Progressive Motion Context Refine Network for Efficient Video Frame Interpolation
    Kong, Lingtong
    Liu, Jinfeng
    Yang, Jie
    arXiv, 2022,
  • [30] A NOVEL ALL-IN-ONE GRID NETWORK FOR VIDEO FRAME INTERPOLATION
    Xue, Fanyong
    Li, Jie
    Wu, Chentao
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1969 - 1973