A LIGHTWEIGHT NETWORK MODEL FOR VIDEO FRAME INTERPOLATION USING SPATIAL PYRAMIDS

Cited by: 0

Authors:

Zhuang, Jiankai ^{[1]}

Qin, Zengchang ^{[1,2]}

Chen, Jialu ^{[1]}

Wan, Tao ^{[3,4]}

Affiliations:

[1] Beihang Univ, Sch ASEE, Intelligent Comp & Machine Learning Lab, Beijing, Peoples R China

[2] AI Res, Shenzhen, Peoples R China

[3] Beihang Univ, Sch BSME, Beijing, Peoples R China

[4] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Beijing, Peoples R China

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020

Keywords:

Frame interpolation; Optical flow; Pyramid network; Deep learning

DOI:

Not available

Chinese Library Classification number:

TB8 [Photographic technology]

Discipline classification code:

0804

Abstract:

In recent years, deep-learning-based video frame interpolation methods have shown impressive results in handling occlusion, blur, and large motion. However, these models are usually very large, so they can hardly be deployed on devices with limited computing power, e.g., mobile phones or other portable devices. To address this problem, we propose the lightweight Spatial Pyramid Frame Interpolation Network (SPFIN), a hierarchical network that reconstructs frames in a coarse-to-fine manner. At each pyramid level, we apply two light sub-networks to model the optical flow and the visibility mask, instead of the commonly used U-Net architecture. The flow and mask are up-sampled and refined progressively. Finally, the intermediate frame is formed by linearly blending the warped frames with the masks. Experimental results on two benchmarks show that our model has the smallest size while achieving better or comparable performance compared with existing state-of-the-art models.
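
The final blending step and the progressive up-sampling of the flow can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the learned flow and mask sub-networks are omitted, and the function names (`backward_warp`, `upsample_flow`, `blend`), the bilinear sampling, the nearest-neighbour up-sampling, and the complementary weighting `mask * warped0 + (1 - mask) * warped1` are assumptions chosen for illustration, since the abstract only states that warped frames and masks are blended linearly.

```python
import numpy as np

def backward_warp(frame, flow):
    """Bilinearly sample `frame` (H, W, C) at pixel positions shifted by `flow` (H, W, 2).

    flow[..., 0] is the horizontal and flow[..., 1] the vertical displacement
    (an assumed convention). Out-of-range positions are clamped to the border.
    """
    h, w = frame.shape[:2]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    x = np.clip(xs + flow[..., 0], 0, w - 1)
    y = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = (x - x0)[..., None]  # fractional offsets, broadcast over channels
    wy = (y - y0)[..., None]
    top = (1 - wx) * frame[y0, x0] + wx * frame[y0, x1]
    bottom = (1 - wx) * frame[y1, x0] + wx * frame[y1, x1]
    return (1 - wy) * top + wy * bottom

def upsample_flow(flow):
    """Nearest-neighbour 2x up-sampling of a flow field.

    Displacements are doubled because they are measured in pixels of the
    finer pyramid level.
    """
    return 2.0 * flow.repeat(2, axis=0).repeat(2, axis=1)

def blend(frame0, frame1, flow_t0, flow_t1, mask):
    """Linearly blend two warped frames using a visibility mask in [0, 1] of shape (H, W, 1)."""
    warped0 = backward_warp(frame0, flow_t0)
    warped1 = backward_warp(frame1, flow_t1)
    return mask * warped0 + (1.0 - mask) * warped1
```

In a coarse-to-fine scheme of this kind, a flow estimated at one pyramid level would be passed through something like `upsample_flow` and then refined at the next level before the final call to `blend` at full resolution.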

Pages: 543-547

Page count: 5
