XVFI: eXtreme Video Frame Interpolation

Citations: 30

Authors:

Sim, Hyeonjun [1]

Oh, Jihyong [1]

Kim, Munchurl [1]

Affiliation:

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

Keywords:

IMAGE QUALITY;

DOI:

10.1109/ICCV48922.2021.01422

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we first present X4K1000FPS, a dataset of 4K videos at 1000 fps with extreme motion, to the research community for video frame interpolation (VFI), and propose an extreme VFI network, called XVFI-Net, which is the first to handle VFI for 4K videos with large motion. XVFI-Net is based on a recursive multi-scale shared structure that consists of two cascaded modules: one for bidirectional optical flow learning between the two input frames (BiOF-I) and one for bidirectional optical flow learning from the target frame to the input frames (BiOF-T). The optical flows are stably approximated by a complementary flow reversal (CFR) proposed in the BiOF-T module. During inference, the BiOF-I module can start at any scale of the input, while the BiOF-T module operates only at the original input scale, so that inference can be accelerated while maintaining highly accurate VFI performance. Extensive experimental results show that XVFI-Net can successfully capture the essential information of objects with extremely large motions and complex textures, while the state-of-the-art methods exhibit poor performance. Furthermore, the XVFI-Net framework also performs comparably on the previous lower-resolution benchmark dataset, which shows the robustness of the algorithm as well. All source code, pre-trained models, and the proposed X4K1000FPS dataset are publicly available at https://github.com/JihyongOh/XVFI.
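
The inference schedule described in the abstract can be illustrated with a minimal sketch. It only lists which module would run at each scale. It is not the authors' implementation. The function name `inference_schedule`, the `start_level` parameter, and the factor-of-2 spacing between scale levels are assumptions made for illustration; the abstract states only that BiOF-I can start at any scale and that BiOF-T runs at the original input scale.

```python
def inference_schedule(start_level):
    """Return a coarse-to-fine list of (downscale_factor, modules) steps.

    start_level is a hypothetical parameter: 0 means "start at the original
    scale", and each higher level is assumed to halve the resolution again.
    """
    if start_level < 0:
        raise ValueError("start_level must be non-negative")

    steps = []
    # Walk from the coarsest chosen level down to level 0 (original scale).
    for level in range(start_level, -1, -1):
        factor = 2 ** level  # assumed power-of-two downscale factor
        modules = ["BiOF-I"]  # BiOF-I runs at every scale that is visited
        if level == 0:
            modules.append("BiOF-T")  # BiOF-T only at the original scale
        steps.append((factor, modules))
    return steps


if __name__ == "__main__":
    for factor, modules in inference_schedule(3):
        print(f"downscale x{factor}: {' -> '.join(modules)}")
```

Under these assumptions, a larger `start_level` adds coarser BiOF-I passes, which is where large motions would be easier to cover, while the number of BiOF-T passes stays at one.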


Pages: 14469-14478

Page count: 10
