Video Object Segmentation-aware Video Frame Interpolation

被引：1

作者：

Yoo, Jun-Sang ^{[1
]}

Lee, Hongjae ^{[1
]}

Jung, Seung-Won ^{[1
]}

机构：

[1] Korea Univ, Seoul, South Korea

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/ICCV51070.2023.01132

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video frame interpolation (VFI) is a very active research topic due to its broad applicability to many applications, including video enhancement, video encoding, and slow- motion effects. VFI methods have been advanced by improving the overall image quality for challenging sequences containing occlusions, large motion, and dynamic texture. This mainstream research direction neglects that foreground and background regions have different importance in perceptual image quality. Moreover, accurate synthesis of moving objects can be of utmost importance in computer vision applications. In this paper, we propose a video object segmentation (VOS)-aware training framework called VOS-VFI that allows VFI models to interpolate frames with more precise object boundaries. Specifically, we exploit VOS as an auxiliary task to help train VFI models by providing additional loss functions, including segmentation loss and bi-directional consistency loss. From extensive experiments, we demonstrate that VOS-VFI can boost the performance of existing VFI models by rendering clear object boundaries. Moreover, VOS- VFI displays its effectiveness on multiple benchmarks for different applications, including video object segmentation, object pose estimation, and visual tracking. The code is available at https://github.com/junsang7777/VOS-VFI

引用

页码：12288 / 12299

页数：12

共 50 条

[31] Key-frame extraction for object-based video segmentation
Song, XM
Fan, GL
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 689 - 692
[32] Modified intelligent scissors and adaptive frame skipping for video object segmentation
Yang, GB
Yu, SF
REAL-TIME IMAGING, 2005, 11 (04) : 310 - 322
[33] Efficient frame-sequential label propagation for video object segmentation
Chen, Yadang
Hao, Chuanyan
Wu, Wen
Wu, Enhua
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) : 6117 - 6133
[34] Softmax Splatting for Video Frame Interpolation
Niklaus, Simon
Liu, Feng
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5436 - 5445
[35] Exploring Discontinuity for Video Frame Interpolation
Lee, Sangjin
Lee, Hyeongmin
Shin, Chajin
Son, Hanbin
Lee, Sangyoun
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9791 - 9800
[36] XVFI: eXtreme Video Frame Interpolation
Sim, Hyeonjun
Oh, Jihyong
Kim, Munchurl
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14469 - 14478
[37] Video Frame Interpolation with Flow Transformer
Gao, Pan
Tian, Haoyue
Qin, Jie
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1933 - 1942
[38] Video Frame Interpolation: A Comprehensive Survey
Dong, Jiong
Ota, Kaoru
Dong, Mianxiong
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
[39] Deep frame interpolation for video compression
Begaint, Jean
Galpin, Franck
Guillotel, Philippe
Guillemot, Christine
2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 556 - 556
[40] A CONCATENATED MODEL FOR VIDEO FRAME INTERPOLATION
Chen, Ying
Smith, Mark J. T.
2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 565 - 569

← 1 2 3 4 5 →