Video Object Segmentation-aware Video Frame Interpolation

被引：1

作者：

Yoo, Jun-Sang ^{[1
]}

Lee, Hongjae ^{[1
]}

Jung, Seung-Won ^{[1
]}

机构：

[1] Korea Univ, Seoul, South Korea

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/ICCV51070.2023.01132

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video frame interpolation (VFI) is a very active research topic due to its broad applicability to many applications, including video enhancement, video encoding, and slow- motion effects. VFI methods have been advanced by improving the overall image quality for challenging sequences containing occlusions, large motion, and dynamic texture. This mainstream research direction neglects that foreground and background regions have different importance in perceptual image quality. Moreover, accurate synthesis of moving objects can be of utmost importance in computer vision applications. In this paper, we propose a video object segmentation (VOS)-aware training framework called VOS-VFI that allows VFI models to interpolate frames with more precise object boundaries. Specifically, we exploit VOS as an auxiliary task to help train VFI models by providing additional loss functions, including segmentation loss and bi-directional consistency loss. From extensive experiments, we demonstrate that VOS-VFI can boost the performance of existing VFI models by rendering clear object boundaries. Moreover, VOS- VFI displays its effectiveness on multiple benchmarks for different applications, including video object segmentation, object pose estimation, and visual tracking. The code is available at https://github.com/junsang7777/VOS-VFI

引用

页码：12288 / 12299

页数：12

共 50 条

[1] An adaptive frame skipping and VOP interpolation algorithm for video object segmentation
Yang, GB
Zhang, ZY
CHINESE JOURNAL OF ELECTRONICS, 2004, 13 (03): : 453 - 458
[2] Depth-Aware Video Frame Interpolation
Bao, Wenbo
Lai, Wei-Sheng
Ma, Chao
Zhang, Xiaoyun
Gao, Zhiyong
Yang, Ming-Hsuan
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3698 - 3707
[3] Motion-Aware Video Frame Interpolation
Han, Pengfei
Zhang, Fuhua
Zhao, Bin
Li, Xuelong
NEURAL NETWORKS, 2024, 178
[4] Texture-aware Video Frame Interpolation
Danier, Duolikun
Bull, David
2021 PICTURE CODING SYMPOSIUM (PCS), 2021, : 226 - 230
[5] Video Object Segmentation for Content-Aware Video Compression
Sun, Lu
Decombas, Marc
Lang, Jochen
2016 13TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2016, : 116 - 123
[6] SAIN: SIMILARITY-AWARE VIDEO FRAME INTERPOLATION
Lv, Yue
Yang, Wenming
Zuo, Wangmeng
Liao, Qingmin
Zhu, Rui
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1920 - 1924
[7] Context-aware Synthesis for Video Frame Interpolation
Niklaus, Simon
Liu, Feng
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1701 - 1710
[8] Saliency-Aware Video Object Segmentation
Wang, Wenguan
Shen, Jianbing
Yang, Ruigang
Porikli, Fatih
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 20 - 33
[9] A Temporally-Aware Interpolation Network for Video Frame Inpainting
Sun, Ximeng
Szeto, Ryan
Corso, Jason J.
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 249 - 264
[10] A Temporally-Aware Interpolation Network for Video Frame Inpainting
Szeto, Ryan
Sun, Ximeng
Lu, Kunyi
Corso, Jason J.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1053 - 1068

← 1 2 3 4 5 →