Video Object Segmentation-aware Video Frame Interpolation

Cited by: 1
Authors
Yoo, Jun-Sang [1]
Lee, Hongjae [1]
Jung, Seung-Won [1]
Affiliations
[1] Korea Univ, Seoul, South Korea
Funding
National Research Foundation, Singapore
DOI
10.1109/ICCV51070.2023.01132
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video frame interpolation (VFI) is a very active research topic due to its broad applicability to many applications, including video enhancement, video encoding, and slow-motion effects. VFI methods have been advanced by improving the overall image quality for challenging sequences containing occlusions, large motion, and dynamic texture. This mainstream research direction neglects that foreground and background regions have different importance in perceptual image quality. Moreover, accurate synthesis of moving objects can be of utmost importance in computer vision applications. In this paper, we propose a video object segmentation (VOS)-aware training framework called VOS-VFI that allows VFI models to interpolate frames with more precise object boundaries. Specifically, we exploit VOS as an auxiliary task to help train VFI models by providing additional loss functions, including segmentation loss and bi-directional consistency loss. From extensive experiments, we demonstrate that VOS-VFI can boost the performance of existing VFI models by rendering clear object boundaries. Moreover, VOS-VFI displays its effectiveness on multiple benchmarks for different applications, including video object segmentation, object pose estimation, and visual tracking. The code is available at https://github.com/junsang7777/VOS-VFI
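The abstract names a segmentation loss and a bi-directional consistency loss but does not define them. The sketch below is only an illustration of how two such auxiliary terms could be added to a standard reconstruction loss. Every function name, the choice of L1 and binary cross-entropy, the meaning given to "bi-directional" (masks propagated to the interpolated frame from the previous and from the next frame), and the weights `w_seg` and `w_con` are assumptions made for this example, not the authors' formulation; the linked repository is the authoritative source.

```python
import math


def l1_loss(pred, target):
    """Mean absolute error between two equally sized flat lists of values."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def bce_loss(pred_mask, target_mask, eps=1e-7):
    """Binary cross-entropy between soft masks with values in [0, 1]."""
    total = 0.0
    for p, t in zip(pred_mask, target_mask):
        p = min(max(p, eps), 1.0 - eps)  # clamp to keep log() finite
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(pred_mask)


def vos_aware_loss(pred_frame, gt_frame, mask_fwd, mask_bwd, mask_gt,
                   w_seg=0.1, w_con=0.1):
    """Hypothetical combined objective (all terms are assumptions).

    pred_frame / gt_frame: interpolated and ground-truth pixels (flat lists).
    mask_fwd / mask_bwd:   object masks on the interpolated frame, assumed to
                           be propagated from the previous / next frame.
    mask_gt:               object mask on the ground-truth middle frame.
    """
    rec = l1_loss(pred_frame, gt_frame)                      # reconstruction
    seg = 0.5 * (bce_loss(mask_fwd, mask_gt)
                 + bce_loss(mask_bwd, mask_gt))              # segmentation
    con = l1_loss(mask_fwd, mask_bwd)                        # fwd/bwd agreement
    return rec + w_seg * seg + w_con * con
```

In this sketch the auxiliary terms only affect training: at inference time a VFI model trained this way would run without any segmentation network, which matches the abstract's description of VOS as an auxiliary task.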
Pages: 12288-12299
Page count: 12