Video Object Segmentation-aware Video Frame Interpolation

被引:1
|
作者
Yoo, Jun-Sang [1 ]
Lee, Hongjae [1 ]
Jung, Seung-Won [1 ]
机构
[1] Korea Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/ICCV51070.2023.01132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video frame interpolation (VFI) is a very active research topic due to its broad applicability to many applications, including video enhancement, video encoding, and slow- motion effects. VFI methods have been advanced by improving the overall image quality for challenging sequences containing occlusions, large motion, and dynamic texture. This mainstream research direction neglects that foreground and background regions have different importance in perceptual image quality. Moreover, accurate synthesis of moving objects can be of utmost importance in computer vision applications. In this paper, we propose a video object segmentation (VOS)-aware training framework called VOS-VFI that allows VFI models to interpolate frames with more precise object boundaries. Specifically, we exploit VOS as an auxiliary task to help train VFI models by providing additional loss functions, including segmentation loss and bi-directional consistency loss. From extensive experiments, we demonstrate that VOS-VFI can boost the performance of existing VFI models by rendering clear object boundaries. Moreover, VOS- VFI displays its effectiveness on multiple benchmarks for different applications, including video object segmentation, object pose estimation, and visual tracking. The code is available at https://github.com/junsang7777/VOS-VFI
引用
收藏
页码:12288 / 12299
页数:12
相关论文
共 50 条
  • [31] Key-frame extraction for object-based video segmentation
    Song, XM
    Fan, GL
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 689 - 692
  • [32] Modified intelligent scissors and adaptive frame skipping for video object segmentation
    Yang, GB
    Yu, SF
    REAL-TIME IMAGING, 2005, 11 (04) : 310 - 322
  • [33] Efficient frame-sequential label propagation for video object segmentation
    Chen, Yadang
    Hao, Chuanyan
    Wu, Wen
    Wu, Enhua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) : 6117 - 6133
  • [34] Softmax Splatting for Video Frame Interpolation
    Niklaus, Simon
    Liu, Feng
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5436 - 5445
  • [35] Exploring Discontinuity for Video Frame Interpolation
    Lee, Sangjin
    Lee, Hyeongmin
    Shin, Chajin
    Son, Hanbin
    Lee, Sangyoun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9791 - 9800
  • [36] XVFI: eXtreme Video Frame Interpolation
    Sim, Hyeonjun
    Oh, Jihyong
    Kim, Munchurl
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14469 - 14478
  • [37] Video Frame Interpolation with Flow Transformer
    Gao, Pan
    Tian, Haoyue
    Qin, Jie
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1933 - 1942
  • [38] Video Frame Interpolation: A Comprehensive Survey
    Dong, Jiong
    Ota, Kaoru
    Dong, Mianxiong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [39] Deep frame interpolation for video compression
    Begaint, Jean
    Galpin, Franck
    Guillotel, Philippe
    Guillemot, Christine
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 556 - 556
  • [40] A CONCATENATED MODEL FOR VIDEO FRAME INTERPOLATION
    Chen, Ying
    Smith, Mark J. T.
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 565 - 569