Make One-Shot Video Object Segmentation Efficient Again

被引:0
|
作者
Meinhardt, Tim [1 ]
Leal-Taixe, Laura [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video object segmentation (VOS) describes the task of segmenting a set of objects in each frame of a video. In the semi-supervised setting, the first mask of each object is provided at test time. Following the one-shot principle, fine-tuning VOS methods train a segmentation model separately on each given object mask. However, recently the VOS community has deemed such a test time optimization and its impact on the test runtime as unfeasible. To mitigate the inefficiencies of previous fine-tuning approaches, we present efficient One-Shot Video Object Segmentation (e-OSVOS). In contrast to most VOS approaches, e-OSVOS decouples the object detection task and predicts only local segmentation masks by applying a modified version of Mask R-CNN. The one-shot test runtime and performance are optimized without a laborious and handcrafted hyperparameter search. To this end, we meta learn the model initialization and learning rates for the test time optimization. To achieve an optimal learning behavior, we predict individual learning rates at a neuron level. Furthermore, we apply an online adaptation to address the common performance degradation throughout a sequence by continuously fine-tuning the model on previous mask predictions supported by a frame-to-frame bounding box propagation. e-OSVOS provides state-of-the-art results on DAVIS 2016, DAVIS 2017 and YouTube-VOS for one-shot fine-tuning methods while reducing the test runtime substantially. Code is available at https://github.com/dvl-tum/e-osvos.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] One-Shot Learning with Pseudo-Labeling for Cattle Video Segmentation in Smart Livestock Farming
    Qiao, Yongliang
    Xue, Tengfei
    Kong, He
    Clark, Cameron
    Lomax, Sabrina
    Rafique, Khalid
    Sukkarieh, Salah
    ANIMALS, 2022, 12 (05):
  • [22] Two-shot Video Object Segmentation
    Yan, Kun
    Li, Xiao
    Wei, Fangyun
    Wang, Jinglu
    Zhang, Chenbin
    Wang, Ping
    Lu, Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2257 - 2267
  • [23] One-Shot Unsupervised Domain Adaptation for Object Detection
    Wan, Zhiqiang
    Li, Lusi
    Li, Hepeng
    He, Haibo
    Ni, Zhen
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [24] Repurposing GANs for One-Shot Semantic Part Segmentation
    Rewatbowornwong, Pitchaporn
    Tritrong, Nontawat
    Suwajanakorn, Supasorn
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 5114 - 5125
  • [25] Repurposing GANs for One-shot Semantic Part Segmentation
    Tritrong, Nontawat
    Rewatbowornwong, Pitchaporn
    Suwajanakorn, Supasorn
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4473 - 4483
  • [26] OSCD: A one-shot conditional object detection framework
    Fu, Kun
    Zhang, Tengfei
    Zhang, Yue
    Sun, Xian
    NEUROCOMPUTING, 2021, 425 : 243 - 255
  • [27] Adaptive Image Transformer for One-Shot Object Detection
    Chen, Ding-Jie
    Hsieh, He-Yen
    Liu, Tyng-Luh
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12242 - 12251
  • [28] Prototype Comparison Convolutional Networks for One-Shot Segmentation
    Li, Lingbo
    Li, Zhichun
    Guo, Fusen
    Yang, Haoyu
    Wei, Jingtian
    Yang, Zhengyi
    IEEE ACCESS, 2024, 12 : 54978 - 54990
  • [29] Augmentative contrastive learning for one-shot object detection
    Du, Yaoyang
    Liu, Fang
    Jiao, Licheng
    Hao, Zehua
    Li, Shuo
    Liu, Xu
    Liu, Jing
    NEUROCOMPUTING, 2022, 513 : 13 - 24
  • [30] Efficient methods for one-shot quantum communication
    Anurag Anshu
    Rahul Jain
    npj Quantum Information, 8