Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation

Cited by: 1
Authors
Wang, Qiuyue [1]
Zhang, Songyang [1]
He, Xuming [1]
Affiliations
[1] ShanghaiTech Univ, Shanghai, Peoples R China
Keywords
Few-shot Video Instance Segmentation; Few-shot Object Detection; Few-shot Learning;
DOI
10.1109/ICIP46576.2022.9897620
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Traditional video instance segmentation (VIS) aims to detect, segment, and track object instances from a known class set in videos. In real-world applications, however, video instance segmentation typically needs to cope with novel-class instances and to adapt quickly from only a few labeled videos. In this work, we tackle the task of few-shot video instance segmentation (FVIS), which is challenging due to large variations in object appearance and motion. We propose a robust, temporally coherent strategy, termed VTFA, based on a two-stage fine-tuning approach. VTFA enforces temporal smoothness in the instance segmentation of novel classes and reduces the classification bias between novel and base classes. The proposed Memory-aware Temporal Context Encoding Module (MTCE) in VTFA encodes temporal context information, which contributes to the consistency of the final predictions. We also propose an Instance-level Pair-wise Contrastive (IPC) loss, applied to both the novel and base classes, to enhance the robustness of instance classification. To validate our method, we develop a YouTube-VIS-FS benchmark and compare our method with several baselines. The experimental evaluation shows that our strategy is superior to or competitive with those strong baselines.
Pages: 251-255
Page count: 5