Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation

被引：1

作者：

Wang, Qiuyue ^{[1
]}

Zhang, Songyang ^{[1
]}

He, Xuming ^{[1
]}

机构：

[1] ShanghaiTech Univ, Shanghai, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年

关键词：

Few-shot Video Instance Segmentation; Few-shot Object Detection; Few-shot Learning;

D O I：

10.1109/ICIP46576.2022.9897620

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional video instance segmentation (VIS) aims to detect, segment, and track object instances from a known class set in videos. In real-world applications, however, video instance segmentation typically need to cope with novel-class instances and to fast adapt with a few labeled videos. In this work, we aim to tackle the task of few-shot video instance segmentation (FVIS), which is challenging due to large variations in object appearance and motion. We propose a robust temporally coherent strategy, termed as VTFA, based on a two-stage fine-tuning approach. VTFA enforces the instance segmentation of novel classes to be temporally smooth and reduces the classification bias between novel and base classes. The proposed Memory-aware Temporal Context Encoding Module (MTCE) in VTFA encodes the temporal context information, which contributes to the consistency in the final predictions. We also propose a loss named Instance-level Pair-wise Contrastive (IPC) Loss on both the novel and base classes to enhance the robustness of instance classification. To validate our method, we develop a YouTube-VIS-FS benchmark to compare our method with several baselines. The experimental evaluation shows that our strategy is superior or competitive to those strong baselines.

引用

页码：251 / 255

页数：5

共 50 条

[21] Temporally-coherent terawatt attosecond XFEL synchronized with a few cycle laser
Sandeep Kumar
Yong Woon Parc
Alexandra S. Landsman
Dong Eon Kim
Scientific Reports, 6
[22] Active Instance Selection for Few-Shot Classification
Shin, Junsup
Kang, Youngwook
Jung, Seungjin
Choi, Jongwon
IEEE ACCESS, 2022, 10 : 133186 - 133195
[23] Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
Tang, Yin
Chen, Tao
Jiang, Xiruo
Yao, Yazhou
Xie, Guo-Sen
Shen, Heng-Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6699 - 6709
[24] Adversarially Robust Prototypical Few-Shot Segmentation with Neural-ODEs
Pandey, Prashant
Vardhan, Aleti
Chasmai, Mustafa
Sur, Tanuj
Lall, Brejesh
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VIII, 2022, 13438 : 77 - 87
[25] Contour-Based Wild Animal Instance Segmentation Using a Few-Shot Detector
Tang, Jiaxi
Zhao, Yaqin
Feng, Liqi
Zhao, Wenxuan
ANIMALS, 2022, 12 (15):
[26] Incremental few-shot instance segmentation without fine-tuning on novel classes
Zhang, Luofeng
Weng, Libo
Zhang, Yuanming
Gao, Fei
COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 254
[27] Few-shot Video-to-Video Synthesis
Wang, Ting-Chun
Liu, Ming-Yu
Tao, Andrew
Liu, Guilin
Kautz, Jan
Catanzaro, Bryan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[28] Generalized Few-shot Semantic Segmentation
Tian, Zhuotao
Lai, Xin
Jiang, Li
Liu, Shu
Shu, Michelle
Zhao, Hengshuang
Jia, Jiaya
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
[29] Few-Shot Video Object Detection
Fan, Qi
Tang, Chi-Keung
Tai, Yu-Wing
COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 76 - 98
[30] Weakly Supervised Few-Shot and Zero-Shot Semantic Segmentation with Mean Instance Aware Prompt Learning
Pandey, Prashant
Chasmai, Mustafa
Natarajan, Monish
Lall, Brejesh
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1 - 6

← 1 2 3 4 5 →