Exploring the Better Correlation for Few-Shot Video Object Segmentation

被引:0
|
作者
Luo, Naisong [1 ]
Wang, Yuan [1 ]
Sun, Rui [1 ]
Xiong, Guoxin [1 ]
Zhang, Tianzhu [1 ,2 ]
Wu, Feng [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci, Hefei 230027, Peoples R China
[2] Deep Space Explorat Lab, Hefei 230088, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot video object segmentation; video object segmentation; few-shot learning;
D O I
10.1109/TCSVT.2024.3491214
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot video object segmentation (FSVOS) aims to achieve accurate segmentation of novel objects in given video sequences, where the target objects are specified by limited annotated images as support. Most previous top-performing methods adopt the support-query semantic correlation learning paradigm or the intra-query temporal correlation learning paradigm. Nevertheless, they either fail to model temporal consistency across frames, resulting in inconsecutive segmentation, or lose diverse support object information, leading to incomplete segmentation. Therefore, we argue that it is more desirable to achieve both correlations in a collaborative manner. In this work, we delve into the issues present in the combination of few-shot image segmentation methods and video object segmentation methods and propose a dedicated Collaborative Correlation Network (CoCoNet) to address these problems, including a pixel correlation calibration module and a temporal correlation mining module. The proposed CoCoNet enjoys several merits. First, the pixel correlation calibration module aims to mitigate the noise issue in support-query correlation by integrating the affinity learning strategy and the prototype learning strategy. Specifically, we employ Optimal Transport to enrich pixel correlation with contextual information, thereby reducing intra-class differences between support and query. Second, the temporal correlation mining module is responsible for alleviating the issue of uncertainty in the initial frame and establishing reliable guidance for subsequent frames of the query video. With the collaboration of these two modules, our CoCoNet can effectively establish support-query and temporal correlation simultaneously and achieve accurate FSVOS. Extensive experimental results on two challenging benchmarks demonstrate that our method performs favorably against state-of-the-art FSVOS methods.
引用
收藏
页码:2133 / 2146
页数:14
相关论文
共 50 条
  • [41] Incremental Few-Shot Object Detection for Robotics
    Li, Yiting
    Zhu, Haiyue
    Tian, Sichao
    Feng, Fan
    Ma, Jun
    Teo, Chek Sing
    Xiang, Cheng
    Vadakkepat, Prahlad
    Lee, Tong Heng
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8447 - 8453
  • [42] Few-Shot Object Detection with Weight Imprinting
    Yan, Dingtian
    Huang, Jitao
    Sun, Hai
    Ding, Fuqiang
    COGNITIVE COMPUTATION, 2023, 15 (05) : 1725 - 1735
  • [43] Spatial reasoning for few-shot object detection
    Kim, Geonuk
    Jung, Hong-Gyu
    Lee, Seong-Whan
    PATTERN RECOGNITION, 2021, 120
  • [44] Adaptive Agent Transformer for Few-Shot Segmentation
    Wang, Yuan
    Sun, Rui
    Zhang, Zhe
    Zhang, Tianzhu
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 36 - 52
  • [45] Eliminating Feature Ambiguity for Few-Shot Segmentation
    Xu, Qianxiong
    Lin, Guosheng
    Loy, Chen Change
    Long, Cheng
    Li, Ziyue
    Zhao, Rui
    COMPUTER VISION - ECCV 2024, PT III, 2025, 15061 : 416 - 433
  • [46] Attentional prototype inference for few-shot segmentation
    Sun, Haoliang
    Lu, Xiankai
    Wang, Haochen
    Yin, Yilong
    Zhen, Xiantong
    Snoek, Cees G. M.
    Shao, Ling
    PATTERN RECOGNITION, 2023, 142
  • [47] Intermediate prototype network for few-shot segmentation
    Luo, Xiaoliu
    Duan, Zhao
    Zhang, Taiping
    SIGNAL PROCESSING, 2023, 203
  • [48] Few-Shot Microscopy Image Cell Segmentation
    Dawoud, Youssef
    Hornauer, Julia
    Carneiro, Gustavo
    Belagiannis, Vasileios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 139 - 154
  • [49] Few-Shot Panoptic Segmentation With Foundation Models
    Kaeppeler, Markus
    Petek, Kursat
    Voedisch, Niclas
    Burgar, Wolfram
    Valada, Abhinav
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 7718 - 7724
  • [50] On the Texture Bias for Few-Shot CNN Segmentation
    Azad, Reza
    Fayjie, Abdur R.
    Kauffmann, Claude
    Ben Ayed, Ismail
    Pedersoli, Marco
    Dolz, Jose
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2673 - 2682