Holistic Prototype Attention Network for Few-Shot Video Object Segmentation

被引:8
|
作者
Tang, Yin [1 ]
Chen, Tao [1 ]
Jiang, Xiruo [1 ]
Yao, Yazhou [1 ]
Xie, Guo-Sen [1 ]
Shen, Heng-Tao [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypes; Task analysis; Object segmentation; Semantic segmentation; Semantics; Feature extraction; Annotations; Few-shot video object segmentation; video object segmentation; few-shot semantic segmentation;
D O I
10.1109/TCSVT.2023.3296629
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot video object segmentation (FSVOS) aims to segment dynamic objects of unseen classes by resorting to a small set of support images that contain pixel-level object annotations. Existing methods have demonstrated that the domain agent-based attention mechanism is effective in FSVOS by learning the correlation between support images and query frames. However, the agent frame contains redundant pixel information and background noise, resulting in inferior segmentation performance. Moreover, existing methods tend to ignore inter-frame correlations in query videos. To alleviate the above dilemma, we propose a holistic prototype attention network (HPAN) for advancing FSVOS. Specifically, HPAN introduces a prototype graph attention module (PGAM) and a bidirectional prototype attention module (BPAM), transferring informative knowledge from seen to unseen classes. PGAM generates local prototypes from all foreground features and then utilizes their internal correlations to enhance the representation of the holistic prototypes. BPAM exploits the holistic information from support images and video frames by fusing co-attention and self-attention to achieve support-query semantic consistency and inner-frame temporal consistency. Extensive experiments on YouTube-FSVOS have been provided to demonstrate the effectiveness and superiority of our proposed HPAN method. Our source code and models are available anonymously at https://github.com/NUST-Machine-Intelligence-Laboratory/HPAN.
引用
收藏
页码:6699 / 6709
页数:11
相关论文
共 50 条
  • [41] Cross Attention Network for Few-shot Classification
    Hou, Ruibing
    Chang, Hong
    Ma, Bingpeng
    Shan, Shiguang
    Chen, Xilin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [42] Spatial Attention Network for Few-Shot Learning
    He, Xianhao
    Qiao, Peng
    Dou, Yong
    Niu, Xin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 567 - 578
  • [43] A LOCATION-SENSITIVE LOCAL PROTOTYPE NETWORK FOR FEW-SHOT MEDICAL IMAGE SEGMENTATION
    Yu, Qinji
    Dang, Kang
    Tajbakhsh, Nima
    Terzopoulos, Demetri
    Ding, Xiaowei
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 262 - 266
  • [44] Dual-Guided Prototype Alignment Network for Few-Shot Medical Image Segmentation
    Shen, Yue
    Fan, Wanshu
    Wang, Cong
    Liu, Wenfei
    Wang, Wei
    Zhang, Qiang
    Zhou, Dongsheng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [45] Multi-prototype collaborative perception enhancement network for few-shot semantic segmentation
    Chang, Zhaobin
    Gao, Xiong
    Kong, Dongyi
    Li, Na
    Lu, Yonggang
    VISUAL COMPUTER, 2024,
  • [46] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
    Chen, Sihan
    Chen, Yadang
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667
  • [47] Attention Relational Network for Few-Shot Learning
    Shuai, Jia
    Chen, JiaMing
    Yang, Meng
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 163 - 174
  • [48] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation
    Huang, Wendong
    Hu, Jinwu
    Xiao, Junhao
    Wei, Yang
    Bi, Xiuli
    Xiao, Bin
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (02) : 761 - 773
  • [49] SAPENet: Self-Attention based Prototype Enhancement Network for Few-shot Learning
    Huang, Xilang
    Choi, Seon Han
    PATTERN RECOGNITION, 2023, 135
  • [50] Universal-Prototype Enhancing for Few-Shot Object Detection
    Wu, Aming
    Han, Yahong
    Zhu, Linchao
    Yang, Yi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9547 - 9556