Dynamic Transformer for Few-shot Instance Segmentation

被引:2
|
作者
Wang, Haochen [1 ]
Liu, Jie [1 ]
Liu, Yongtuo [1 ]
Maji, Subhransu [2 ]
Sonke, Jan-Jakob [3 ]
Gavves, Efstratios [1 ]
机构
[1] Univ Amsterdam, Amsterdam, Netherlands
[2] Univ Massachusetts, Amherst, MA USA
[3] Netherlands Canc Inst, Amsterdam, Netherlands
关键词
Few-shot Instance Segmentation; Dynamic Queries; Semantic-induced Transformer Decoder;
D O I
10.1145/3503161.3548227
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Few-shot instance segmentation aims to train an instance segmentation model that can fast adapt to novel classes with only a few reference images. Existing methods are usually derived from standard detection models and tackle few-shot instance segmentation indirectly by conducting classification, box regression, and mask prediction on a large set of redundant proposals followed by indispensable post-processing, e.g., Non-Maximum Suppression. Such complicated hand-crafted procedures and hyper-parameters lead to degraded optimization and insufficient generalization ability. In this work, we propose an end-to-end Dynamic Transformer Network, DTN for short, to directly segment all target object instances from arbitrary categories given by reference images, relieving the requirements of dense proposal generation and post-processing. Specifically, a small set of Dynamic Queries, conditioned on reference images, are exclusively assigned to target object instances and generate all the instance segmentation masks of reference categories simultaneously. Moreover, a Semantic-induced Transformer Decoder is introduced to constrain the cross-attention between dynamic queries and target images within the pixels of the reference category, which suppresses the noisy interaction with the background and irrelevant categories. Extensive experiments are conducted on the COCO-20(i) dataset. The experiment results demonstrate that our proposed Dynamic Transformer Network significantly outperforms the state-of-the-arts.
引用
收藏
页码:2969 / 2977
页数:9
相关论文
共 50 条
  • [1] Incremental Few-Shot Instance Segmentation
    Ganea, Dan Andrei
    Boom, Bas
    Poppe, Ronald
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1185 - 1194
  • [2] Adaptive Agent Transformer for Few-Shot Segmentation
    Wang, Yuan
    Sun, Rui
    Zhang, Zhe
    Zhang, Tianzhu
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 36 - 52
  • [3] Mask Matching Transformer for Few-Shot Segmentation
    Jiao, Siyu
    Zhang, Gengwei
    Navasardyan, Shant
    Chen, Ling
    Zhao, Yao
    Wei, Yunchao
    Shi, Humphrey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] A lightweight siamese transformer for few-shot semantic segmentation
    Zhu, Hegui
    Zhou, Yange
    Jiang, Cong
    Yang, Lianping
    Jiang, Wuming
    Wang, Zhimu
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7455 - 7469
  • [5] A lightweight siamese transformer for few-shot semantic segmentation
    Hegui Zhu
    Yange Zhou
    Cong Jiang
    Lianping Yang
    Wuming Jiang
    Zhimu Wang
    [J]. Neural Computing and Applications, 2024, 36 : 7455 - 7469
  • [6] TPN: Triple parts network for few-shot instance segmentation
    Wang, Haotian
    Zhou, Shibin
    Xu, Xinzheng
    Zhang, Guopeng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46439 - 46455
  • [7] CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation
    Keaton, Matthew R.
    Zaveri, Ram J.
    Doretto, Gianfranco
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 455 - 466
  • [8] TPN: Triple parts network for few-shot instance segmentation
    Haotian Wang
    Shibin Zhou
    Xinzheng Xu
    Guopeng Zhang
    [J]. Multimedia Tools and Applications, 2023, 82 : 46439 - 46455
  • [9] Few-Shot Segmentation via Cycle-Consistent Transformer
    Zhang, Gengwei
    Kang, Guoliang
    Yang, Yi
    Wei, Yunchao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
    Wang, Yuan
    Luo, Naisong
    Zhang, Tianzhu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,