ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

被引:3
|
作者
Ying, Kaining [1 ]
Wang, Zhenhua [1 ]
Bai, Cong [1 ]
Zhou, Pengfei [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Instance segmentation; end-to-end; deformable attention; position-aware kernel;
D O I
10.1109/ICASSP43922.2022.9747246
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most instance segmentation models are not end-to-end trainable due to either the incorporation of proposal estimation (RPN) as a pre-processing or non-maximum suppression (NMS) as a post-processing. Here we propose a novel end-to-end instance segmentation method termed ISDA. It reshapes the task into predicting a set of object masks, which are generated via traditional convolution operation with learned position-aware kernels and features of objects. Such kernels and features are learned by leveraging a deformable attention network with multi-scale representation. Thanks to the introduced set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO, and achieves leading performance compared with recent models. Code will be available soon.
引用
收藏
页码:2619 / 2623
页数:5
相关论文
共 50 条
  • [41] Position-aware compositional embeddings for compressed recommendation systems
    Mu, Zongshen
    Zhuang, Yueting
    Tang, Siliang
    NEUROCOMPUTING, 2024, 592
  • [42] Position-Aware ListMLE: A Sequential Learning Process for Ranking
    Lan, Yanyan
    Zhu, Yadong
    Guo, Jiafeng
    Niu, Shuzi
    Cheng, Xueqi
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 449 - 458
  • [43] AFINITI: attention-aware feature integration for nuclei instance segmentation and type identification
    Esha Sadia Nasir
    Shahzad Rasool
    Raheel Nawaz
    Muhammad Moazam Fraz
    Neural Computing and Applications, 2024, 36 (29) : 18343 - 18361
  • [44] Boundary-aware Instance Segmentation
    Hayder, Zeeshan
    He, Xuming
    Salzmann, Mathieu
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 587 - 595
  • [45] Position-Aware Recalibration Module: Learning From Feature Semantics and Feature Position
    Ma, Xu
    Fu, Song
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 797 - 803
  • [46] MVP-Net: Multi-view FPN with Position-Aware Attention for Deep Universal Lesion Detection
    Li, Zihao
    Zhang, Shu
    Zhang, Junge
    Huang, Kaiqi
    Wang, Yizhou
    Yu, Yizhou
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 13 - 21
  • [47] Position-Aware Masked Autoencoder for Histopathology WSI Representation Learning
    Wu, Kun
    Zheng, Yushan
    Shi, Jun
    Xie, Fengying
    Jiang, Zhiguo
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 714 - 724
  • [48] Unbiased Ad Click Prediction for Position-aware Advertising Systems
    Yuan, Bowen
    Liu, Yaxu
    Hsia, Jui-Yang
    Dong, Zhenhua
    Lin, Chih-Jen
    RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 368 - 377
  • [49] A Character Position-Aware Compression Framework for Screen Text Image
    Zhu, Chen
    Lu, Guo
    Chen, Huanbang
    Feng, Donghui
    Wang, Shen
    Zhao, Yan
    Xie, Rong
    Song, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8821 - 8835
  • [50] PosKHG: A Position-Aware Knowledge Hypergraph Model for Link Prediction
    Zirui Chen
    Xin Wang
    Chenxu Wang
    Zhao Li
    Data Science and Engineering, 2023, 8 : 135 - 145