ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

被引:3
|
作者
Ying, Kaining [1 ]
Wang, Zhenhua [1 ]
Bai, Cong [1 ]
Zhou, Pengfei [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Instance segmentation; end-to-end; deformable attention; position-aware kernel;
D O I
10.1109/ICASSP43922.2022.9747246
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most instance segmentation models are not end-to-end trainable due to either the incorporation of proposal estimation (RPN) as a pre-processing or non-maximum suppression (NMS) as a post-processing. Here we propose a novel end-to-end instance segmentation method termed ISDA. It reshapes the task into predicting a set of object masks, which are generated via traditional convolution operation with learned position-aware kernels and features of objects. Such kernels and features are learned by leveraging a deformable attention network with multi-scale representation. Thanks to the introduced set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO, and achieves leading performance compared with recent models. Code will be available soon.
引用
收藏
页码:2619 / 2623
页数:5
相关论文
共 50 条
  • [1] Compact Position-Aware Attention Network for Image Semantic Segmentation
    Xu, Yajun
    Mao, Zhendong
    Zhang, Peng
    Wang, Bin
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 639 - 650
  • [2] Dynamic Anchor Box-based Instance Decoding and Position-aware Instance Association for Online Video Instance Segmentation
    Chun H.-J.
    Kim I.
    Journal of Institute of Control, Robotics and Systems, 2023, 29 (09) : 755 - 766
  • [3] Position-aware Attention for Enhancing the Machine Comprehension
    Liu, Weijie
    Zhao, Jianbo
    Li, Mingzheng
    Li, Si
    Guo, Jun
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 20 - 24
  • [4] Geometry Attention Transformer with position-aware LSTMs for image
    Wang, Chi
    Shen, Yulin
    Ji, Luping
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [5] DEFORMABLE VISTR: SPATIO TEMPORAL DEFORMABLE ATTENTION FOR VIDEO INSTANCE SEGMENTATION
    Yarram, Sudhir
    Wu, Jialian
    Ji, Pan
    Xu, Yi
    Yuan, Junsong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3303 - 3307
  • [6] A position-aware attention network with progressive detailing for land use semantic segmentation of Remote Sensing images
    Feng, Jiangfan
    Zheng, Wei
    Gu, Zhujun
    Guo, Dongen
    Qin, Rui
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (21) : 6762 - 6801
  • [7] Position-aware context attention for session-based recommendation
    Cao, Yi
    Zhang, Weifeng
    Song, Bo
    Pan, Weike
    Xu, Congfu
    NEUROCOMPUTING, 2020, 376 : 65 - 72
  • [8] Position-Aware Tooth Segmentation and Numbering with Prior Knowledge Injected
    Li, Changlin
    He, Jian
    Wang, Gaige
    Liu, Kuilong
    Yang, Changyuan
    CROSS-CULTURAL DESIGN, PT III, CCD 2023, 2023, 14024 : 457 - 475
  • [9] Enhancing Sindhi Word Segmentation Using Subword Representation Learning and Position-Aware Self-Attention
    Ali, Wazir
    Kumar, Jay
    Tumani, Saifullah
    Nour, Redhwan
    Noor, Adeeb
    Xu, Zenglin
    IEEE ACCESS, 2024, 12 : 183133 - 183142
  • [10] Joint entity and relation extraction with position-aware attention and relation embedding
    Chen, Tiantian
    Zhou, Lianke
    Wang, Nianbin
    Chen, Xirui
    APPLIED SOFT COMPUTING, 2022, 119