ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

Cited by: 3
Authors
Ying, Kaining [1]
Wang, Zhenhua [1]
Bai, Cong [1]
Zhou, Pengfei [1]
Affiliations
[1] Zhejiang University of Technology, College of Computer Science and Technology, Hangzhou, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Instance segmentation; end-to-end; deformable attention; position-aware kernel;
DOI
10.1109/ICASSP43922.2022.9747246
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Most instance segmentation models are not end-to-end trainable because they rely on either proposal estimation (RPN) as a pre-processing step or non-maximum suppression (NMS) as a post-processing step. Here we propose a novel end-to-end instance segmentation method termed ISDA. It recasts the task as predicting a set of object masks, which are generated by a conventional convolution operation using learned position-aware kernels and object features. These kernels and features are learned by a deformable attention network with multi-scale representation. Thanks to the set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO and achieves leading performance compared with recent models. Code will be available soon.
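As a rough illustration of the mask-generation step described in the abstract, the sketch below applies one kernel per predicted object to a shared feature map to produce one mask per object. It is not the authors' implementation. The function names `add_coord_channels` and `predict_masks`, the use of 1x1 kernels, the CoordConv-style coordinate channels as the source of position awareness, and the 0.5 threshold are all assumptions made for this example. In ISDA the kernels and features are learned by a deformable attention network; here they are written by hand.

```python
import math


def add_coord_channels(features):
    """Append normalized x and y coordinate maps (range [-1, 1]) as extra channels.

    Assumption for this sketch: position awareness is modeled by letting the
    kernels see pixel coordinates. `features` is a list of channels, each a
    list of rows.
    """
    height, width = len(features[0]), len(features[0][0])
    xs = [[(2 * x / (width - 1) - 1) if width > 1 else 0.0 for x in range(width)]
          for _ in range(height)]
    ys = [[(2 * y / (height - 1) - 1) if height > 1 else 0.0 for _ in range(width)]
          for y in range(height)]
    return features + [xs, ys]


def predict_masks(features, kernels, threshold=0.5):
    """Apply one 1x1 kernel per object to the shared features; return binary masks."""
    height, width = len(features[0]), len(features[0][0])
    masks = []
    for kernel in kernels:
        if len(kernel) != len(features):
            raise ValueError("kernel length must equal the number of channels")
        mask = []
        for y in range(height):
            row = []
            for x in range(width):
                # A 1x1 convolution is a per-pixel dot product over channels.
                logit = sum(w * channel[y][x] for w, channel in zip(kernel, features))
                prob = 1.0 / (1.0 + math.exp(-logit))
                row.append(1 if prob > threshold else 0)
            mask.append(row)
        masks.append(mask)
    return masks


# One appearance channel that is identical everywhere, on a 2x3 grid.
appearance = [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]]
features = add_coord_channels(appearance)

# Two hand-written kernels: one responds to the right side, one to the left.
kernels = [[0.0, 5.0, 0.0], [0.0, -5.0, 0.0]]
masks = predict_masks(features, kernels)
print(masks[0])  # [[0, 0, 1], [0, 0, 1]]
print(masks[1])  # [[1, 0, 0], [1, 0, 0]]
```

In this toy case the two kernels produce different masks even though the appearance channel is uniform, because they weight the coordinate channels differently. Each kernel yields exactly one mask, so the output is already a set of instance predictions and needs no NMS-style filtering.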
Pages: 2619-2623
Page count: 5