ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

被引:3
|
作者
Ying, Kaining [1 ]
Wang, Zhenhua [1 ]
Bai, Cong [1 ]
Zhou, Pengfei [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Instance segmentation; end-to-end; deformable attention; position-aware kernel;
D O I
10.1109/ICASSP43922.2022.9747246
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most instance segmentation models are not end-to-end trainable due to either the incorporation of proposal estimation (RPN) as a pre-processing or non-maximum suppression (NMS) as a post-processing. Here we propose a novel end-to-end instance segmentation method termed ISDA. It reshapes the task into predicting a set of object masks, which are generated via traditional convolution operation with learned position-aware kernels and features of objects. Such kernels and features are learned by leveraging a deformable attention network with multi-scale representation. Thanks to the introduced set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO, and achieves leading performance compared with recent models. Code will be available soon.
引用
收藏
页码:2619 / 2623
页数:5
相关论文
共 50 条
  • [31] Position-Aware Attention Mechanism-Based Bi-graph for Dialogue Relation Extraction
    Duan, Guiduo
    Dong, Yunrui
    Miao, Jiayu
    Huang, Tianxi
    COGNITIVE COMPUTATION, 2023, 15 (01) : 359 - 372
  • [32] Aspect and Opinion Terms Co-extraction Using Position-Aware Attention and Auxiliary Labels
    Liu, Chao
    Wei, Xintong
    Yu, Min
    Li, Gang
    Ma, Xiangmei
    Jiang, Jianguo
    Huang, Weiqing
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 162 - 173
  • [33] DCATNet: polyp segmentation with deformable convolution and contextual-aware attention network
    Zenan Wang
    Tianshu Li
    Ming Liu
    Jue Jiang
    Xinjuan Liu
    BMC Medical Imaging, 25 (1)
  • [34] Position-aware multimedia mobile learning systems in museums
    Chou, LD
    Wu, CH
    Ho, SP
    Lee, CC
    Proceedings of the IASTED International Conference on Web-Based Education, 2004, : 148 - 150
  • [35] Position-Aware Tagging for Aspect Sentiment Triplet Extraction
    Xu, Lu
    Li, Hao
    Lu, Wei
    Bing, Lidong
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2339 - 2349
  • [36] Retinal Vessel Segmentation Method Based on Position-Aware Circular Convolution with Multi-Scale Input
    Jiang, Zhongchuan
    Wu, Yun
    Computer Engineering and Applications, 2023, 59 (21) : 242 - 250
  • [37] PAS: A Position-Aware Similarity Measurement for Sequential Recommendation
    Zeng, Zijie
    Lin, Jing
    Pan, Weike
    Ming, Zhong
    Lu, Zhongqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [38] Position-Aware Relational Transformer for Knowledge Graph Embedding
    Li, Guangyao
    Sun, Zequn
    Hu, Wei
    Cheng, Gong
    Qu, Yuzhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11580 - 11594
  • [39] PA-GGAN: SESSION-BASED RECOMMENDATION WITH POSITION-AWARE GATED GRAPH ATTENTION NETWORK
    Wang, Jinshan
    Xu, Qianfang
    Lei, Jiahuan
    Lin, Chaoqun
    Xiao, Bo
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [40] A POSITION-AWARE LINEAR SOLID CONSTITUTIVE MODEL FOR PERIDYNAMICS
    Mitchell, John A.
    Silling, Stewart A.
    Littlewood, David J.
    JOURNAL OF MECHANICS OF MATERIALS AND STRUCTURES, 2015, 10 (05) : 539 - 557