ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

Cited by: 3

Authors:

Ying, Kaining [1]

Wang, Zhenhua [1]

Bai, Cong [1]

Zhou, Pengfei [1]

Affiliations:

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

Instance segmentation; end-to-end; deformable attention; position-aware kernel;

DOI:

10.1109/ICASSP43922.2022.9747246

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

Most instance segmentation models are not end-to-end trainable, because they rely either on proposal estimation (RPN) as a pre-processing step or on non-maximum suppression (NMS) as a post-processing step. Here we propose a novel end-to-end instance segmentation method termed ISDA. It recasts the task as predicting a set of object masks, which are generated by a conventional convolution operation using learned position-aware kernels and object features. These kernels and features are learned by a deformable attention network with a multi-scale representation. Thanks to the set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO and achieves leading performance compared with recent models. Code will be available soon.
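
The mask-generation step described in the abstract (convolving object features with one learned kernel per instance) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `predict_masks`, the use of 1x1 kernels, the sigmoid, and the 0.5 threshold are all assumptions made for illustration, and the way ISDA learns the kernels and makes them position-aware is not modeled here.

```python
import math


def _sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_masks(features, kernels, threshold=0.5):
    """Hypothetical helper: apply one 1x1 kernel per instance to a feature map.

    features: nested list of shape C x H x W (assumed shared feature map)
    kernels:  nested list of shape N x C (assumed: one kernel per predicted instance)
    returns:  nested list of shape N x H x W holding binary masks
    """
    channels = len(features)
    height = len(features[0])
    width = len(features[0][0])
    masks = []
    for kernel in kernels:
        if len(kernel) != channels:
            raise ValueError("kernel length must equal the number of feature channels")
        mask = []
        for y in range(height):
            row = []
            for x in range(width):
                # A 1x1 convolution is a dot product over channels at each pixel.
                logit = sum(kernel[c] * features[c][y][x] for c in range(channels))
                row.append(1 if _sigmoid(logit) > threshold else 0)
            mask.append(row)
        masks.append(mask)
    return masks


# Toy input: 2 channels on a 2x2 grid, and 2 instance kernels.
features = [
    [[1.0, -1.0], [1.0, -1.0]],   # channel 0 responds to the left column
    [[-1.0, -1.0], [1.0, 1.0]],   # channel 1 responds to the bottom row
]
kernels = [[4.0, 0.0], [0.0, 4.0]]
masks = predict_masks(features, kernels)
```

Because each kernel yields exactly one mask, the output is a fixed-size set of masks, which is consistent with the set-prediction framing in the abstract, where no NMS step is applied afterwards.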


Pages: 2619-2623

Page count: 5
