SOIT: Segmenting Objects with Instance-Aware Transformers

Authors
Yu, Xiaodong [1]
Shi, Dahu [1]
Wei, Xing [2]
Ren, Ye [1]
Ye, Tingqun [1]
Tan, Wenming [1]
Affiliations
[1] Hikvis Res Inst, Hangzhou, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
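Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code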
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents an end-to-end instance segmentation framework, termed SOIT, that Segments Objects with Instance-aware Transformers. Inspired by DETR, our method views instance segmentation as a direct set prediction problem and effectively removes the need for many hand-crafted components like RoI cropping, one-to-many label assignment, and non-maximum suppression (NMS). In SOIT, multiple queries are learned to directly reason a set of object embeddings of semantic category, bounding-box location, and pixel-wise mask in parallel under the global image context. The class and bounding-box can be easily embedded by a fixed-length vector. The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer. Afterward, a full-resolution mask is produced by the instance-aware transformer without involving any RoI-based operation. Overall, SOIT introduces a simple single-stage instance segmentation framework that is both RoI- and NMS-free. Experimental results on the MS COCO dataset demonstrate that SOIT outperforms state-of-the-art instance segmentation approaches significantly. Moreover, the joint learning of multiple tasks in a unified query embedding can also substantially improve the detection performance. Code is available at https://github.com/yuxiaodongHRI/SOIT.
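The abstract frames instance segmentation as direct set prediction, which is what makes one-to-many label assignment and NMS unnecessary. In DETR-style training this is usually done by matching each ground-truth object to exactly one query at minimum total cost. The sketch below illustrates that one-to-one matching by brute force on a tiny example. It is not taken from the SOIT code: the function name `match_one_to_one` and the cost values are hypothetical, and real implementations would typically use a Hungarian-algorithm solver instead of enumerating permutations.

```python
from itertools import permutations


def match_one_to_one(cost):
    """Find the minimum-cost one-to-one assignment of queries to targets.

    cost[i][j] is the (hypothetical) cost of assigning query i to
    ground-truth object j. Returns (assignment, total_cost), where
    assignment[j] is the index of the query matched to target j.
    Assumes there are at least as many queries as targets.
    Brute force over permutations: only suitable for tiny examples.
    """
    num_queries = len(cost)
    num_targets = len(cost[0])
    best_assignment, best_total = None, float("inf")
    # Each candidate picks num_targets distinct queries, one per target.
    for queries in permutations(range(num_queries), num_targets):
        total = sum(cost[q][t] for t, q in enumerate(queries))
        if total < best_total:
            best_assignment, best_total = queries, total
    return list(best_assignment), best_total


# Made-up costs: 4 queries (rows) x 2 ground-truth objects (columns).
cost = [
    [0.9, 0.2],
    [0.1, 0.8],
    [0.7, 0.3],
    [0.6, 0.9],
]
assignment, total = match_one_to_one(cost)
# Queries left without a target would be supervised as "no object".
unmatched = sorted(set(range(len(cost))) - set(assignment))
print(assignment, unmatched)
```

Because every ground-truth object is claimed by a single query and the remaining queries are trained to predict "no object", duplicate detections are discouraged during training rather than filtered afterward.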
Pages: 3188-3196
Number of pages: 9