SOIT: Segmenting Objects with Instance-Aware Transformers

被引:0
|
作者
Yu, Xiaodong [1 ]
Shi, Dahu [1 ]
Wei, Xing [2 ]
Ren, Ye [1 ]
Ye, Tingqun [1 ]
Tan, Wenming [1 ]
机构
[1] Hikvis Res Inst, Hangzhou, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an end-to-end instance segmentation framework, termed SOIT, that Segments Objects with Instance-aware Transformers. Inspired by DETR, our method views instance segmentation as a direct set prediction problem and effectively removes the need for many hand-crafted components like Rot cropping, one-to-many label assignment, and non-maximum suppression (NMS). In SOIT, multiple queries are learned to directly reason a set of object embeddings of semantic category, bounding-box location, and pixel-wise mask in parallel under the global image context. The class and bounding-box can be easily embedded by a fixed-length vector. The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer. Afterward, a full-resolution mask is produced by the instance-aware transformer without involving any RoF-based operation. Overall, SOIT introduces a simple single-stage instance segmentation framework that is both Rot- and NMS-free. Experimental results on the MS COCO dataset demonstrate that SOIT outperforms state-of-the-art instance segmentation approaches significantly. Moreover, the joint learning of multiple tasks in a unified query embedding can also substantially improve the detection performance. Code is available at https://github.com/yuxiaodongHRI/SOIT.
引用
收藏
页码:3188 / 3196
页数:9
相关论文
共 50 条
  • [41] Instance-Aware Deep Graph Learning for Multi-Label Classification
    Wang, Yun
    Zhang, Tong
    Zhou, Chuanwei
    Cui, Zhen
    Yang, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 90 - 99
  • [42] Instance-Aware Diffusion Model for Gland Segmentation in Colon Histology Images
    Sun, Mengxue
    Huang, Wenhui
    Zheng, Yuanjie
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 662 - 672
  • [43] SEMANTIC AND INSTANCE-AWARE PIXEL-ADAPTIVE CONVOLUTION FOR PANOPTIC SEGMENTATION
    Song, Sumin
    Sagong, Min-Cheol
    Jung, Seung-Won
    Ko, Sung-Jea
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 16 - 20
  • [44] Instance-Aware Distillation for Efficient Object Detection in Remote Sensing Images
    Li, Cong
    Cheng, Gong
    Wang, Guangxing
    Zhou, Peicheng
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [45] Real-Time Instance-Aware Segmentation and Semantic Mapping on Edge Devices
    Lu, Junjie
    Tian, Bailing
    Shen, Hongming
    Zhang, Xuewei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [46] Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency
    Lee, Seokju
    Im, Sunghoon
    Lin, Stephen
    Kweon, In So
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1863 - 1872
  • [47] Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes
    Yu, Zhiyuan
    Qin, Zheng
    Zheng, Lintao
    Xu, Kai
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19605 - 19614
  • [48] Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery
    Grinvald, Margarita
    Furrer, Fadri
    Novkovic, Tonci
    Chung, Jen Jen
    Cadena, Cesar
    Siegwart, Roland
    Nieto, Juan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) : 3037 - 3044
  • [49] Instance-aware diversity feature generation for unsupervised person re-identification
    Zhang, Xiaowei
    Dou, Xiao
    Zhao, Xinpeng
    Li, Guocong
    Wang, Zekang
    DISPLAYS, 2024, 83
  • [50] Disentangled Face Attribute Editing via Instance-Aware Latent Space Search
    Han, Yuxuan
    Yang, Jiaolong
    Fu, Ying
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 715 - 721