SOIT: Segmenting Objects with Instance-Aware Transformers

Cited by: 0
Authors
Yu, Xiaodong [1]
Shi, Dahu [1]
Wei, Xing [2]
Ren, Ye [1]
Ye, Tingqun [1]
Tan, Wenming [1]
Affiliations
[1] Hikvis Res Inst, Hangzhou, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
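Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Subject classification codes
081104; 0812; 0835; 1405
Abstract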
This paper presents an end-to-end instance segmentation framework, termed SOIT, that Segments Objects with Instance-aware Transformers. Inspired by DETR, our method views instance segmentation as a direct set prediction problem and effectively removes the need for many hand-crafted components like RoI cropping, one-to-many label assignment, and non-maximum suppression (NMS). In SOIT, multiple queries are learned to directly reason a set of object embeddings of semantic category, bounding-box location, and pixel-wise mask in parallel under the global image context. The class and bounding-box can be easily embedded by a fixed-length vector. The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer. Afterward, a full-resolution mask is produced by the instance-aware transformer without involving any RoI-based operation. Overall, SOIT introduces a simple single-stage instance segmentation framework that is both RoI- and NMS-free. Experimental results on the MS COCO dataset demonstrate that SOIT outperforms state-of-the-art instance segmentation approaches significantly. Moreover, the joint learning of multiple tasks in a unified query embedding can also substantially improve the detection performance. Code is available at https://github.com/yuxiaodongHRI/SOIT.
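The abstract frames instance segmentation as direct set prediction, which avoids one-to-many label assignment and NMS. The sketch below illustrates the general idea of one-to-one matching between queries and ground-truth objects, as commonly used in DETR-style training. It is not taken from the SOIT code: the function name `match_one_to_one`, the cost values, and the brute-force search are all assumptions made for illustration, and a real implementation would use a proper assignment solver rather than enumerating permutations.

```python
from itertools import permutations


def match_one_to_one(cost):
    """Toy one-to-one matching between queries and ground-truth objects.

    cost[q][g] is a hypothetical matching cost between query q and
    ground-truth object g (lower is better). Assumes there are at least
    as many queries as ground-truth objects.

    Returns (assignment, total) where assignment[g] is the query index
    matched to ground-truth g. Brute force, suitable for tiny inputs only.
    """
    num_queries = len(cost)
    num_targets = len(cost[0])
    best_assignment, best_total = None, float("inf")
    # Try every way of picking a distinct query for each ground-truth object.
    for queries in permutations(range(num_queries), num_targets):
        total = sum(cost[q][g] for g, q in enumerate(queries))
        if total < best_total:
            best_assignment, best_total = list(queries), total
    return best_assignment, best_total


# Three queries, two ground-truth objects (made-up costs).
cost = [
    [0.9, 0.2],
    [0.1, 0.8],
    [0.7, 0.6],
]
assignment, total = match_one_to_one(cost)
matched = set(assignment)
# Queries left unmatched would be supervised as "no object".
unmatched = [q for q in range(len(cost)) if q not in matched]
```

Because each ground-truth object is paired with exactly one query, duplicate predictions are penalized during training, which is why a set-prediction model can skip NMS at inference time.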
Pages: 3188-3196
Number of pages: 9