SOIT: Segmenting Objects with Instance-Aware Transformers

被引:0
|
作者
Yu, Xiaodong [1 ]
Shi, Dahu [1 ]
Wei, Xing [2 ]
Ren, Ye [1 ]
Ye, Tingqun [1 ]
Tan, Wenming [1 ]
机构
[1] Hikvis Res Inst, Hangzhou, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an end-to-end instance segmentation framework, termed SOIT, that Segments Objects with Instance-aware Transformers. Inspired by DETR, our method views instance segmentation as a direct set prediction problem and effectively removes the need for many hand-crafted components like Rot cropping, one-to-many label assignment, and non-maximum suppression (NMS). In SOIT, multiple queries are learned to directly reason a set of object embeddings of semantic category, bounding-box location, and pixel-wise mask in parallel under the global image context. The class and bounding-box can be easily embedded by a fixed-length vector. The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer. Afterward, a full-resolution mask is produced by the instance-aware transformer without involving any RoF-based operation. Overall, SOIT introduces a simple single-stage instance segmentation framework that is both Rot- and NMS-free. Experimental results on the MS COCO dataset demonstrate that SOIT outperforms state-of-the-art instance segmentation approaches significantly. Moreover, the joint learning of multiple tasks in a unified query embedding can also substantially improve the detection performance. Code is available at https://github.com/yuxiaodongHRI/SOIT.
引用
收藏
页码:3188 / 3196
页数:9
相关论文
共 50 条
  • [1] Instance-aware Image Colorization
    Su, Jheng-Wei
    Chu, Hung-Kuo
    Huang, Jia-Bin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 7965 - 7974
  • [2] Instance-aware image dehazing
    Chao, Qingqing
    Yan, Jinqiang
    Sun, Tianmeng
    Li, Silong
    Chi, Jieru
    Yang, Guowei
    Chen, Chenglizhao
    Yu, Teng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [3] Instance-Aware Scene Layout Forecasting
    Qiao, Xiaotian
    Zheng, Quanlong
    Cao, Ying
    Lau, Rynson W. H.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (02) : 504 - 516
  • [4] Instance-Aware Scene Layout Forecasting
    Xiaotian Qiao
    Quanlong Zheng
    Ying Cao
    Rynson W. H. Lau
    International Journal of Computer Vision, 2022, 130 : 504 - 516
  • [5] SOTR: Segmenting Objects with Transformers
    Guo, Ruohao
    Niu, Dantong
    Qu, Liao
    Li, Zhenbo
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7137 - 7146
  • [6] Indoor Instance-Aware Semantic Mapping Using Instance Segmentation
    Jiang, Yinpeng
    Ma, Xudong
    Fang, Fang
    Kang, Xuewen
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3549 - 3554
  • [7] iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects
    Jafari, Omid Hosseini
    Mustikovela, Siva Karthik
    Pertsch, Karl
    Brachmann, Eric
    Rother, Carsten
    COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 477 - 492
  • [8] Instance-aware Detailed Action Labeling in Videos
    Yang, Hongtao
    He, Xuming
    Porikli, Fatih
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1623 - 1632
  • [9] Fully Convolutional Instance-aware Semantic Segmentation
    Li, Yi
    Qi, Haozhi
    Dai, Jifeng
    Ji, Xiangyang
    Wei, Yichen
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4438 - 4446
  • [10] InstaNAS: Instance-Aware Neural Architecture Search
    Cheng, An-Chieh
    Lin, Chieh Hubert
    Juan, Da-Cheng
    Wei, Wei
    Sun, Min
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3577 - 3584