Coarse Mask Guided Interactive Object Segmentation

被引:3
|
作者
Li, Jing [1 ,2 ]
Fan, Junsong [3 ,4 ]
Wang, Yuxi [3 ,4 ]
Yang, Yuran [5 ]
Zhang, Zhaoxiang [4 ,6 ,7 ,8 ]
机构
[1] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[3] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Beijing 100190, Peoples R China
[4] HKISI CAS, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China
[5] Tencent Maps, Beijing 100101, Peoples R China
[6] Chinese Acad Sci CASIA, Inst Automat, Beijing 100190, Peoples R China
[7] Univ Chinese Acad Sci UCAS, Sch Future Technol, Beijing 100049, Peoples R China
[8] State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Segmentation; interactive; transformer; annotation tool; RANDOM-WALKS; IMAGE; CUT;
D O I
10.1109/TIP.2023.3322564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive object segmentation aims to produce object masks with user interactions, such as clicks, bounding boxes, and scribbles. Click point is the most popular interactive cue for its efficiency, and related deep learning methods have attracted lots of interest in recent years. Most works encode click points as gaussian maps and concatenate them with images as the model's input. However, the spatial and semantic information of gaussian maps would be noised through multiple convolution layers and won't be fully exploited by top layers for mask prediction. To pass click information to top layers exactly and efficiently, we propose a coarse mask guided model (CMG) which predicts coarse masks with a coarse module to guide the object mask prediction. Specifically, the coarse module encodes user clicks as query features and enriches their semantic information with backbone features through transformer layers, coarse masks are generated based on the enriched query feature and fed into CMG's decoder. Benefiting from the efficiency of transformer, CMG's coarse module and decoder module are lightweight and computationally efficient, making the interaction process more smooth. Experiments on several segmentation benchmarks demonstrate the effectiveness of our method, and we get new state-of-the-art results compared with previous works.
引用
收藏
页码:5808 / 5822
页数:15
相关论文
共 50 条
  • [31] Complementary Coarse-to-Fine Matching for Video Object Segmentation
    Chen, Zhen
    Yang, Ming
    Zhang, Shiliang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [32] Coarse adaptive color image segmentation for visual object classification
    Pujol, A.
    Chen, L.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 157 - 160
  • [33] User Interactive Object Extraction with Sequential Image Segmentation
    Saglam, Ali
    Baykan, Nurdan Akhan
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [34] An interactive authoring system for video object segmentation and annotation
    Luo, HT
    Eleftheriadis, A
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2002, 17 (07) : 559 - 572
  • [35] Interactive Object Segmentation With Inside-Outside Guidance
    Zhang, Shiyin
    Wei, Shikui
    Liew, Jun Hao
    Han, Kunyang
    Zhao, Yao
    Wei, Yunchao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8594 - 8605
  • [36] Interactive Object Segmentation System from a Video Sequence
    Bae, Guntae
    Kwak, Sooyeong
    Byun, Hyeran
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION AND INTERACTION, PT II, 2009, 5618 : 221 - 228
  • [37] Coherent Parametric Contours for Interactive Video Object Segmentation
    Lu, Yao
    Bai, Xue
    Shapiro, Linda
    Wang, Jue
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 642 - 650
  • [38] Siamese Network with Interactive Transformer for Video Object Segmentation
    Lan, Meng
    Zhang, Jing
    He, Fengxiang
    Zhang, Lefei
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1228 - 1236
  • [39] Multiple object tracking with segmentation and interactive multiple model
    Qi, Ke
    Xu, Wenhao
    Chen, Wenbin
    Tao, Xi
    Chen, Peijia
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
  • [40] A model-based interactive object segmentation procedure
    Zou, J
    WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 522 - 527