Referring Image Matting

Cited by: 7
Authors
Li, Jizhizi [1]
Zhang, Jing [1]
Tao, Dacheng [1]
Affiliations
[1] Univ Sydney, Sydney, Australia
DOI
10.1109/CVPR52729.2023.02150
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Conventional image matting either requires a user-defined scribble or trimap to extract a specific foreground object, or extracts all foreground objects in the image indiscriminately. In contrast, this paper introduces a new task, Referring Image Matting (RIM), which aims to extract the meticulous alpha matte of the specific object that best matches a given natural language description, enabling a more natural and simpler instruction for image matting. First, we establish a large-scale, challenging dataset, RefMatte, by designing a comprehensive image composition and expression generation engine that automatically produces high-quality images along with diverse text attributes based on public datasets. RefMatte consists of 230 object categories, 47,500 images, 118,749 expression-region entities, and 474,996 expressions. Additionally, we construct a real-world test set of 100 high-resolution natural images with manually annotated complex phrases to evaluate the out-of-domain generalization ability of RIM methods. Furthermore, we present a novel baseline method for RIM, CLIPMat, which includes a context-embedded prompt, a text-driven semantic pop-up, and a multi-level details extractor. Extensive experiments on RefMatte in both keyword and expression settings validate the superiority of CLIPMat over representative methods. We hope this work provides novel insights into image matting and encourages more follow-up studies. The dataset, code, and models are available at https://github.com/JizhiziLi/RIM.
Pages: 22448-22457
Page count: 10
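
Illustrative note: the abstract refers to alpha mattes and to an image composition engine without describing either in detail. As a minimal sketch of the general idea, the snippet below applies the standard alpha compositing equation, I = alpha * F + (1 - alpha) * B, to a tiny grayscale example. It is not taken from the RefMatte or CLIPMat code; the function name `composite` and the 2x2 inputs are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: standard alpha compositing on nested lists.
# This is NOT the RefMatte composition engine; names and data are hypothetical.

def composite(foreground, background, alpha):
    """Blend two same-sized grayscale images pixel by pixel.

    Each argument is a list of rows of floats in [0, 1]; `alpha` is the matte.
    Implements I = alpha * F + (1 - alpha) * B for every pixel.
    """
    return [
        [a * f + (1.0 - a) * b for f, b, a in zip(f_row, b_row, a_row)]
        for f_row, b_row, a_row in zip(foreground, background, alpha)
    ]


# Hypothetical 2x2 example: a white foreground over a black background.
foreground = [[1.0, 1.0], [1.0, 1.0]]
background = [[0.0, 0.0], [0.0, 0.0]]
alpha = [[1.0, 0.5], [0.25, 0.0]]  # 1.0 = fully foreground, 0.0 = fully background

image = composite(foreground, background, alpha)
print(image)
```

With a white foreground and a black background, the composited image equals the alpha matte itself, which makes clear why a matting method that recovers alpha accurately can place the extracted object onto any new background.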