Referring Image Matting

被引:7
|
作者
Li, Jizhizi [1 ]
Zhang, Jing [1 ]
Tao, Dacheng [1 ]
机构
[1] Univ Sydney, Sydney, Australia
关键词
D O I
10.1109/CVPR52729.2023.02150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different from conventional image matting, which either requires user-defined scribbles/trimap to extract a specific foreground object or directly extracts all the foreground objects in the image indiscriminately, we introduce a new task named Referring Image Matting (RIM) in this paper, which aims to extract the meticulous alpha matte of the specific object that best matches the given natural language description, thus enabling a more natural and simpler instruction for image matting. First, we establish a large-scale challenging dataset RefMatte by designing a comprehensive image composition and expression generation engine to automatically produce high-quality images along with diverse text attributes based on public datasets. RefMatte consists of 230 object categories, 47,500 images, 118,749 expression-region entities, and 474,996 expressions. Additionally, we construct a real-world test set with 100 high-resolution natural images and manually annotate complex phrases to evaluate the out-of-domain generalization abilities of RIM methods. Furthermore, we present a novel baseline method CLIPMat for RIM, including a context-embedded prompt, a text-driven semantic pop-up, and a multi-level details extractor. Extensive experiments on RefMatte in both keyword and expression settings validate the superiority of CLIPMat over representative methods. We hope this work could provide novel insights into image matting and encourage more followup studies. The dataset, code and models are available at https://github.com/JizhiziLi/RIM.
引用
收藏
页码:22448 / 22457
页数:10
相关论文
共 50 条
  • [41] Automatic Image Matting Using Component-Hue-Difference-Based Spectral Matting
    Hu, Wu-Chih
    Hsu, Jung-Fu
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2012), PT II, 2012, 7197 : 148 - 157
  • [42] Deep Learning Methods in Image Matting: A Survey
    Huang, Lingtao
    Liu, Xipeng
    Wang, Xuelin
    Li, Jiangqi
    Tan, Benying
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [43] Deep Image Matting With Sparse User Interactions
    Wei, Tianyi
    Chen, Dongdong
    Zhou, Wenbo
    Liao, Jing
    Zhao, Hanqing
    Zhang, Weiming
    Hua, Gang
    Yu, Nenghai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 881 - 895
  • [44] Automatic image matting and fusing for portrait synthesis
    Zhike YI
    Wenfeng SONG
    Shuai LI
    Aimin HAO
    ScienceChina(InformationSciences), 2022, 65 (02) : 235 - 237
  • [45] Text-Guided Portrait Image Matting
    Xu Y.
    Yao X.
    Liu B.
    Quan Y.
    Ji H.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (08): : 1 - 13
  • [46] A Survey on Pre-Processing in Image Matting
    Gui-Lin Yao
    Journal of Computer Science and Technology, 2017, 32 : 122 - 138
  • [47] A Hierarchical Framework on Affinity Based Image Matting
    Yao G.-L.
    Zhao Z.-J.
    Su X.-D.
    Xin H.-T.
    Hu W.
    Qin X.-L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (01): : 209 - 223
  • [48] Image matting in the perception granular deep learning
    Hu, Hong
    Pang, Liang
    Shi, Zhongzhi
    KNOWLEDGE-BASED SYSTEMS, 2016, 102 : 51 - 63
  • [49] Research on image matting technology based on image edge detection
    Zhang Rongxue
    Zhao Tingting
    Renna
    Wang Hongjiang
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2012, : 1199 - 1203
  • [50] Natural image matting based on surrogate model
    Liang, Yihui
    Gou, Hongshan
    Feng, Fujian
    Liu, Guisong
    Huang, Han
    APPLIED SOFT COMPUTING, 2023, 143