Text-Guided Portrait Image Matting

Cited by: 0

Authors:

Xu Y. [1]

Yao X. [1]

Liu B. [1]

Quan Y. [1]

Ji H. [2]

Affiliations:

[1] School of Computer Science and Engineering, South China University of Technology, Guangzhou

[2] Department of Mathematics, National University of Singapore

Source:

IEEE Transactions on Artificial Intelligence | 2024 / Vol. 5 / No. 08

Keywords:

Annotations; Artificial intelligence; Artificial neural networks; Attention; Batch production systems; Cross-modal Learning; Data mining; Feature extraction; Image Matting; Text Guidance; Training

DOI:

10.1109/TAI.2024.3363120

Abstract:

Image matting is a technique used to separate the foreground of an image from the background, which estimates an alpha matte that indicates pixel-wise degree of transparency. To precisely extract target objects and address the ambiguity of solutions in image matting, many existing approaches employ a trimap or background image provided by the user as additional input to guide the matting process. This paper introduces a novel matting paradigm termed text-guided image matting, utilizing a textual description of the foreground object as a guiding element. In contrast to trimap or background-based methods, text-guided matting offers a user-friendly interface, providing semantic clues for the objects of interest. Moreover, it facilitates batch processing across multiple frames featuring the same objects of interest. The proposed text-guided matting approach is implemented through a deep neural network comprising three-stage cross-modal feature fusion and two-step alpha matte prediction. Experimental results on portrait matting demonstrate the competitive performance of our text-guided approach compared to existing trimap-based and background-based methods.
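For readers unfamiliar with the term, the role of an alpha matte can be illustrated with the standard compositing model from the matting literature, I = alpha * F + (1 - alpha) * B, which the abstract does not state explicitly and is assumed here. The sketch below is not the paper's network; it only shows how a given matte blends a foreground over a background, and the function names and pixel values are hypothetical.

```python
def composite(alpha, foreground, background):
    """Blend one pixel value: I = alpha * F + (1 - alpha) * B."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * foreground + (1.0 - alpha) * background


def composite_image(alpha_matte, foreground, background):
    """Apply the blend pixel by pixel to 2-D grayscale images (lists of rows)."""
    return [
        [composite(a, f, b) for a, f, b in zip(a_row, f_row, b_row)]
        for a_row, f_row, b_row in zip(alpha_matte, foreground, background)
    ]


# Hypothetical 2x2 example: alpha 1.0 is fully foreground, 0.0 fully background.
alpha_matte = [[1.0, 0.5], [0.0, 0.25]]
foreground = [[200.0, 200.0], [200.0, 200.0]]
background = [[40.0, 40.0], [40.0, 40.0]]
result = composite_image(alpha_matte, foreground, background)
```

Matting is the inverse problem: given only the composite image, estimate the matte. Because many combinations of matte, foreground, and background yield the same image, extra guidance (a trimap, a background image, or, in this paper, a text description) is used to narrow down the solution.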

Pages: 1 / 13

Page count: 12
