Few-Shot Text Style Transfer via Deep Feature Similarity

Cited by: 22
Authors
Zhu, Anna [1]
Lu, Xiongbo [1]
Bai, Xiang [2]
Uchida, Seiichi [3]
Iwana, Brian Kenji [3]
Xiong, Shengwu [1]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Kyushu Univ, Sch Informat Sci & Elect Engn, Fukuoka 8190395, Japan
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Rendering (computer graphics); Gallium nitride; Image color analysis; Generative adversarial networks; Task analysis; Painting; Few-shot; deep similarity; character content; text style transfer; discriminative network
DOI
10.1109/TIP.2020.2995062
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Generating text in a consistent style from only a few observed, highly stylized text samples is a difficult image processing task. The transfer must account for the typographic style of the text, e.g., font, stroke, color, decoration, and effects. In this paper, we propose a novel approach that stylizes target text by decoding weighted deep features from only a few reference samples. The deep features, comprising the content and style features of each reference text, are extracted from a Convolutional Neural Network (CNN) optimized for character recognition. We then compute similarity scores between the target text and the reference samples by measuring the distance along corresponding channels of the CNN content features, considering content only, and use these scores as weights for aggregating the deep features. To make the stylized text realistic, a discriminative network with an adversarial loss is employed. We demonstrate the effectiveness of our network through experiments on three datasets covering various styles, fonts, and languages. Additionally, we analyze the factors that affect character style transfer, including the character content, the effect of the similarity matrix, the number of reference characters, and the similarity between characters, and we evaluate performance with a new protocol, to give a better understanding of the proposed framework.
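The similarity-weighted aggregation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the distance measure or how scores are normalized, so the Euclidean distance, the per-channel softmax over negative distances, and all function names below are assumptions made for illustration. Feature maps are represented as plain lists, with one flat list of values per channel.

```python
import math

def channel_distances(target, reference):
    """Euclidean distance between two feature maps, one value per channel.

    Assumption: the abstract only says "distance along the corresponding
    channels"; Euclidean distance is chosen here for illustration.
    """
    return [math.dist(t, r) for t, r in zip(target, reference)]

def similarity_weights(target_content, ref_contents):
    """Return weights[k][c]: the weight of reference k for channel c.

    Assumption: distances are turned into weights with a softmax over
    negative distance, so closer references get larger weights and the
    weights for each channel sum to 1.
    """
    dists = [channel_distances(target_content, ref) for ref in ref_contents]
    n_channels = len(target_content)
    weights = [[0.0] * n_channels for _ in ref_contents]
    for c in range(n_channels):
        scores = [math.exp(-d[c]) for d in dists]
        total = sum(scores)
        for k, score in enumerate(scores):
            weights[k][c] = score / total
    return weights

def aggregate(ref_features, weights):
    """Weighted sum of the reference feature maps, channel by channel."""
    n_refs = len(ref_features)
    aggregated = []
    for c in range(len(ref_features[0])):
        size = len(ref_features[0][c])
        aggregated.append([
            sum(weights[k][c] * ref_features[k][c][i] for k in range(n_refs))
            for i in range(size)
        ])
    return aggregated

# Toy data (hypothetical): 2 channels with 2 values each, 2 references.
target_content = [[1.0, 0.0], [0.0, 1.0]]
ref_contents = [
    [[1.0, 0.0], [0.0, 1.0]],  # same content as the target
    [[0.0, 1.0], [1.0, 0.0]],  # different content
]
ref_styles = [
    [[2.0, 2.0], [4.0, 4.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]

weights = similarity_weights(target_content, ref_contents)
stylized_features = aggregate(ref_styles, weights)
```

In this toy setup the first reference matches the target content exactly, so it receives the larger weight in every channel and dominates the aggregated features. In the paper, the aggregated features are then decoded into the stylized image, a step this sketch leaves out.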
Pages: 6932-6946
Number of pages: 15