Bimodal Neural Style Transfer for Image Generation Based on Text Prompts

被引:0
|
作者
Gutierrez, Diego [1 ]
Mendoza, Marcelo [2 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Informat, Av Vicuna Mackenna 3939, Santiago, Chile
[2] Pontificia Univ Catolica Chile, Dept Comp Sci, Av Vicuna Mackenna 6840, Santiago, Chile
来源
关键词
Generative models; Creative AI; Image generation;
D O I
10.1007/978-3-031-34732-0_29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural networks have become one of the essential areas in Artificial Intelligence due to their extraordinary capacity to address problems in different domains. This ability led to the proposal of novel architectures and models to tackle challenging tasks such as neural style transfer. We propose a novel methodology for bimodal style transfer using text as input. We initially retrieve one image and a short descriptive text, which are mapped into a multimodal common latent space. Then, a new image is retrieved using an image retrieval engine. Finally, we use a generative model, which allows us to create artistic images by combining content and style. The proposed system can retrieve semantically similar images concerning a descriptive text (prompt), achieving great precision rates in image retrieval applied to the SemArt dataset. The transfer style neural model also preserves the image's high quality, combining style and content.
引用
收藏
页码:379 / 390
页数:12
相关论文
共 50 条
  • [21] Generative adversarial text-to-image generation with style image constraint
    Zekang Wang
    Li Liu
    Huaxiang Zhang
    Dongmei Liu
    Yu Song
    Multimedia Systems, 2023, 29 : 3291 - 3303
  • [22] An Edge Filter Based Approach of Neural Style Transfer to the Image Stylization
    Bagwari, Shubham
    Choudhary, Kanika
    Raikwar, Suresh
    Nijhawan, Rahul
    Kumar, Sunil
    Shah, Mohd Asif
    IEEE ACCESS, 2022, 10 : 104612 - 104621
  • [23] Cartoon-Style Image Rendering Transfer Based on Neural Networks
    Wang, Lei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [24] Dermoscopic Image Classification with Neural Style Transfer
    Li, Yutong
    Zhu, Ruoqing
    Yeh, Mike
    Qu, Annie
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (04) : 1318 - 1331
  • [25] Increasing Image Memorability with Neural Style Transfer
    Siarohin, Aliaksandr
    Zen, Gloria
    Majtanovic, Cveta
    Alameda-Pineda, Xavier
    Ricci, Elisa
    Sebe, Nicu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [26] RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation with Natural Prompts
    Liu, Han
    Wu, Yuhao
    Zhai, Shixuan
    Yuan, Bo
    Zhang, Ning
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20585 - 20594
  • [27] Bimodal Network Architectures for Automatic Generation of Image Annotation from Text
    Moradi, Mehdi
    Madani, Ali
    Gur, Yaniv
    Guo, Yufan
    Syeda-Mahmood, Tanveer
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT I, 2018, 11070 : 449 - 456
  • [28] Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
    Pan, Zhihong
    Zhou, Xin
    Tian, Hao
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4450 - 4460
  • [29] SEM-CS: SEMANTIC CLIPSTYLER FOR TEXT-BASED IMAGE STYLE TRANSFER
    Kamra, Chanda Grover
    Mastan, Indra Deep
    Gupta, Debayan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 395 - 399
  • [30] TST-GAN: A Legal Document Generation Model Based on Text Style Transfer
    Li, Xiaolin
    Huang, Lei
    Zhou, Yifan
    Shao, Changcheng
    2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 90 - 93