Affective Image Filter: Reflecting Emotions from Text to Images

Citations: 0
Authors
Weng, Shuchen [1, 2, 4]
Zhang, Peixuan [3]
Chang, Zheng [3]
Wang, Xinlong [4]
Li, Si [3]
Shi, Boxin [1, 2]
Affiliations
[1] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[2] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[4] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
DOI
10.1109/ICCV51070.2023.00992
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Understanding the emotions in text and presenting them visually is a challenging problem that requires both a deep understanding of natural language and high-quality image synthesis. In this work, we propose the Affective Image Filter (AIF), a novel model that understands the visually abstract emotions in text and reflects them in visually concrete images with appropriate colors and textures. We build our model on a multi-modal transformer architecture, which unifies images and texts into tokens and encodes emotional prior knowledge. Various loss functions are proposed to understand complex emotions and produce appropriate visualizations. In addition, we collect and contribute a new dataset with abundant aesthetic images and emotional texts for training and evaluating the AIF model. We carefully design four quantitative metrics and conduct a user study to comprehensively evaluate performance. The results demonstrate that our AIF model outperforms state-of-the-art methods and can evoke specific emotional responses from human observers.
Pages: 10776-10785
Page count: 10