Emotion Reinforced Visual Storytelling

被引:17
|
作者
Li, Nanxing [1 ,2 ]
Liu, Bei [3 ]
Han, Zhizhong [4 ]
Liu, Yu-Shen [1 ,2 ]
Fu, Jianlong [3 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Storytelling; Multi-Modal; Emotion; Reinforcement Learning;
D O I
10.1145/3323873.3325050
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Automatic story generation from a sequence of images, i.e., visual storytelling, has attracted extensive attention. The challenges mainly drive from modeling rich visually-inspired human emotions, which results in generating diverse yet realistic stories even from the same sequence of images. Existing works usually adopt sequence-based generative adversarial networks (GAN) by encoding deterministic image content (e.g., concept, attribute), while neglecting probabilistic inference from an image over emotion space. In this paper, we take one step further to create human-level stories by modeling image content with emotions, and generating textual paragraph via emotion reinforced adversarial learning. Firstly, we introduce the concept of emotion engaged in visual storytelling. The emotion feature is a representation of the emotional content of the generated story, which enables our model to capture human emotion. Secondly, stories are generated by recurrent neural network, and further optimized by emotion reinforced adversarial learning with three critics, in which visual relevance, language style, and emotion consistency can be ensured. Our model is able to generate stories based on not only emotions generated by our novel emotion generator, but also customized emotions. The introduction of emotion brings more variety and realistic to visual storytelling. We evaluate the proposed model on the largest visual storytelling dataset (VIST). The superior performance to state-of-the-art methods are shown with extensive experiments.
引用
收藏
页码:297 / 305
页数:9
相关论文
共 50 条
  • [1] Emotion Aware Reinforcement Network for Visual Storytelling
    Li, Xin
    Cai, Hanqing
    Jiang, Tianling
    Liu, Chunping
    Ji, Yi
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 26 - 37
  • [2] Emotion and storytelling
    Juliá, MP
    [J]. ARBOR-CIENCIA PENSAMIENTO Y CULTURA, 2004, 177 (697) : 125 - 156
  • [3] Mapmaking as visual storytelling: the movement and emotion of managing sex work in the urban landscape
    Jordeno, Sara
    Horning, Amber
    [J]. CRIME LAW AND SOCIAL CHANGE, 2024, 81 (05) : 537 - 558
  • [4] Visual Storytelling by Novelette
    Addone, Agnese
    De Donato, Renato
    Palmieri, Giuseppina
    Pellegrino, Maria Angela
    Petta, Andrea
    Scarano, Vittorio
    Serra, Luigi
    [J]. 2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 723 - 728
  • [5] Emotion and narrative: Perspectives in autobiographical storytelling
    Randall, William
    [J]. BRITISH JOURNAL OF PSYCHOLOGY, 2020, 111 (01) : 152 - 154
  • [6] Changing emotion: The use of therapeutic storytelling
    Parker, TS
    Wampler, KS
    [J]. JOURNAL OF MARITAL AND FAMILY THERAPY, 2006, 32 (02) : 155 - 166
  • [7] Emotion and Narrative: Perspectives in Autobiographical Storytelling
    Makela, Petra
    [J]. EMOTIONS AND SOCIETY, 2020, 2 (01): : 109 - 111
  • [8] Visual Storytelling: Inspiring a New Visual Language
    Riccomini, Donald R.
    [J]. TECHNICAL COMMUNICATION, 2012, 59 (04) : 341 - 341
  • [9] Visual Storytelling of Development Sessions
    Minelli, Roberto
    Baracchi, Lorenzo
    Mocci, Andrea
    Lanza, Michele
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 416 - 420
  • [10] VISUAL STORYTELLING IN STREET PHOTOGRAPHY
    Isik, Atila
    [J]. ANADOLU UNIVERSITESI SANAT & TASARIM DERGISI-ANADOLU UNIVERSITY JOURNAL OF ART & DESIGN, 2023, 13 (02): : 511 - 525