Visual Storytelling with Question-Answer Plans

被引:0
|
作者
Liu, Danyang [1 ]
Lapata, Mirella [1 ]
Keller, Frank [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Inst Language Cognit & Computat, 10 Crichton St, Edinburgh EH8 9AB, Midlothian, Scotland
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023 | 2023年
基金
英国工程与自然科学研究理事会;
关键词
KNOWLEDGE; WRITE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual storytelling aims to generate compelling narratives from image sequences. Existing models often focus on enhancing the representation of the image sequence, e.g., with external knowledge sources or advanced graph structures. Despite recent progress, the stories are often repetitive, illogical, and lacking in detail. To mitigate these issues, we present a novel framework which integrates visual representations with pretrained language models and planning. Our model translates the image sequence into a visual prefix, a sequence of continuous embeddings which language models can interpret. It also leverages a sequence of question-answer pairs as a blueprint plan for selecting salient visual concepts and determining how they should be assembled into a narrative. Automatic and human evaluation on the VIST benchmark (Huang et al., 2016) demonstrates that blueprint-based models generate stories that are more coherent, interesting, and natural compared to competitive baselines and state-of-the-art systems.
引用
收藏
页码:5800 / 5813
页数:14
相关论文
共 50 条
  • [21] GENERATION OF MULTIPURPOSE INTELLECTUAL QUESTION-ANSWER SYSTEMS
    PREOBRAZHENSKIY, AB
    RYBINA, GV
    KHOROSHEVSKIY, VF
    ENGINEERING CYBERNETICS, 1979, 17 (06): : 115 - 122
  • [22] Temporal Preparation for Speaking in Question-Answer Sequences
    Magyari, Lilla
    De Ruiter, Jan P.
    Levinson, Stephen C.
    FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [23] The question-answer adjacency pair in dementia discourse
    Varela Suarez, Ana
    INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2018, 28 (01) : 86 - 101
  • [24] Question-Answer Sequences in Survey-Interviews
    Wil Dijkstra
    Yfke Ongena
    Quality and Quantity, 2006, 40 : 983 - 1011
  • [25] THE FIXATION OF KNOWLEDGE AND THE QUESTION-ANSWER PROCESS OF INQUIRY
    Tiercelin, Claudine
    GRAZER PHILOSOPHISCHE STUDIEN, 2008, 77 (01) : 23 - 44
  • [26] Question-answer sequences in survey-interviews
    Dijkstra, Wil
    Ongena, Yfke
    QUALITY & QUANTITY, 2006, 40 (06) : 983 - 1011
  • [27] DGQAN: Dual Graph Question-Answer Attention Networks for Answer Selection
    Yang, Haitian
    Zhao, Xuan
    Wang, Yan
    Li, Min
    Chen, Wei
    Huang, Weiqing
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1230 - 1239
  • [28] GameOfThronesQA: Answer-Aware Question-Answer Pairs for TV Series
    Lahiri, Aritra Kumar
    Hu, Qinmin Vivian
    ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 180 - 189
  • [29] Medical Question-Answer Matching Base on Adversarial Training
    Fu, Jieqiong
    Sun, Yawei
    Liu, Jianyi
    Li, Jinbin
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2022, 45 (04): : 37 - 43
  • [30] An Interactive Question-Answer System with Dialogue for a Receptionist Avatar
    Wantroba, Ewerton J.
    Romero, Roseli A. F.
    2015 12TH LATIN AMERICAN ROBOTICS SYMPOSIUM AND 2015 3RD BRAZILIAN SYMPOSIUM ON ROBOTICS (LARS-SBR), 2015, : 360 - 365