Visual Storytelling with Question-Answer Plans

被引:0
|
作者
Liu, Danyang [1 ]
Lapata, Mirella [1 ]
Keller, Frank [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Inst Language Cognit & Computat, 10 Crichton St, Edinburgh EH8 9AB, Midlothian, Scotland
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023 | 2023年
基金
英国工程与自然科学研究理事会;
关键词
KNOWLEDGE; WRITE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual storytelling aims to generate compelling narratives from image sequences. Existing models often focus on enhancing the representation of the image sequence, e.g., with external knowledge sources or advanced graph structures. Despite recent progress, the stories are often repetitive, illogical, and lacking in detail. To mitigate these issues, we present a novel framework which integrates visual representations with pretrained language models and planning. Our model translates the image sequence into a visual prefix, a sequence of continuous embeddings which language models can interpret. It also leverages a sequence of question-answer pairs as a blueprint plan for selecting salient visual concepts and determining how they should be assembled into a narrative. Automatic and human evaluation on the VIST benchmark (Huang et al., 2016) demonstrates that blueprint-based models generate stories that are more coherent, interesting, and natural compared to competitive baselines and state-of-the-art systems.
引用
收藏
页码:5800 / 5813
页数:14
相关论文
共 50 条
  • [41] Question-Answer Selection in User to User Marketplace Conversations
    Kumar, Girish
    Henderson, Matthew
    Chan, Shannon
    Nguyen, Hoang
    Ngoo, Lucas
    9TH INTERNATIONAL WORKSHOP ON SPOKEN DIALOGUE SYSTEM TECHNOLOGY, 2019, 579 : 397 - 403
  • [42] A proposal for a web information extraction and question-answer system
    Saias, Jose
    Quaresma, Paulo
    ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 316 - +
  • [43] Enriching Domain Ontologies using Question-Answer Datasets
    Subhashree, S.
    Kumar, P. Sreenivasa
    PROCEEDINGS OF THE ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA (CODS-COMAD'18), 2018, : 329 - 332
  • [44] Open Information Extraction from Question-Answer Pairs
    Bhutani, Nikita
    Suhara, Yoshihiko
    Tan, Wang-Chiew
    Halevy, Alon
    Jagadish, H. V.
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2294 - 2305
  • [45] Question-Answer Expert System for Ship Collision Avoidance
    Sosnin, Peter
    PROCEEDINGS ELMAR-2009, 2009, : 185 - 188
  • [46] STRUCTURALLY MEANINGFUL OF QUESTION-ANSWER UNITY IN THE GENRE OF PREACHING
    Itskovich, T. V.
    JOURNAL OF MINING INSTITUTE, 2005, 160 (01): : 30 - 32
  • [47] Question-answer maps as an epistemological tool in teacher education
    Florensa, I.
    Bosch, M.
    Gascon, J.
    JOURNAL OF MATHEMATICS TEACHER EDUCATION, 2021, 24 (02) : 203 - 225
  • [48] AI-Driven Question-answer Service Matching
    Yang, Mengqin
    Zhong, Jiang
    Hu, Peiyun
    Cui, Lei
    2017 SECOND INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE), 2017, : 141 - 145
  • [49] Automatic question-answer pairs generation and question similarity mechanism in question answering system
    Shivani G. Aithal
    Abishek B. Rao
    Sanjay Singh
    Applied Intelligence, 2021, 51 : 8484 - 8497
  • [50] Automatic question-answer pairs generation and question similarity mechanism in question answering system
    Aithal, Shivani G.
    Rao, Abishek B.
    Singh, Sanjay
    APPLIED INTELLIGENCE, 2021, 51 (11) : 8484 - 8497