A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

被引:133
|
作者
Guan, Jian [1 ,3 ,4 ]
Huang, Fei [1 ,3 ,4 ]
Zhao, Zhihao [2 ]
Zhu, Xiaoyan [1 ,3 ,4 ]
Huang, Minlie [1 ,3 ,4 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Beihang Univ, Sch Software, Beijing, Peoples R China
[3] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Hong Kong, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
10.1162/tacl_a_00302
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Story generation, namely, generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2) still suffer from repetition, logic conflicts, and lack of long-range coherence in generated stories. We conjecture that this is because of the difficulty of associating relevant commonsense knowledge, understanding the causal relationships, and planning entities and events with proper temporal order. In this paper, we devise a knowledge-enhanced pretraining model for commonsense story generation.We propose to utilize commonsense knowledge from external knowledge bases to generate reasonable stories. To further capture the causal and temporal dependencies between the sentences in a reasonable story, we use multi-task learning, which combines a discriminative objective to distinguish true and fake stories during fine-tuning. Automatic and manual evaluation shows that our model can generate more reasonable stories than state-of-the-art baselines, particularly in terms of logic and global coherence.
引用
收藏
页码:93 / 108
页数:16
相关论文
共 50 条
  • [41] Knowledge-enhanced latent semantic indexing
    Guo, D
    Berry, MW
    Thompson, BB
    Bailin, S
    INFORMATION RETRIEVAL, 2003, 6 (02): : 225 - 250
  • [42] Improving the Applicability of Knowledge-Enhanced Dialogue Generation Systems by Using Heterogeneous Knowledge from Multiple Sources
    Wu, Sixing
    Wang, Minghui
    Li, Ying
    Zhang, Dawei
    Wu, Zhonghai
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1149 - 1157
  • [43] KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
    Xing, Yiran
    Shi, Zai
    Meng, Zhao
    Lakemeyer, Gerhard
    Ma, Yunpu
    Wattenhofer, Roger
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 525 - 535
  • [44] Some thoughts on knowledge-enhanced machine learning
    Cozman, Fabio Gagliardi
    Munhoz, Hugo Neri
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 136 : 308 - 324
  • [45] Knowledge-enhanced graph convolutional network for recommendation
    Xianlun Tang
    Jingming Yang
    Deyi Xiong
    Yang Luo
    Huimin Wang
    Deguang Peng
    Multimedia Tools and Applications, 2022, 81 : 28899 - 28916
  • [46] Knowledge-enhanced graph convolutional network for recommendation
    Tang, Xianlun
    Yang, Jingming
    Xiong, Deyi
    Luo, Yang
    Wang, Huimin
    Peng, Deguang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28899 - 28916
  • [47] Knowledge-Enhanced Retrieval: A Scheme for Question Answering
    Lin, Fake
    Cao, Weican
    Zhang, Wen
    Chen, Liyi
    Hong, Yuan
    Xu, Tong
    Tan, Chang
    CCKS 2021 - EVALUATION TRACK, 2022, 1553 : 102 - 113
  • [48] Knowledge-enhanced document embeddings for text classification
    Sinoara, Roberta A.
    Camacho-Collados, Jose
    Rossi, Rafael G.
    Navigli, Roberto
    Rezende, Solange O.
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 955 - 971
  • [49] KARGEN: Knowledge-Enhanced Automated Radiology Report Generation Using Large Language Models
    Li, Yingshu
    Wang, Zhanyu
    Liu, Yunyi
    Wang, Lei
    Liu, Lingqiao
    Zhou, Luping
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 382 - 392
  • [50] Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model
    Chen, Xiaolin
    Song, Xuemeng
    Jing, Liqiang
    Li, Shuo
    Hu, Linmei
    Nie, Liqiang
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)