A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

Cited by: 133
Authors
Guan, Jian [1 ,3 ,4 ]
Huang, Fei [1 ,3 ,4 ]
Zhao, Zhihao [2 ]
Zhu, Xiaoyan [1 ,3 ,4 ]
Huang, Minlie [1 ,3 ,4 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Beihang Univ, Sch Software, Beijing, Peoples R China
[3] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Hong Kong, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
Funding
National Key Research and Development Program of China; US National Science Foundation
Keywords
DOI
10.1162/tacl_a_00302
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Story generation, namely, generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2) still suffer from repetition, logic conflicts, and lack of long-range coherence in generated stories. We conjecture that this is because of the difficulty of associating relevant commonsense knowledge, understanding the causal relationships, and planning entities and events with proper temporal order. In this paper, we devise a knowledge-enhanced pretraining model for commonsense story generation. We propose to utilize commonsense knowledge from external knowledge bases to generate reasonable stories. To further capture the causal and temporal dependencies between the sentences in a reasonable story, we use multi-task learning, which combines a discriminative objective to distinguish true and fake stories during fine-tuning. Automatic and manual evaluation shows that our model can generate more reasonable stories than state-of-the-art baselines, particularly in terms of logic and global coherence.
Pages: 93-108
Number of pages: 16
Related papers
50 in total
  • [21] A plug-and-play knowledge-enhanced module for medical reports generation
    Han, Qinyu
    Yang, Zhihao
    Lin, Hongfei
    Qin, Tian
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [22] On Knowledge-Enhanced Document Clustering
    Rege, Manjeet
    Koruthu, Josan
    Bailey, Reynold
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2012, 2 (03) : 72 - 82
  • [23] On engineering design generation with XML-based knowledge-enhanced grammars
    Rudolph, S
    Noser, H
    FROM KNOWLEDGE INTENSIVE CAD TO KNOWLEDGE INTENSIVE ENGINEERING, 2002, 79 : 213 - 225
  • [24] Structured Self-Supervised Pretraining for Commonsense Knowledge Graph Completion
    Huang, Jiayuan
    Du, Yangkai
    Tao, Shuting
    Xu, Kun
    Xie, Pengtao
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 1268 - 1284
  • [25] LEMON: A Knowledge-Enhanced, Type-Constrained, and Grammar-Guided Model for Question Generation Over Knowledge Graphs
    Bi, Sheng
    Miao, Zeyi
    Min, Qizhi
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2025, 18 : 256 - 272
  • [26] Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation
    Nie, Weizhi
    Wen, Xin
    Liu, Jing
    Chen, Jiawei
    Wu, Jiancan
    Jin, Guoqing
    Lu, Jing
    Liu, An-An
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1129 - 1142
  • [27] Commonsense Knowledge Graph Completion Via Contrastive Pretraining and Node Clustering
    Wu, Siwei
    Shen, Xiangqing
    Xia, Rui
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 13977 - 13989
  • [28] Explicit and implicit knowledge-enhanced model for event causality identification
    Chen, Siyuan
    Mao, Kezhi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [29] Lorentz equivariant model for knowledge-enhanced hyperbolic collaborative filtering
    Huang, Bosong
    Yu, Weihao
    Xie, Ruzhong
    Luo, Junming
    Xiao, Jing
    Huang, Jin
    KNOWLEDGE-BASED SYSTEMS, 2024, 291
  • [30] A simple and efficient dialogue generation model incorporating commonsense knowledge
    Son, Geonyeong
    Kim, Misuk
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249