A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

被引:133
|
作者
Guan, Jian [1 ,3 ,4 ]
Huang, Fei [1 ,3 ,4 ]
Zhao, Zhihao [2 ]
Zhu, Xiaoyan [1 ,3 ,4 ]
Huang, Minlie [1 ,3 ,4 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Beihang Univ, Sch Software, Beijing, Peoples R China
[3] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Hong Kong, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
10.1162/tacl_a_00302
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Story generation, namely, generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2) still suffer from repetition, logic conflicts, and lack of long-range coherence in generated stories. We conjecture that this is because of the difficulty of associating relevant commonsense knowledge, understanding the causal relationships, and planning entities and events with proper temporal order. In this paper, we devise a knowledge-enhanced pretraining model for commonsense story generation.We propose to utilize commonsense knowledge from external knowledge bases to generate reasonable stories. To further capture the causal and temporal dependencies between the sentences in a reasonable story, we use multi-task learning, which combines a discriminative objective to distinguish true and fake stories during fine-tuning. Automatic and manual evaluation shows that our model can generate more reasonable stories than state-of-the-art baselines, particularly in terms of logic and global coherence.
引用
收藏
页码:93 / 108
页数:16
相关论文
共 50 条
  • [1] Knowledge-Enhanced Visual-Language Pretraining for Computational Pathology
    Zhou, Xiao
    Zhang, Xiaoman
    Wu, Chaoyi
    Zhang, Ya
    Xie, Weidi
    Wang, Yanfeng
    COMPUTER VISION - ECCV 2024, PT LII, 2025, 15110 : 345 - 362
  • [2] A Survey of Knowledge-enhanced Text Generation
    Yu, Wenhao
    Zhu, Chenguang
    Li, Zaitang
    Hu, Zhiting
    Wang, Qingyun
    Ji, Heng
    Jiang, Meng
    ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [3] Knowledge-enhanced Prompt Learning for Open-domain Commonsense Reasoning
    Zhao, Xujiang
    Liu, Yanchi
    Cheng, Wei
    Oishi, Mika
    Osaki, Takao
    Matsuda, Katsushi
    Chen, Haifeng
    NEC Technical Journal, 2024, 17 (02): : 91 - 95
  • [4] Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation
    Bian, Ning
    Han, Xianpei
    Chen, Bo
    Sun, Le
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12574 - 12582
  • [5] CEG: A joint model for causal commonsense events enhanced story ending generation
    Zhang, Yushi
    Yang, Yan
    Gu, Ming
    Gao, Feng
    Chen, Chengcai
    He, Liang
    PLOS ONE, 2023, 18 (05):
  • [6] Story Ending Generation with Incremental Encoding and Commonsense Knowledge
    Guan, Jian
    Wang, Yansen
    Huang, Minlie
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6473 - 6480
  • [7] Knowledge-Enhanced Evidence Retrieval for Counterargument Generation
    Jo, Yohan
    Yoo, Haneul
    Bak, JinYeong
    Oh, Alice
    Reed, Chris
    Hovy, Eduard
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3074 - 3094
  • [8] Retrieval Enhanced Model for Commonsense Generation
    Wang, Han
    Liu, Yang
    Zhu, Chenguang
    Shou, Linjun
    Gong, Ming
    Xu, Yichong
    Zeng, Michael
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3056 - 3062
  • [9] Multiple Knowledge-Enhanced Meteorological Social Briefing Generation
    Shi, Kaize
    Peng, Xueping
    Lu, Hao
    Zhu, Yifan
    Niu, Zhendong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2002 - 2013
  • [10] A Knowledge-Enhanced Framework for Imitative Transportation Trajectory Generation
    Zhu, Qingyan
    Chen, Yize
    Wang, Hao
    Zeng, Zhenyu
    Liu, Hao
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 823 - 832