A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

Cited by: 133
Authors
Guan, Jian [1, 3, 4]
Huang, Fei [1, 3, 4]
Zhao, Zhihao [2]
Zhu, Xiaoyan [1, 3, 4]
Huang, Minlie [1, 3, 4]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Beihang Univ, Sch Software, Beijing, Peoples R China
[3] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Hong Kong, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
Funding
National Key Research and Development Program of China; National Science Foundation (USA);
Keywords
DOI
10.1162/tacl_a_00302
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Story generation, namely, generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2) still suffer from repetition, logic conflicts, and lack of long-range coherence in generated stories. We conjecture that this is because of the difficulty of associating relevant commonsense knowledge, understanding the causal relationships, and planning entities and events with proper temporal order. In this paper, we devise a knowledge-enhanced pretraining model for commonsense story generation. We propose to utilize commonsense knowledge from external knowledge bases to generate reasonable stories. To further capture the causal and temporal dependencies between the sentences in a reasonable story, we use multi-task learning, which combines a discriminative objective to distinguish true and fake stories during fine-tuning. Automatic and manual evaluation shows that our model can generate more reasonable stories than state-of-the-art baselines, particularly in terms of logic and global coherence.
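The multi-task objective mentioned in the abstract can be pictured as a generation loss plus a weighted classification loss. The following is only a minimal sketch under assumptions not stated in this record: the function names, the binary true/fake label, and the `weight` parameter are hypothetical, and the paper's actual formulation may differ.

```python
import math

def cross_entropy(logits, target):
    """Negative log-likelihood of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def lm_loss(token_logits, token_ids):
    """Mean next-token cross-entropy over one story."""
    total = sum(cross_entropy(l, t) for l, t in zip(token_logits, token_ids))
    return total / len(token_ids)

def multitask_loss(token_logits, token_ids, cls_logits, is_true_story, weight=1.0):
    """Generation loss plus a weighted true/fake classification loss.

    Assumption: a two-way label (0 = fake, 1 = true) and a scalar `weight`;
    neither is specified in the abstract.
    """
    label = 1 if is_true_story else 0
    return lm_loss(token_logits, token_ids) + weight * cross_entropy(cls_logits, label)
```

Under this sketch, setting `weight=0.0` reduces the objective to plain language-model fine-tuning, which makes the contribution of the discriminative term easy to isolate.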
Pages: 93-108
Page count: 16
Related papers
50 in total
  • [31] Story Completion with Explicit Modeling of Commonsense Knowledge
    Zhang, Mingda
    Ye, Keren
    Hwa, Rebecca
    Kovashka, Adriana
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1543 - 1546
  • [32] Knowledge-Enhanced Latent Semantic Indexing
    David Guo
    Michael W. Berry
    Bryan B. Thompson
    Sidney Bailin
    Information Retrieval, 2003, 6 : 225 - 250
  • [33] Medication Recommendation Based on a Knowledge-enhanced Pre-training Model
    Wang, Mengzhen
    Chen, Jianhui
    Lin, Shaofu
    PROCEEDINGS OF 2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS AND SPECIAL SESSIONS: (WI-IAT WORKSHOP/SPECIAL SESSION 2021), 2021, : 290 - 294
  • [34] KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness
    Li, Yichuan
    Han, Jialong
    Lee, Kyumin
    Ma, Chengyuan
    Yao, Benjamin
    Liu, Derek
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 6864 - 6876
  • [35] Incorporating Structured Commonsense Knowledge in Story Completion
    Chen, Jiaao
    Chen, Jianshu
    Yu, Zhou
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6244 - 6251
  • [36] EHR-KnowGen: Knowledge-enhanced multimodal learning for disease diagnosis generation
    Niu, Shuai
    Ma, Jing
    Bai, Liang
    Wang, Zhihua
    Guo, Li
    Yang, Xian
    INFORMATION FUSION, 2024, 102
  • [37] Ontological issues for knowledge-enhanced search
    McGuinness, DL
    FORMAL ONTOLOGY IN INFORMATION SYSTEMS, 1998, 46 : 302 - 316
  • [38] Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)
    Fu, Ze
    Feng, Junhao
    Zheng, Changmeng
    Cai, Yi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12947 - 12948
  • [39] Knowledge-Enhanced Learning for KG Embedding
    Zhang, Haodi
    Chen, Zhao
    Nie, Jinyin
    Jiang, Di
    Fan, Lixin
    Wu, Kaishun
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 843 - 850
  • [40] A survey on knowledge-enhanced multimodal learning
    Lymperaiou, Maria
    Stamou, Giorgos
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (10)