A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

被引：133

作者：

Guan, Jian ^{[1
,3
,4
]}

Huang, Fei ^{[1
,3
,4
]}

Zhao, Zhihao ^{[2
]}

Zhu, Xiaoyan ^{[1
,3
,4
]}

Huang, Minlie ^{[1
,3
,4
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

[2] Beihang Univ, Sch Software, Beijing, Peoples R China

[3] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Hong Kong, Peoples R China

[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China

来源：

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS | 2020年 / 8卷

基金：

国家重点研发计划; 美国国家科学基金会;

关键词：

D O I：

10.1162/tacl_a_00302

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Story generation, namely, generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2) still suffer from repetition, logic conflicts, and lack of long-range coherence in generated stories. We conjecture that this is because of the difficulty of associating relevant commonsense knowledge, understanding the causal relationships, and planning entities and events with proper temporal order. In this paper, we devise a knowledge-enhanced pretraining model for commonsense story generation.We propose to utilize commonsense knowledge from external knowledge bases to generate reasonable stories. To further capture the causal and temporal dependencies between the sentences in a reasonable story, we use multi-task learning, which combines a discriminative objective to distinguish true and fake stories during fine-tuning. Automatic and manual evaluation shows that our model can generate more reasonable stories than state-of-the-art baselines, particularly in terms of logic and global coherence.

引用

页码：93 / 108

页数：16

共 50 条

[41] Knowledge-enhanced latent semantic indexing
Guo, D
Berry, MW
Thompson, BB
Bailin, S
INFORMATION RETRIEVAL, 2003, 6 (02): : 225 - 250
[42] Improving the Applicability of Knowledge-Enhanced Dialogue Generation Systems by Using Heterogeneous Knowledge from Multiple Sources
Wu, Sixing
Wang, Minghui
Li, Ying
Zhang, Dawei
Wu, Zhonghai
WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1149 - 1157
[43] KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
Xing, Yiran
Shi, Zai
Meng, Zhao
Lakemeyer, Gerhard
Ma, Yunpu
Wattenhofer, Roger
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 525 - 535
[44] Some thoughts on knowledge-enhanced machine learning
Cozman, Fabio Gagliardi
Munhoz, Hugo Neri
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 136 : 308 - 324
[45] Knowledge-enhanced graph convolutional network for recommendation
Xianlun Tang
Jingming Yang
Deyi Xiong
Yang Luo
Huimin Wang
Deguang Peng
Multimedia Tools and Applications, 2022, 81 : 28899 - 28916
[46] Knowledge-enhanced graph convolutional network for recommendation
Tang, Xianlun
Yang, Jingming
Xiong, Deyi
Luo, Yang
Wang, Huimin
Peng, Deguang
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28899 - 28916
[47] Knowledge-Enhanced Retrieval: A Scheme for Question Answering
Lin, Fake
Cao, Weican
Zhang, Wen
Chen, Liyi
Hong, Yuan
Xu, Tong
Tan, Chang
CCKS 2021 - EVALUATION TRACK, 2022, 1553 : 102 - 113
[48] Knowledge-enhanced document embeddings for text classification
Sinoara, Roberta A.
Camacho-Collados, Jose
Rossi, Rafael G.
Navigli, Roberto
Rezende, Solange O.
KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 955 - 971
[49] KARGEN: Knowledge-Enhanced Automated Radiology Report Generation Using Large Language Models
Li, Yingshu
Wang, Zhanyu
Liu, Yunyi
Wang, Lei
Liu, Lingqiao
Zhou, Luping
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 382 - 392
[50] Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model
Chen, Xiaolin
Song, Xuemeng
Jing, Liqiang
Li, Shuo
Hu, Linmei
Nie, Liqiang
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)

← 1 2 3 4 5 →