Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension

Cited by: 0
Authors
Yang, An [1 ,2 ]
Wang, Quan [2 ]
Liu, Jing [2 ]
Liu, Kai [2 ]
Lyu, Yajuan [2 ]
Wu, Hua [2 ]
She, Qiaoqiao [2 ]
Li, Sujian [1 ]
Affiliations
[1] Peking Univ, MOE, Key Lab Computat Linguist, Beijing, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
Keywords
DOI
Not available
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Machine reading comprehension (MRC) is a crucial and challenging task in NLP. Recently, pre-trained language models (LMs), especially BERT, have achieved remarkable success, presenting new state-of-the-art results in MRC. In this work, we investigate the potential of leveraging external knowledge bases (KBs) to further improve BERT for MRC. We introduce KT-NET, which employs an attention mechanism to adaptively select desired knowledge from KBs, and then fuses the selected knowledge with BERT to enable context- and knowledge-aware predictions. We believe this would combine the merits of both deep LMs and curated KBs towards better MRC. Experimental results indicate that KT-NET offers significant and consistent improvements over BERT, outperforming competitive baselines on the ReCoRD and SQuAD1.1 benchmarks. Notably, it ranks 1st on the ReCoRD leaderboard and is also the best single model on the SQuAD1.1 leaderboard at the time of submission (March 4th, 2019).
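The abstract's core idea, per-token attention over candidate KB concept embeddings followed by fusion with the BERT representation, can be illustrated with a small sketch. The code below is not the authors' released KT-NET implementation; it is a minimal reading of that description under stated assumptions. The module name KnowledgeFusion, the dimensions d_bert and d_kb, and the sentinel parameter are illustrative choices, and the KB embeddings are assumed to be precomputed (e.g., from resources such as WordNet or NELL).

import torch
import torch.nn as nn
import torch.nn.functional as F

class KnowledgeFusion(nn.Module):
    """Illustrative sketch: attend over candidate KB concepts per token, fuse with BERT output."""
    def __init__(self, d_bert: int = 768, d_kb: int = 100):
        super().__init__()
        # Project KB embeddings into the BERT space so they can be scored against tokens.
        self.proj = nn.Linear(d_kb, d_bert, bias=False)
        # A learned sentinel embedding lets a token attend to "no knowledge" (assumed design choice).
        self.sentinel = nn.Parameter(torch.randn(d_kb) * 0.02)

    def forward(self, bert_out, concepts):
        # bert_out: (batch, seq_len, d_bert)         contextual token vectors from BERT
        # concepts: (batch, seq_len, n_cand, d_kb)   candidate KB concept embeddings per token
        b, s, n, dk = concepts.shape
        sentinel = self.sentinel.view(1, 1, 1, dk).expand(b, s, 1, dk)
        cands = torch.cat([concepts, sentinel], dim=2)            # (b, s, n+1, d_kb)
        # Attention scores: relevance of each candidate concept to its token.
        scores = torch.einsum("bsd,bsnd->bsn", bert_out, self.proj(cands))
        alpha = F.softmax(scores, dim=-1)                         # attention weights
        knowledge = torch.einsum("bsn,bsnd->bsd", alpha, cands)   # weighted knowledge vector
        # Fuse by concatenation; downstream MRC layers would consume the joint vector.
        return torch.cat([bert_out, knowledge], dim=-1)           # (b, s, d_bert + d_kb)

Example usage with random tensors, shapes only:

fusion = KnowledgeFusion()
h = torch.randn(2, 16, 768)       # BERT outputs for 2 sequences of 16 tokens
k = torch.randn(2, 16, 5, 100)    # 5 candidate KB concept embeddings per token
out = fusion(h, k)                # (2, 16, 868) context- and knowledge-aware vectors

The sentinel candidate is what makes the selection adaptive: when no retrieved concept is relevant, the attention mass can fall on the sentinel and the fused representation stays close to the original BERT vector.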
Pages: 2346-2357
Number of pages: 12