MEM-KGC: Masked Entity Model for Knowledge Graph Completion With Pre-Trained Language Model

Cited by: 9
Authors
Choi, Bonggeun [1 ]
Jang, Daesik [2 ]
Ko, Youngjoong [2 ]
Affiliations
[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, Gyeonggi Do, South Korea
[2] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon 16419, Gyeonggi Do, South Korea
Source
IEEE ACCESS | 2021 / Vol. 9
Keywords
Task analysis; Predictive models; Training; Bit error rate; Semantics; Micromechanical devices; Licenses; Knowledge graph completion; link prediction; masked language model; pre-trained language model;
DOI
10.1109/ACCESS.2021.3113329
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
The knowledge graph completion (KGC) task aims to predict missing links in knowledge graphs. Recently, several KGC models based on translational distance or semantic matching methods have been proposed and have achieved meaningful results. However, existing models share a significant shortcoming: they cannot train embeddings for entities that do not appear in the training phase. As a result, such models fall back on randomly initialized embeddings for entities unseen during training, which causes a critical drop in performance during the test phase. To solve this problem, we propose a new approach that performs the KGC task with the masked language model (MLM) objective used to pre-train language models. Given a triple (head entity, relation, tail entity), we mask the tail entity and treat the head entity and the relation as its context. The model then predicts the masked entity from among all entities, so the task is carried out by the same process as an MLM, which predicts a masked token given a context of tokens. Our experimental results show that the proposed model achieves significantly improved performance when unseen entities appear during the test phase and achieves state-of-the-art performance on the WN18RR dataset.
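As a rough illustration of the idea described in the abstract, the sketch below (not the authors' released code) replaces the tail entity of a triple with a [MASK] token, encodes the head entity and relation as context with a pre-trained BERT encoder, and classifies over the entire entity vocabulary at the masked position. The entity/relation texts, the entity count, and the model name are illustrative assumptions.

import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

class MaskedEntityModel(nn.Module):
    """BERT encoder with a classification head over the entity vocabulary."""
    def __init__(self, num_entities, encoder_name="bert-base-uncased"):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # Analogous to an MLM head, but over entities instead of tokens.
        self.entity_classifier = nn.Linear(hidden, num_entities)

    def forward(self, input_ids, attention_mask, mask_positions):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        hidden_states = out.last_hidden_state                   # (batch, seq_len, hidden)
        batch_idx = torch.arange(input_ids.size(0))
        mask_hidden = hidden_states[batch_idx, mask_positions]  # (batch, hidden)
        return self.entity_classifier(mask_hidden)               # (batch, num_entities)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = MaskedEntityModel(num_entities=40943)  # e.g. the WN18RR entity vocabulary size

# Triple (head, relation, ?): the textual head and relation form the context,
# and the unknown tail entity is replaced by [MASK].
text = f"plant life {tokenizer.sep_token} hypernym {tokenizer.sep_token} {tokenizer.mask_token}"
enc = tokenizer(text, return_tensors="pt")
mask_pos = (enc["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]

with torch.no_grad():
    scores = model(enc["input_ids"], enc["attention_mask"], mask_pos)
predicted_entity_id = scores.argmax(dim=-1)  # index of the top-ranked tail entity

Training such a model would pair these entity scores with a cross-entropy loss against the gold tail-entity index, in the same way an MLM is trained over its token vocabulary.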
Pages: 132025 - 132032
Number of pages: 8
Related Papers
50 records in total
  • [1] Xu, Wenjie; Liu, Ben; Peng, Miao; Jia, Xu; Peng, Min. Pre-trained Language Model with Prompts for Temporal Knowledge Graph Completion. arXiv, 2023.
  • [2] Ju, Jinghao; Yang, Deqing; Liu, Jingping. Commonsense Knowledge Base Completion with Relational Graph Attention Network and Pre-trained Language Model. Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022), 2022: 4104-4108.
  • [3] Wang, Liang; Zhao, Wei; Wei, Zhuoyu; Liu, Jingming. SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1 (Long Papers), 2022: 4281-4294.
  • [4] Yang, Hao; Qin, Ying; Deng, Yao; Wang, Minghan. NMT Enhancement based on Knowledge Graph Mining with Pre-trained Language Model. 2020 22nd International Conference on Advanced Communication Technology (ICACT), 2020: 185-189.
  • [5] Liang, Z.; Wang, H.-Z.; Dai, J.-J.; Shao, X.-Y.; Ding, X.-O.; Mu, T.-Y. Interpretability of Entity Matching Based on Pre-trained Language Model. Ruan Jian Xue Bao/Journal of Software, 2023, 34(3): 1087-1108.
  • [6] Choi, Bonggeun; Ko, Youngjoong. Knowledge graph extension with a pre-trained language model via unified learning method. Knowledge-Based Systems, 2023, 262.
  • [7] Wu, Shanchan; He, Yifan. Enriching Pre-trained Language Model with Entity Information for Relation Classification. Proceedings of the 28th ACM International Conference on Information & Knowledge Management (CIKM '19), 2019: 2361-2364.
  • [8] Vasantharajan, Charangan; Tun, Kyaw Zin; Thi-Nga, Ho; Jain, Sparsh; Rong, Tong; Siong, Chng Eng. MedBERT: A Pre-trained Language Model for Biomedical Named Entity Recognition. Proceedings of the 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022: 1482-1488.
  • [9] Rao, Qiang; Wang, Tiejun; Guo, Xiaoran; Wang, Kaijie; Yan, Yue. Knowledge Graph Completion Using a Pre-Trained Language Model Based on Categorical Information and Multi-Layer Residual Attention. Applied Sciences-Basel, 2024, 14(11).
  • [10] Yin, Wenbo; Ren, Junxiang; Wu, Yuejiao; Song, Ruilin; Liu, Lang; Cheng, Zhen; Wang, Sibo. Knowledge Enhanced Pre-trained Language Model for Product Summarization. Natural Language Processing and Chinese Computing (NLPCC 2022), Part II, 2022, 13552: 263-273.