Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding

Citations: 0
Authors
Zhang, Taolin [1, 2]
Xu, Ruyao [1]
Wang, Chengyu [2]
Duan, Zhongjie [1]
Chen, Cen [1]
Qiu, Minghui [2]
Cheng, Dawei [3]
He, Xiaofeng [1]
Qian, Weining [1]
Affiliations
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
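Funding
National Natural Science Foundation of China
Keywords
MODEL
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405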
Abstract
Knowledge-Enhanced Pre-trained Language Models (KEPLMs) improve the performance of various downstream NLP tasks by injecting knowledge facts from large-scale Knowledge Graphs (KGs). However, existing methods for pre-training KEPLMs with relational triples are difficult to adapt to closed domains because these domains lack sufficient graph semantics. In this paper, we propose a Knowledge-enhanced lANGuAge Representation learning framework for various clOsed dOmains (KANGAROO) that captures the implicit graph structure among the entities. Specifically, since the entity coverage rates of closed-domain KGs can be relatively low and may exhibit a global sparsity phenomenon for knowledge injection, we consider not only the shallow relational representations of triples but also the hyperbolic embeddings of deep hierarchical entity-class structures for effective knowledge fusion. Moreover, as two closed-domain entities under the same entity class often have locally dense neighbor subgraphs, as measured by the max point bi-connected component, we further propose a data augmentation strategy based on contrastive learning over subgraphs to construct hard negative samples of higher quality. This helps the underlying KEPLMs better distinguish the semantics of these neighboring entities and thereby complements the global semantic sparsity. In the experiments, we evaluate KANGAROO on various knowledge-aware and general NLP tasks in both full and few-shot learning settings, where it significantly outperforms various KEPLM training paradigms in closed domains.
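As an illustration of why hyperbolic embeddings suit deep entity-class hierarchies, the following is a minimal sketch of the geodesic distance in the Poincaré ball model. The abstract does not state which hyperbolic model KANGAROO uses, so the choice of the Poincaré ball, the function name `poincare_distance`, and the sample points are all assumptions made for illustration and are not taken from the paper.

```python
import math


def poincare_distance(u, v):
    """Geodesic distance between two points strictly inside the unit ball.

    Assumption: the Poincare ball model; the abstract does not specify
    which hyperbolic model the paper actually uses.
    """
    sq_norm_u = sum(x * x for x in u)
    sq_norm_v = sum(x * x for x in v)
    if sq_norm_u >= 1.0 or sq_norm_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    arg = 1.0 + 2.0 * sq_dist / ((1.0 - sq_norm_u) * (1.0 - sq_norm_v))
    return math.acosh(arg)


# Hypothetical points: the same Euclidean gap (0.1) costs more hyperbolic
# distance near the boundary than near the origin.
near_origin = poincare_distance((0.0, 0.0), (0.1, 0.0))
near_boundary = poincare_distance((0.8, 0.0), (0.9, 0.0))
print(near_origin < near_boundary)
```

Because distances grow rapidly toward the boundary of the ball, general classes can sit near the origin while many fine-grained entities spread out near the boundary, which is the usual motivation for embedding tree-like structures in hyperbolic space.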
Pages: 15663-15676
Number of pages: 14