Enhance prototypical networks with hybrid attention and confusing loss function for few-shot relation classification

Cited by: 9
Authors
Li, Yibing [1, 2, 3]
Ma, Zuchang [1 ]
Gao, Lisheng [1 ]
Wu, Yichen [1, 2, 4]
Xie, Fei [3 ]
Ren, Xiaoye [3 ]
Affiliations
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Inst Intelligent Machines, Anhui Prov Key Lab Med Phys & Technol, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch Grad Sch, Hefei 230026, Peoples R China
[3] Hefei Normal Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Relation classification; Few-shot learning; Hybrid attention; Loss; BERT;
DOI
10.1016/j.neucom.2022.04.067
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Relation classification (RC) is a fundamental task in building knowledge graphs and describing semantic formalization. It aims to classify the relation between the head and tail entities in a sentence. Existing RC methods mainly adopt the distant supervision (DS) scheme. However, DS still has a long-tail problem and suffers from data sparsity. Recently, few-shot learning (FSL) has attracted attention because it addresses the long-tail problem by learning from only a few samples. Prototypical networks, which classify a relation by distance, perform comparatively well on FSL. However, prototypical networks and their variants do not consider the critical role of entity words. In addition, not all sentences in the support set contribute equally to classifying relations. Furthermore, an entity pair in a sentence may have both true and confusing relations, which are difficult for an RC model to distinguish. To address these problems, a new context encoder, BERT_FE, is proposed; it builds on the pre-trained BERT model and fuses the information of the head and tail entities through entity word-level attention (WLA). At the same time, sentence-level attention (SLA) is proposed to give more weight to support-set sentences that are similar to the query instance, improving classification accuracy. A confusing loss function (CLF) is designed to enhance the model's ability to distinguish between true and confusing relations. The experimental results demonstrate that the proposed model (HACLF) outperforms several baseline models. (c) 2022 Elsevier B.V. All rights reserved.
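The abstract describes two mechanisms that are easy to illustrate together: classifying a query by its distance to class prototypes, and weighting support sentences by their similarity to the query. The sketch below is a minimal illustration under stated assumptions, not the paper's implementation. The dot-product similarity, the softmax weighting, the squared Euclidean distance, and the function names `attentive_prototypes` and `classify` are all assumptions made for this example; the record does not give the authors' formulas. The inputs are plain vectors standing in for sentence embeddings, which the paper obtains from its BERT_FE encoder.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attentive_prototypes(support, query):
    """Build one prototype per class as a query-aware weighted mean.

    support: array of shape (N, K, D), N classes with K support embeddings each.
    query:   array of shape (D,), one query embedding.
    Returns an array of shape (N, D).
    """
    # Assumed similarity: dot product between each support embedding and the query.
    scores = support @ query                 # (N, K)
    weights = softmax(scores, axis=1)        # (N, K), each row sums to 1
    return (weights[..., None] * support).sum(axis=1)

def classify(support, query):
    """Return the index of the nearest prototype and all distances."""
    protos = attentive_prototypes(support, query)
    # Assumed metric: squared Euclidean distance to each prototype.
    dists = ((protos - query) ** 2).sum(axis=1)
    return int(np.argmin(dists)), dists
```

With uniform weights this reduces to the plain prototypical-network prototype (the mean of the support embeddings); the similarity-based weights only shift each prototype toward the support sentences that resemble the query.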
Pages: 362-372
Page count: 11