CDANER: Contrastive Learning with Cross-domain Attention for Few-shot Named Entity Recognition

被引:2
|
作者
Li, Wei [1 ,2 ]
Li, Hui [2 ]
Ge, Jingguo [1 ,2 ]
Zhang, Lei [2 ]
Li, Liangxiong [2 ]
Wu, Bingzhen [2 ]
机构
[1] Univ Chinese Acad Sci, Sch Cyberspace Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/IJCNN54540.2023.10191439
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Named Entity Recognition (NER) aims to recognize unseen name entities based on a tiny support set that consists of seen name entities and labels, which is obviously different from traditional supervised NER methods. Contrastive learning has become a popular solution for few-shot NER, which improves the robustness of NER to handle unlabeled entities by learning a similarity metric to measure the semantic similarity between test samples and entity labels. However, existing contrastive learning based NER methods individually learn the word embedding in source and target domains, ignoring connections between entities with the same label and limiting the effectiveness of contrast learning. In this paper, we propose a novel few-shot NER framework that jointly models different domain texts and optimizes a generalized objective of differentiating between words in all stages. The proposed model builds the cross-domain attention layer to enhance the feature representations of words and transfer the entity similarity information from the source domain to the target domain. This significantly reduces the divergence between entities with same label. Experimental results on the largest Few-shot NER dataset show that CDANER significantly outperforms all baseline methods, which verifies the effectiveness and robustness of the proposed model.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] CEPTNER: Contrastive learning Enhanced Prototypical network for Two-stage few-shot Named Entity Recognition
    Zha, Enze
    Zeng, Delong
    Lin, Man
    Shen, Ying
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [22] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition
    Fang, Jinyuan
    Wang, Xiaobin
    Meng, Zaiqiao
    Xie, Pengjun
    Huang, Fei
    Jiang, Yong
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4261 - 4276
  • [23] Feature extractor stacking for cross-domain few-shot learning
    Wang, Hongyu
    Frank, Eibe
    Pfahringer, Bernhard
    Mayo, Michael
    Holmes, Geoffrey
    [J]. MACHINE LEARNING, 2024, 113 (01) : 121 - 158
  • [24] Ranking Distance Calibration for Cross-Domain Few-Shot Learning
    Li, Pan
    Gong, Shaogang
    Wang, Chengjie
    Fu, Yanwei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9089 - 9098
  • [25] Relevance equilibrium network for cross-domain few-shot learning
    Ji, Zhong
    Kong, Xiangyu
    Wang, Xuan
    Liu, Xiyao
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (02)
  • [26] Spectral Decomposition and Transformation for Cross-domain Few-shot Learning
    Liu, Yicong
    Zou, Yixiong
    Li, Ruixuan
    Li, Yuhua
    [J]. NEURAL NETWORKS, 2024, 179
  • [27] Few-shot named entity recognition with hybrid multi-prototype learning
    Zenghua Liao
    Junbo Fei
    Weixin Zeng
    Xiang Zhao
    [J]. World Wide Web, 2023, 26 : 2521 - 2544
  • [28] Experiments in cross-domain few-shot learning for image classification
    Wang, Hongyu
    Gouk, Henry
    Fraser, Huon
    Frank, Eibe
    Pfahringer, Bernhard
    Mayo, Michael
    Holmes, Geoffrey
    [J]. JOURNAL OF THE ROYAL SOCIETY OF NEW ZEALAND, 2023, 53 (01) : 169 - 191
  • [29] Threat intelligence named entity recognition techniques based on few-shot learning
    Wang, Haiyan
    Yang, Weimin
    Feng, Wenying
    Zeng, Liyi
    Gu, Zhaoquan
    [J]. ARRAY, 2024, 23
  • [30] Feature extractor stacking for cross-domain few-shot learning
    Hongyu Wang
    Eibe Frank
    Bernhard Pfahringer
    Michael Mayo
    Geoffrey Holmes
    [J]. Machine Learning, 2024, 113 : 121 - 158