CDANER: Contrastive Learning with Cross-domain Attention for Few-shot Named Entity Recognition

Cited by: 2

Authors:

Li, Wei ^{[1,2]}

Li, Hui ^{[2]}

Ge, Jingguo ^{[1,2]}

Zhang, Lei ^{[2]}

Li, Liangxiong ^{[2]}

Wu, Bingzhen ^{[2]}

Affiliations:

[1] Univ Chinese Acad Sci, Sch Cyberspace Secur, Beijing, Peoples R China

[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China

Source:

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023

Funding:

National Key R&D Program of China;

Keywords:

DOI:

10.1109/IJCNN54540.2023.10191439

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Few-shot Named Entity Recognition (NER) aims to recognize unseen named entities based on a tiny support set that consists of seen named entities and labels, which differs markedly from traditional supervised NER methods. Contrastive learning has become a popular solution for few-shot NER: it improves the robustness of NER in handling unlabeled entities by learning a similarity metric that measures the semantic similarity between test samples and entity labels. However, existing contrastive-learning-based NER methods learn the word embeddings in the source and target domains separately, ignoring connections between entities with the same label and limiting the effectiveness of contrastive learning. In this paper, we propose a novel few-shot NER framework that jointly models texts from different domains and optimizes a generalized objective of differentiating between words in all stages. The proposed model builds a cross-domain attention layer to enhance the feature representations of words and transfer entity similarity information from the source domain to the target domain. This significantly reduces the divergence between entities with the same label. Experimental results on the largest few-shot NER dataset show that CDANER significantly outperforms all baseline methods, which verifies the effectiveness and robustness of the proposed model.


Pages: 8

50 entries in total

[21] CEPTNER: Contrastive learning Enhanced Prototypical network for Two-stage few-shot Named Entity Recognition
Zha, Enze
Zeng, Delong
Lin, Man
Shen, Ying
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 295
[22] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition
Fang, Jinyuan
Wang, Xiaobin
Meng, Zaiqiao
Xie, Pengjun
Huang, Fei
Jiang, Yong
[J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4261 - 4276
[23] Feature extractor stacking for cross-domain few-shot learning
Wang, Hongyu
Frank, Eibe
Pfahringer, Bernhard
Mayo, Michael
Holmes, Geoffrey
[J]. MACHINE LEARNING, 2024, 113 (01) : 121 - 158
[24] Ranking Distance Calibration for Cross-Domain Few-Shot Learning
Li, Pan
Gong, Shaogang
Wang, Chengjie
Fu, Yanwei
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9089 - 9098
[25] Relevance equilibrium network for cross-domain few-shot learning
Ji, Zhong
Kong, Xiangyu
Wang, Xuan
Liu, Xiyao
[J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (02)
[26] Spectral Decomposition and Transformation for Cross-domain Few-shot Learning
Liu, Yicong
Zou, Yixiong
Li, Ruixuan
Li, Yuhua
[J]. NEURAL NETWORKS, 2024, 179
[27] Few-shot named entity recognition with hybrid multi-prototype learning
Zenghua Liao
Junbo Fei
Weixin Zeng
Xiang Zhao
[J]. World Wide Web, 2023, 26 : 2521 - 2544
[28] Experiments in cross-domain few-shot learning for image classification
Wang, Hongyu
Gouk, Henry
Fraser, Huon
Frank, Eibe
Pfahringer, Bernhard
Mayo, Michael
Holmes, Geoffrey
[J]. JOURNAL OF THE ROYAL SOCIETY OF NEW ZEALAND, 2023, 53 (01) : 169 - 191
[29] Threat intelligence named entity recognition techniques based on few-shot learning
Wang, Haiyan
Yang, Weimin
Feng, Wenying
Zeng, Liyi
Gu, Zhaoquan
[J]. ARRAY, 2024, 23
[30] Feature extractor stacking for cross-domain few-shot learning
Hongyu Wang
Eibe Frank
Bernhard Pfahringer
Michael Mayo
Geoffrey Holmes
[J]. Machine Learning, 2024, 113 : 121 - 158
