FEW-NERD: A Few-shot Named Entity Recognition Dataset

被引:0
|
作者
Ding, Ning [1 ,3 ]
Xu, Guangwei [2 ]
Chen, Yulin [3 ]
Wang, Xiaobin [2 ]
Han, Xu [1 ]
Xie, Pengjun [2 ]
Zheng, Hai-Tao [3 ]
Liu, Zhiyuan [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data specifically focused on the practical and challenging task. Current approaches collect existing supervised NER datasets and reorganize them into the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice, most unseen entity types are fine-grained. In this paper, we present FEW-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. FEW-NERD consists of 188,238 sentences from Wikipedia, 4,601,160 words are included and each is annotated as context or a part of a two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that FEW-NERD is challenging and the problem requires further research.
引用
下载
收藏
页码:3198 / 3213
页数:16
相关论文
共 50 条
  • [31] Few-shot named entity recognition with hybrid multi-prototype learning
    Liao, Zenghua
    Fei, Junbo
    Zeng, Weixin
    Zhao, Xiang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 2521 - 2544
  • [32] Data Augmentation with Nearest Neighbor Classifier for Few-Shot Named Entity Recognition
    Ge, Yao
    Al-Garadi, Mohammed Ali
    Sarker, Abeed
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 690 - 694
  • [33] Dataset Bias in Few-Shot Image Recognition
    Jiang, Shuqiang
    Zhu, Yaohui
    Liu, Chenlong
    Song, Xinhang
    Li, Xiangyang
    Min, Weiqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 229 - 246
  • [34] Label Semantics for Few Shot Named Entity Recognition
    Ma, Jie
    Ballesteros, Miguel
    Doss, Srikanth
    Anubhai, Rishita
    Mallya, Sunil
    Al-Onaizan, Yaser
    Roth, Dan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1956 - 1971
  • [35] SLNER: Chinese Few-Shot Named Entity Recognition with Enhanced Span and Label Semantics
    Ren, Zhe
    Qin, Xizhong
    Ran, Wensheng
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [36] Improving few-shot named entity recognition via Semantics induced Optimal Transport
    Zhou, Diange
    Li, Shengwen
    Chen, Qizhi
    Yao, Hong
    NEUROCOMPUTING, 2024, 597
  • [37] Named Entity Recognition for Few-Shot Power Dispatch Based on Multi-Task
    Tan, Zhixiang
    Chen, Yan
    Liang, Zengfu
    Meng, Qi
    Lin, Dezhao
    ELECTRONICS, 2023, 12 (16)
  • [38] Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning
    Yang, Yi
    Katiyar, Arzoo
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6365 - 6375
  • [39] Few-shot Named Entity Recognition Based on Fine-grained Prototypical Network
    Qi, Rong-Zhi
    Zhou, Jun-Yu
    Li, Shui-Yan
    Mao, Ying-Chi
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (10): : 4751 - 4765
  • [40] Decomposed Two-Stage Prompt Learning for Few-Shot Named Entity Recognition
    Ye, Feiyang
    Huang, Liang
    Liang, Senjie
    Chi, KaiKai
    INFORMATION, 2023, 14 (05)