FEW-NERD: A Few-shot Named Entity Recognition Dataset

Authors
Ding, Ning [1, 3]
Xu, Guangwei [2]
Chen, Yulin [3]
Wang, Xiaobin [2]
Han, Xu [1]
Xie, Pengjun [2]
Zheng, Hai-Tao [3]
Liu, Zhiyuan [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data focuses specifically on this practical and challenging task. Current approaches collect existing supervised NER datasets and reorganize them into the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice most unseen entity types are fine-grained. In this paper, we present FEW-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. FEW-NERD consists of 188,238 sentences from Wikipedia containing 4,601,160 words, each annotated either as context or as part of a two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that FEW-NERD is challenging and that the problem requires further research.
Pages: 3198-3213
Page count: 16