CoTea: Collaborative teaching for low-resource named entity recognition with a divide-and-conquer strategy

被引:0
|
作者
Yang, Zhiwei [1 ,2 ]
Ma, Jing [3 ]
Yang, Kang [4 ]
Lin, Huiru [5 ]
Chen, Hechang [4 ]
Yang, Ruichao [3 ]
Chang, Yi [4 ,6 ]
机构
[1] Jinan Univ, Guangdong Inst Smart Educ, Guangzhou, Peoples R China
[2] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[4] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
[5] Jinan Univ, Inst Phys Educ, Guangzhou, Peoples R China
[6] Jilin Univ, Int Ctr Future Sci, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
Low resource; Named entity recognition; Collaborative teaching; Divide-and-conquer;
D O I
10.1016/j.ipm.2024.103657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Low -resource named entity recognition (NER) aims to identify entity mentions when training data is scarce. Recent approaches resort to distant data with manual dictionaries for improvement, but such dictionaries are not always available for the target domain and have limited coverage of entities, which may introduce noise. In this paper, we propose a novel Collaborative Teaching (CoTea) framework for low -resource NER with a few supporting labeled examples, which can automatically augment training data and reduce label noise. Specifically, CoTea utilizes the entities in the supporting labeled examples to retrieve entity -related unlabeled data heuristically and then generates accurate distant labels with a novel mining -refining iterative mechanism. For optimizing distant labels, the mechanism mines potential entities from non -entity tokens with a recognition teacher and then refines entity labels with another prompt -based discrimination teacher in a divide -and -conquer manner. Experimental results on two benchmark datasets demonstrate that CoTea outperforms state-of-the-art baselines in lowresource settings and achieves 85% and 65% performance levels of the best high -resource baseline methods by merely utilizing about 2% of labeled data.
引用
收藏
页数:17
相关论文
共 48 条
  • [1] AUC Maximization for Low-Resource Named Entity Recognition
    Nguyen, Ngoc Dang
    Tan, Wei
    Du, Lan
    Buntine, Wray
    Beare, Richard
    Chen, Changyou
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13389 - 13399
  • [2] Biomedical Named Entity Recognition Under Low-Resource Situation
    Zhao, Jianfei
    Ren, Xiangyu
    Zhao, Shuo
    Li, Jinyi
    HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 41 - 47
  • [3] Converse Attention Knowledge Transfer for Low-Resource Named Entity Recognition
    School of Computer Science and Technology, University of Science and Technology of China, Hefei
    230027, China
    不详
    639798, Singapore
    Int. J. Crowd. Sci., 2024, 3 (140-148):
  • [4] Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition
    Zhou, Joey Tianyi
    Zhang, Hao
    Jin, Di
    Zhu, Hongyuan
    Fang, Meng
    Goh, Rick Siow Mong
    Kwok, Kenneth
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3461 - 3471
  • [5] Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition
    Hou, Wenlong
    Zhao, Weidong
    Liu, Xianhui
    Guo, Wenyan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
  • [6] LELNER: A Lightweight and Effective Low-resource Named Entity Recognition model
    Zhang, Zhanjun
    Zhang, Haoyu
    Wan, Qian
    Liu, Jie
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [7] Constrained Labeled Data Generation for Low-Resource Named Entity Recognition
    Guo, Ruohao
    Roth, Dan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4519 - 4533
  • [8] A Word Representation to Improve Named Entity Recognition in Low-resource Languages
    Mbouopda, Michael Franklin
    Yonta, Paulin Melatagia
    2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 333 - 337
  • [9] A Robust and Domain-Adaptive Approach for Low-Resource Named Entity Recognition
    Yu, Houjin
    Mao, Xian-Ling
    Chi, Zewen
    Wei, Wei
    Huang, Heyan
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 297 - 304
  • [10] Low-Resource Named Entity Recognition via the Pre-Training Model
    Chen, Siqi
    Pei, Yijie
    Ke, Zunwang
    Silamu, Wushour
    SYMMETRY-BASEL, 2021, 13 (05):