Class incremental named entity recognition without forgetting

被引:0
|
作者
Liu, Ye [1 ]
Huang, Shaobin [1 ]
Wei, Chi [1 ]
Tian, Sicheng [1 ]
Li, Rongsheng [1 ]
Yan, Naiyu [1 ]
Du, Zhijuan [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Inner Mongolia Univ, Hohhot, Peoples R China
[3] Minist Educ, Engn Res Ctr Ecol Big Data, Beijing, Peoples R China
关键词
Class incremental learning; Named entity recognition; Multi-model framework; Continual learning;
D O I
10.1007/s10115-024-02220-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class Incremental Named Entity Recognition (CINER) needs to learn new entity classes without forgetting old entity classes under the setting where the data only contain annotations for new entity classes. As is well known, the forgetting problem is the biggest challenge in Class Incremental Learning (CIL). In the CINER scenario, the unlabeled old class entities will further aggravate the forgetting problem. The current CINER method based on a single model cannot completely avoid the forgetting problem and is sensitive to the learning order of entity classes. To this end, we propose a Multi-Model (MM) framework that trains a new model for each incremental step and uses all the models for inference. In MM, each model only needs to learn the entity classes included in corresponding step, so MM has no forgetting problem and is robust to the different entity class learning orders. Furthermore, we design an error-correction training strategy and conflict-handling rules for MM to further improve performance. We evaluate MM on CoNLL-03 and OntoNotes-V5, and the experimental results show that our framework outperforms the current state-of-the-art (SOTA) methods by a large margin.
引用
收藏
页码:301 / 324
页数:24
相关论文
共 50 条
  • [21] Named Entity Recognition without Labelled Data: A Weak Supervision Approach
    Lison, Pierre
    Barnes, Jeremy
    Hubin, Aliaksandr
    Touileb, Samia
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1518 - 1533
  • [22] Dynamic Named Entity Recognition
    Luiggi, Tristan
    Soulier, Laure
    Guigue, Vincent
    Jendoubi, Siwar
    Baelde, Aurelien
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 890 - 897
  • [23] Speech recognition of a named entity
    Tomita, T
    Okimoto, Y
    Yamamoto, H
    Sagisaka, Y
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
  • [24] Named Entity Recognition in Query
    Guo, Jiafeng
    Xu, Gu
    Cheng, Xueqi
    Li, Hang
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 267 - 274
  • [25] ProtoNER: Few Shot Incremental Learning for Named Entity Recognition Using Prototypical Networks
    Kumar, Ritesh
    Goyal, Saurabh
    Verma, Ashish
    Isahagian, Vatche
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2023, 2024, 492 : 71 - 83
  • [26] Few-shot Named Entity Recognition via encoder and class intervention
    Ding, Long
    Ouyang, Chunping
    Liu, Yongbin
    Tao, Zhihua
    Wan, Yaping
    Gao, Zheng
    AI OPEN, 2024, 5 : 39 - 45
  • [27] Joint Learning of Named Entity Recognition and Entity Linking
    Martins, Pedro Henrique
    Marinho, Zita
    Martins, Andre F. T.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 190 - 196
  • [28] A Survey on Multimodal Named Entity Recognition
    Qian, Shenyi
    Jin, Wenduo
    Chen, Yonggang
    Ma, Jiangtao
    Qiao, Yaqiong
    Lu, Jinyu
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 609 - 622
  • [29] Named Entity Recognition for Mongolian Language
    Munkhjargal, Zoljargal
    Bella, Gabor
    Chagnaa, Altangerel
    Giunchiglia, Fausto
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 243 - 251
  • [30] A composite kernel for named entity recognition
    Saha, Sujan Kumar
    Narayan, Shashi
    Sarkar, Sudeshna
    Mitra, Pabitra
    PATTERN RECOGNITION LETTERS, 2010, 31 (12) : 1591 - 1597