Continual Learning for Named Entity Recognition

Authors
Monaikul, Natawut [1]
Castellucci, Giuseppe [2]
Filice, Simone [2]
Rokhlenko, Oleg [2]
Affiliations
[1] Univ Illinois, Chicago, IL 60607 USA
[2] Amazon, Seattle, WA USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Named Entity Recognition (NER) is a vital task in various NLP applications. However, in many real-world scenarios (e.g., voice-enabled assistants) new named entity types are frequently introduced, entailing re-training NER models to support these new entity types. Re-annotating the original training data for the new entity types could be costly or even impossible when storage limitations or security concerns restrict access to that data, and annotating a new dataset for all of the entities becomes impractical and error-prone as the number of types increases. To tackle this problem, we introduce a novel Continual Learning approach for NER, which requires new training material to be annotated only for the new entity types. To preserve the existing knowledge previously learned by the model, we exploit the Knowledge Distillation (KD) framework, where the existing NER model acts as the teacher for a new NER model (i.e., the student), which learns the new entity types by using the new training material and retains knowledge of old entities by imitating the teacher's outputs on this new training set. Our experiments show that this approach allows the student model to "progressively" learn to identify new entity types without forgetting the previously learned ones. We also present a comparison with multiple strong baselines to demonstrate that our approach is superior for continually updating an NER model.
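The teacher-student setup described in the abstract can be illustrated with a small per-token loss. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not give the loss, so the combination of a KL-divergence distillation term with a cross-entropy term, the weight `alpha`, the `temperature`, and all function names here are hypothetical. It assumes the student's output layer lists the old labels first, followed by the new ones.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(student_logits, gold_index):
    """Negative log-likelihood of the gold label under the student."""
    probs = softmax(student_logits)
    return -math.log(probs[gold_index])

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def token_loss(teacher_logits, student_logits, gold_index,
               alpha=0.5, temperature=2.0):
    """Hypothetical per-token loss for one training step.

    teacher_logits: scores over the old label set only.
    student_logits: scores over the old labels followed by the new labels
                    (an assumed layout).
    gold_index:     index of the annotated label in the student's label set;
                    the new data is annotated only for the new entity types.
    """
    n_old = len(teacher_logits)
    teacher_probs = softmax(teacher_logits, temperature)
    student_old_probs = softmax(student_logits[:n_old], temperature)
    # Distillation term: the student imitates the teacher on the old labels.
    kd = kl_divergence(teacher_probs, student_old_probs)
    # Supervised term: the student fits the new annotations.
    ce = cross_entropy(student_logits, gold_index)
    return alpha * kd + (1 - alpha) * ce
```

When the student's scores for the old labels match the teacher's exactly, the distillation term vanishes and only the supervised term on the new annotations remains.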
Pages: 13570-13577
Number of pages: 8