Pooled Contextualized Embeddings for Named Entity Recognition

被引:0
|
作者
Akbik, Alan [1 ]
Bergmann, Tanja [1 ]
Vollgraf, Roland [1 ]
机构
[1] Zalando Res, Muhlenstr 25, D-10243 Berlin, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contextual string embeddings are a recent type of contextualized word embedding that were shown to yield state-of-the-art results when utilized in a range of sequence labeling tasks. They are based on character-level language models which treat text as distributions over characters and are capable of generating embeddings for any string of characters within any textual context. However, such purely character-based approaches struggle to produce meaningful embeddings if a rare string is used in a underspecified context. To address this drawback, we propose a method in which we dynamically aggregate contextualized embeddings of each unique string that we encounter. We then use a pooling operation to distill a global word representation from all contextualized instances. We evaluate these pooled contextualized embeddings on common named entity recognition (NER) tasks such as CoNLL-03 and WNUT and show that our approach significantly improves the state-of-the-art for NER. We make all code and pre-trained models available to the research community for use and reproduction.
引用
收藏
页码:724 / 728
页数:5
相关论文
共 50 条
  • [1] Shahmukhi named entity recognition by using contextualized word embeddings
    Tehseen, Amina
    Ehsan, Toqeer
    Bin Liaqat, Hannan
    Kong, Xiangjie
    Ali, Amjad
    Al-Fuqaha, Ala
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [2] Chemical Named Entity Recognition with Deep Contextualized Neural Embeddings
    Awan, Zainab
    Kahlke, Tim
    Ralph, Peter J.
    Kennedy, Paul J.
    KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 135 - 144
  • [3] Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
    Zhai, Zenan
    Dat Quoc Nguyen
    Akhondi, Saber A.
    Thorne, Camilo
    Druckenbrodt, Christian
    Cohn, Trevor
    Gregory, Michelle
    Verspoor, Karin
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019), 2019, : 328 - 338
  • [4] Combining Contextualized Embeddings and Prior Knowledge for Clinical Named Entity Recognition: Evaluation Study
    Jiang, Min
    Sanger, Todd
    Liu, Xiong
    JMIR MEDICAL INFORMATICS, 2019, 7 (04) : 80 - 94
  • [5] Hierarchical Contextualized Representation for Named Entity Recognition
    Luo, Ying
    Xiao, Fengshun
    Zhao, Hai
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8441 - 8448
  • [6] Poincare Embeddings in the Task of Named Entity Recognition
    Munoz, David
    Perez, Fernando
    Pinto, David
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 193 - 204
  • [7] Named Entity Recognition Only from Word Embeddings
    Luo, Ying
    Zhao, Hai
    Zhan, Junlang
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8995 - 9005
  • [8] Combining Word Embeddings for Portuguese Named Entity Recognition
    da Silva, Messias Gomes
    Alves de Oliveira, Hilario Tomaz
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 198 - 208
  • [9] Exploiting Multiple Embeddings for Chinese Named Entity Recognition
    Xu, Canwen
    Wang, Feiyang
    Han, Jialong
    Li, Chenliang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2269 - 2272
  • [10] Embeddings for Named Entity Recognition in Geoscience Portuguese Literature
    Consoli, Bernardo
    Santos, Joaquim
    Gomes, Diogo
    Cordeiro, Fabio
    Vieira, Renata
    Moreira, Viviane
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4625 - 4630