Pooled Contextualized Embeddings for Named Entity Recognition

被引:0
|
作者
Akbik, Alan [1 ]
Bergmann, Tanja [1 ]
Vollgraf, Roland [1 ]
机构
[1] Zalando Res, Muhlenstr 25, D-10243 Berlin, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contextual string embeddings are a recent type of contextualized word embedding that were shown to yield state-of-the-art results when utilized in a range of sequence labeling tasks. They are based on character-level language models which treat text as distributions over characters and are capable of generating embeddings for any string of characters within any textual context. However, such purely character-based approaches struggle to produce meaningful embeddings if a rare string is used in a underspecified context. To address this drawback, we propose a method in which we dynamically aggregate contextualized embeddings of each unique string that we encounter. We then use a pooling operation to distill a global word representation from all contextualized instances. We evaluate these pooled contextualized embeddings on common named entity recognition (NER) tasks such as CoNLL-03 and WNUT and show that our approach significantly improves the state-of-the-art for NER. We make all code and pre-trained models available to the research community for use and reproduction.
引用
收藏
页码:724 / 728
页数:5
相关论文
共 50 条
  • [21] Multilingual Named Entity Recognition Using Pretrained Embeddings, Attention Mechanism and NCRF
    Emelyanov, Anton A.
    Artemova, Ekaterina
    7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, : 94 - 99
  • [22] Hierarchical Meta-Embeddings for Code-Switching Named Entity Recognition
    Winata, Genta Indra
    Lin, Zhaojiang
    Shin, Jamin
    Liu, Zihan
    Fung, Pascale
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3541 - 3547
  • [23] Geographic Named Entity Recognition and Disambiguation in Mexican News using word embeddings
    Molina-Villegas, Alejandro
    Muniz-Sanchez, Victor
    Arreola-Trapala, Jean
    Alcantara, Filomeno
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [24] Theoretical Linguistics Rivals Embeddings in Language Clustering for Multilingual Named Entity Recognition
    Imai, Sakura
    Kawahara, Daisuke
    Orita, Naho
    Oda, Hiromune
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-SRW 2023, VOL 4, 2023, : 139 - 151
  • [25] Improving Named Entity Recognition for Morphologically Rich Languages using Word Embeddings
    Demir, Hakan
    Ozgur, Arzucan
    2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 117 - 122
  • [26] Deep recurrent neural networks with word embeddings for Urdu named entity recognition
    Khan, Wahab
    Daud, Ali
    Alotaibi, Fahd
    Aljohani, Naif
    Arafat, Sachi
    ETRI JOURNAL, 2020, 42 (01) : 90 - 100
  • [27] Unsupervised cross-lingual model transfer for named entity recognition with contextualized word representations
    Yan, Huijiong
    Qian, Tao
    Xie, Liang
    Chen, Shanguang
    PLOS ONE, 2021, 16 (09):
  • [28] A Multichannel Biomedical Named Entity Recognition Model Based on Multitask Learning and Contextualized Word Representations
    Wei, Hao
    Gao, Mingyuan
    Zhou, Ai
    Chen, Fei
    Qu, Wen
    Zhang, Yijia
    Lu, Mingyu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [29] Towards Named Entity Disambiguation with Graph Embeddings
    Colliani, Felice Paolo
    Futia, Giuseppe
    Garifo, Giovanni
    Vetro, Antonio
    De Martin, Juan Carlos
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES, AICT 2024, 2024,
  • [30] Word Embeddings for Unsupervised Named Entity Linking
    Nozza, Debora
    Sas, Cezar
    Fersini, Elisabetta
    Messina, Enza
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 115 - 132