Multilingual Transformers for Named Entity Recognition

被引:1
|
作者
Viksna, Rinalds [1 ]
Skadin, Inguna [2 ]
机构
[1] Tilde, Vienibas Gatve 75a, Riga, Latvia
[2] Univ Latvia, Raina Bulv 29, Riga, Latvia
来源
BALTIC JOURNAL OF MODERN COMPUTING | 2022年 / 10卷 / 03期
关键词
named entity recognition; pre-trained language models; multilingual models; FLAIR;
D O I
10.22364/bjmc.2022.10.3.18
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Different methods for automatic named entity recognition (NER) have been researched for many years. Today, the most common technique for training named entity recognition models is a fine-tuning of large pre-trained language models. In this paper, we investigate the performance of various multilingual NER models in the state-of-the-art natural language processing frame-work Flair and compare them against the multilingual NER solution of the MAPA anonymization toolkit and BERT multilingual model, fine-tuned for NER. We demonstrate that in multilingual settings the best results could be achieved with fine-tuned XLM-R model, while in the case of Latvian (monolingual settings), the more targeted LitLat BERT model leads to the best results.
引用
收藏
页码:457 / 469
页数:13
相关论文
共 50 条
  • [1] Tuning Multilingual Transformers for Named Entity Recognition on Slavic Languages
    Arkhipov, Mikhail
    Trofimova, Maria
    Kuratov, Yuri
    Sorokin, Alexey
    [J]. 7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, : 89 - 93
  • [2] Language Clustering for Multilingual Named Entity Recognition
    Shaffer, Kyle
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 40 - 45
  • [3] Adaptive, multilingual named entity recognition in Web pages
    Petasis, G
    Karkaletsis, V
    Grover, C
    Hachey, B
    Pazienza, MT
    Vindigni, M
    Coch, J
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 1073 - 1074
  • [4] Using WordNet Predicates for Multilingual Named Entity Recognition
    Negri, Matteo
    Magnini, Bernardo
    [J]. GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS, 2003, : 169 - 174
  • [5] Learning multilingual named entity recognition from Wikipedia
    Nothman, Joel
    Ringland, Nicky
    Radford, Will
    Murphy, Tara
    Curran, James R.
    [J]. ARTIFICIAL INTELLIGENCE, 2013, 194 : 151 - 175
  • [6] Multilingual Fine-Grained Named Entity Recognition
    Lupancu, Viorica-Camelia
    Iftene, Adrian
    [J]. COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2023, 31 (03) : 321 - 339
  • [7] On the Strength of Character Language Models for Multilingual Named Entity Recognition
    Yu, Xiaodong
    Mayhew, Stephen
    Sammons, Mark
    Roth, Dan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3073 - 3077
  • [8] Semantics Fusion of Hierarchical Transformers for Multimodal Named Entity Recognition
    Tong, Zhao
    Liu, Qiang
    Shi, Haichao
    Xia, Yuwei
    Wu, Shu
    Zhang, Xiao-Yu
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 414 - 426
  • [9] Harnessing the power of hugging face's multilingual transformers: unravelling the code-mixed named entity recognition enigma
    Shamim, Rejuwan
    Shaikh, Asadullah
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2024, 12 (03)
  • [10] A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Boros, Emanuela
    Thi Tuyet Hai Nguyen
    Hackl, Guenter
    Moreno, Jose G.
    Doucet, Antoine
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2328 - 2334