Evaluation of Neural Network Transformer Models for Named-Entity Recognition on Low-Resourced Languages

被引:4
|
作者
Hanslo, Ridewaan [1 ]
机构
[1] Univ Pretoria, Gauteng, South Africa
关键词
D O I
10.15439/2021F7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Network (NN) models produce state-of-the-art results for natural language processing tasks. Further, NN models are used for sequence tagging tasks on low-resourced languages with good results. However, the findings are not consistent for all low-resourced languages, and many of these languages have not been sufficiently evaluated. Therefore, in this paper, transformer NN models are used to evaluate named-entity recognition for ten low-resourced South African languages. Further, these transformer models are compared to other NN models and a Conditional Random Fields (CRF) Machine Learning (ML) model. The findings show that the transformer models have the highest F-scores with more than a 5% performance difference from the other models. However, the CRF ML model has the highest average F-score. The transformer model's greater parallelization allows low-resourced languages to be trained and tested with less effort and resource costs. This makes transformer models viable for low-resourced languages. Future research could improve upon these findings by implementing a linear-complexity recurrent transformer variant.
引用
收藏
页码:115 / 119
页数:5
相关论文
共 50 条
  • [1] Deep Learning Transformer Architecture for Named-Entity Recognition on Low-Resourced Languages: State of the art results
    Hanslo, Ridewaan
    [J]. PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 53 - 60
  • [2] Transfer Learning for Named-Entity Recognition with Neural Networks
    Lee, Ji Young
    Dernoncourt, Franck
    Szolovits, Peter
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4470 - 4473
  • [3] Multilingual Neural Semantic Parsing for Low-Resourced Languages
    Xia, Menglin
    Monti, Emilio
    [J]. 10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 185 - 194
  • [4] Neural Machine Translation for Low-Resourced Indian Languages
    Choudhary, Himanshu
    Rao, Shivansh
    Rohilla, Rajesh
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3610 - 3615
  • [5] Transformer-based Machine Translation for Low-resourced Languages embedded with Language Identification
    Sefara, Tshephisho J.
    Zwane, Skhumbuzo G.
    Gama, Nelisiwe
    Sibisi, Hlawulani
    Senoamadi, Phillemon N.
    Marivate, Vukosi
    [J]. 2021 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2021, : 127 - 132
  • [6] Comparison of Named Entity Recognition models based on Neural Network in Biomedical
    Kishwar, Azka
    Batool, Komal
    [J]. PROCEEDINGS OF 2021 INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGIES (IBCAST), 2021, : 426 - 431
  • [7] Comparison of Text Mining Models for Food and Dietary Constituent Named-Entity Recognition
    Perera, Nadeesha
    Thi Thuy Linh Nguyen
    Dehmer, Matthias
    Emmert-Streib, Frank
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2022, 4 (01): : 254 - 275
  • [8] A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition
    Al-Duwais, Mashael
    Al-Khalifa, Hend
    Al-Salman, Abdulmalik
    [J]. ELECTRONICS, 2024, 13 (17)
  • [9] Zero-shot evaluation of ChatGPT for food named-entity recognition and linking
    Ogrinc, Matevz
    Korousic Seljak, Barbara
    Eftimov, Tome
    [J]. FRONTIERS IN NUTRITION, 2024, 11
  • [10] HiTRANS: A Hierarchical Transformer Network for Nested Named Entity Recognition
    Yang, Zhiwei
    Ma, Jing
    Chen, Hechang
    Zhang, Yunke
    Chang, Yi
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 124 - 132