Development and evaluation of a NER model in the domain of cultural analysis and tourism

Authors
Docio, Susana Sotelo [1]
Gamallo, Pablo [1]
Iriarte, Alvaro [2]
Affiliations
[1] Univ Santiago de Compostela, Santiago, Spain
[2] Univ Minho, Braga, Portugal
Source
LINGUAMATICA | 2023 / Vol. 15 / Issue 02
Keywords
named-entity recognition; machine learning; neural networks; transformers; evaluation; corpus
DOI
10.21814/lm.15.2.405
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
Named Entity Recognition (NER) is an essential task in information extraction where entities in a text are identified and classified. One of the primary challenges addressed by NER systems is the difficulty of generalizing what was learned to different types of corpora beyond the training data. This problem is magnified by the fact that most of the training corpora used are journalistic and therefore need to be adapted to other genres and domains. In this paper, we use a Spanish corpus consisting of interviews with visitors to the city of Santiago de Compostela and annotated with named entities, to evaluate and train NER systems tailored to the domain of cultural analysis and tourism. We provide a comprehensive comparison of various approaches employed, ranging from classical machine learning algorithms to fine-tuning Transformer models. The results significantly outperform the baseline, represented here by the toolkits Stanza, spaCy and FLAIR, although initial tests with unseen entities during training highlight the need for additional evaluations regarding their generalization capability and the utilization of adversarial splits for the corpus.
Pages: 3-18
Page count: 16