Multi-domain evaluation framework for named entity recognition tools

被引:9
|
作者
Abdallah, Zahraa S. [1 ]
Carman, Mark [1 ]
Haffari, Gholamreza [1 ]
机构
[1] Monash Univ, Sch Informat Technol, Clayton, Vic, Australia
来源
关键词
Named entity recognition; Multi-domain evaluation; Qualitative data analysis; Benchmark evaluation;
D O I
10.1016/j.csl.2016.10.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting structured information from unstructured text is important for the qualitative data analysis. Leveraging NLP techniques for qualitative data analysis will effectively accelerate the annotation process, allow for large-scale analysis and provide more insights into the text to improve the performance. The first step for gaining insights from the text is Named Entity Recognition (NER). A significant challenge that directly impacts the performance of the NER process is the domain diversity in qualitative data. The represented text varies according to its domain in many aspects including taxonomies, length, formality and format. In this paper we discuss and analyse the performance of state-of-the-art tools across domains to elaborate their robustness and reliability. In order to do that, we developed a standard, expandable and flexible framework to analyse and test tools performance using corpora representing text across various domains. We performed extensive analysis and comparison of tools across various domains and from various perspectives. The resulting comparison and analysis are of significant importance for providing a holistic illustration of the state-of-the-art tools. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:34 / 55
页数:22
相关论文
共 50 条
  • [21] Named Entity Recognition System for the Biomedical Domain
    Sharma, Raghav
    Chauhan, Deependra
    Sharma, Raksha
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 837 - 840
  • [22] Medical Named Entity Recognition with Domain Knowledge
    Pei W.
    Sun S.
    Li X.
    Lu J.
    Yang L.
    Wu Y.
    Data Analysis and Knowledge Discovery, 2023, 7 (03) : 142 - 154
  • [23] Named Entity Recognition in a Very Homogeneous Domain
    Agarwal, Oshin
    Nenkova, Ani
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1850 - 1855
  • [24] Named Entity Recognition Datasets: A Classification Framework
    Zhang, Ying
    Xiao, Gang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [25] Rembrandt - a named-entity recognition framework
    Cardoso, Nuno
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1240 - 1243
  • [26] Named Entity Recognition Datasets: A Classification Framework
    Ying Zhang
    Gang Xiao
    International Journal of Computational Intelligence Systems, 17
  • [27] A multi-task framework based on decomposition for multimodal named entity recognition
    Cai, Chenran
    Wang, Qianlong
    Qin, Bing
    Xu, Ruifeng
    NEUROCOMPUTING, 2024, 604
  • [28] A Biomedical Named Entity Recognition Framework with Multi-granularity Prompt Tuning
    Liu, Zhuoya
    Chi, Tang
    Zhang, Peiliang
    Wu, Xiaoting
    Che, Chao
    HEALTH INFORMATION PROCESSING, CHIP 2022, 2023, 1772 : 95 - 105
  • [29] Comparing Open Arabic Named Entity Recognition Tools
    Aldumaykhi, Abdullah
    Otai, Saad
    Alsudais, Abdulkareem
    2023 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI, 2023, : 46 - 51
  • [30] Smart Search Tools using Named Entity Recognition
    Alshuwaier, Faisal A.
    Almutairi, Waleed A.
    Areshey, Ali M.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA), 2013, : 304 - 311