Combining terminology resources and statistical methods for entity recognition: an evaluation

Cited by: 0
Authors
Roberts, Angus [1]
Gaizauskas, Robert [1]
Hepple, Mark [1]
Guo, Yikun [1]
Affiliation
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
Keywords
DOI
Not available
Chinese Library Classification
H0 [Linguistics];
Discipline classification codes
030303; 0501; 050102;
Abstract
Terminologies and other knowledge resources are widely used to aid entity recognition in specialist domain texts. As well as providing lexicons of specialist terms, linkage from the text back to a resource can make additional knowledge available to applications. Use of such resources is especially pertinent in the biomedical domain, where large numbers of these resources are available, and where they are widely used in informatics applications. Terminology resources can be most readily used by simple lexical lookup of terms in the text. A major drawback with such lexical lookup, however, is poor precision caused by ambiguity between domain terms and general language words. To improve precision, we combine lexical lookup with simple filtering of ambiguous terms. We compare this lexical lookup with a statistical method of entity recognition, and with a method which combines the two approaches. We show that the combined method boosts precision with little loss of recall, and that linkage from recognised entities back to the domain knowledge resources can be maintained.
Pages: 2974-2980
Number of pages: 7