Combining terminology resources and statistical methods for entity recognition: an evaluation

Cited by: 0
Authors
Roberts, Angus [1]
Gaizauskas, Robert [1]
Hepple, Mark [1]
Guo, Yikun [1]
Affiliations
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics];
Discipline classification code
030303; 0501; 050102
Abstract
Terminologies and other knowledge resources are widely used to aid entity recognition in specialist domain texts. As well as providing lexicons of specialist terms, linkage from the text back to a resource can make additional knowledge available to applications. Use of such resources is especially pertinent in the biomedical domain, where large numbers of these resources are available, and where they are widely used in informatics applications. Terminology resources can be most readily used by simple lexical lookup of terms in the text. A major drawback of such lexical lookup, however, is poor precision caused by ambiguity between domain terms and general language words. We combine lexical lookup with simple filtering of ambiguous terms to improve precision. We compare this lexical lookup with a statistical method of entity recognition, and with a method that combines the two approaches. We show that the combined method boosts precision with little loss of recall, and that linkage from recognised entities back to the domain knowledge resources can be maintained.
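The lexical lookup and ambiguity filtering described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The toy terminology, the identifiers, the list of ambiguous terms, and the function name `lexical_lookup` are all hypothetical, and the greedy longest-match strategy is an assumption made for illustration. The statistical recogniser and the combined method are not shown.

```python
import re

# Hypothetical toy terminology: surface term -> resource identifier.
# Both the terms and the identifiers are invented for illustration.
TERMINOLOGY = {
    "aspirin": "DRUG:0001",
    "chest pain": "COND:0002",
    "cold": "COND:0003",
    "lead": "SUBST:0004",
}

# Hypothetical list of terms that are also common general-language words.
AMBIGUOUS_TERMS = {"cold", "lead"}

# Longest term length in tokens, used to bound the match window.
MAX_TERM_TOKENS = max(len(term.split()) for term in TERMINOLOGY)


def lexical_lookup(text, filter_ambiguous=True):
    """Return (term, identifier) pairs found by greedy longest-match lookup.

    When filter_ambiguous is True, matches on terms listed in
    AMBIGUOUS_TERMS are discarded, trading some recall for precision.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    matches = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate first, then shorter ones.
        for n in range(min(MAX_TERM_TOKENS, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in TERMINOLOGY:
                if not (filter_ambiguous and candidate in AMBIGUOUS_TERMS):
                    # Keeping the identifier preserves the link back
                    # to the terminology resource.
                    matches.append((candidate, TERMINOLOGY[candidate]))
                i += n
                break
        else:
            # No term starts at this token; move on.
            i += 1
    return matches


sentence = "Aspirin may lead to relief of chest pain in cold weather."
print(lexical_lookup(sentence, filter_ambiguous=False))
print(lexical_lookup(sentence, filter_ambiguous=True))
```

In this toy sentence, unfiltered lookup also matches "lead" and "cold", which are used here as general-language words, while the filtered lookup keeps only "aspirin" and "chest pain" together with their identifiers.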
Pages: 2974-2980
Page count: 7