Combining terminology resources and statistical methods for entity recognition: an evaluation

被引:0
|
作者
Roberts, Angus [1 ]
Gaizauskas, Robert [1 ]
Hepple, Mark [1 ]
Guo, Yikun [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Terminologies and other knowledge resources are widely used to aid entity recognition in specialist domain texts. As well as providing lexicons of specialist terms, linkage from the text back to a resource can make additional knowledge available to applications. Use of such resources is especially pertinent in the biomedical domain, where large numbers of these resources are available, and where they are widely used in informatics applications. Terminology resources can be most readily used by simple lexical lookup of terms in the text. A major drawback with such lexical lookup, however, is poor precision caused by ambiguity between domain terms and general language words. We combine lexical lookup with simple filtering of ambiguous terms, to improve precision. We compare this lexical lookup with a statistical method of entity recognition, and to a method which combines the two approaches. We show that the combined method boosts precision with little loss of recall, and that linkage from recognised entities back to the domain knowledge resources can be maintained.
引用
收藏
页码:2974 / 2980
页数:7
相关论文
共 50 条
  • [31] Chinese named entity recognition with a hybrid-statistical model
    Zhang, XY
    Wang, T
    Tang, JT
    Zhou, HP
    Chen, HW
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 900 - 912
  • [34] Combining linguistic with statistical methods in modeling prosody
    Price, P
    Ostendorf, M
    SIGNAL TO SYNTAX: BOOTSTRAPPING FROM SPEECH TO GRAMMAR IN EARLY ACQUISITION, 1996, : 67 - 83
  • [35] An analysis of statistical terminology applied in emergency medicine literature methods
    Shref, Jacob
    Thomas, Alyssa
    Huecker, Martin
    AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2022, 58 : 251 - 254
  • [36] Combining data-driven systems for improving named entity recognition
    Kozareva, Z
    Ferrández, O
    Montoyo, A
    Muñoz, R
    Suárez, A
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 80 - 90
  • [37] Evaluation of Named Entity Recognition in Spanish with OpenCalais
    Toribio, Raquel
    Martinez, Paloma
    de Pablo-Sanchez, Cesar
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 287 - 290
  • [38] Evaluation of Named Entity Recognition in Handwritten Documents
    Villanova-Aparisi, David
    Martinez-Hinarejos, Carlos-D
    Romero, Veronica
    Pastor-Gadea, Moises
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 568 - 582
  • [39] EVALUATION OF STATISTICAL RECOGNITION FUNCTION
    NAGASAWA, K
    NOGUCHI, S
    OIZUMI, J
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1967, 50 (09): : 35 - &
  • [40] Combining data-driven systems for improving named entity recognition
    Kozareva, Z.
    Ferrandez, O.
    Montoyo, A.
    Munoz, R.
    Suarez, A.
    Gomez, J.
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) : 449 - 466