Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction

被引:0
|
作者
Zhang, Ziqi [1 ]
Iria, Jose [2 ]
Ciravegna, Fabio [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, S Yorkshire, England
[2] IBM Res, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Domain specific entity recognition often relies on domain-specific knowledge to improve system performance. However, such knowledge often suffers from limited domain portability and is expensive to build and maintain. Therefore, obtaining it in a generic and unsupervised manner would be a desirable feature for domain-specific entity recognition systems. In this paper, we introduce an approach that exploits domain-specificity of words as a form of domain-knowledge for entity-recognition tasks. Compared to prior work in the field, our approach is generic and completely unsupervised. We empirically show an improvement in entity extraction accuracy when features derived by our unsupervised method are used, with respect to baseline methods that do not employ domain knowledge. We also compared the results against those of existing systems that use manually crafted domain knowledge, and found them to be competitive.
引用
收藏
页码:2606 / 2613
页数:8
相关论文
共 50 条
  • [1] TLATR: Automatic Topic Labeling Using Automatic (Domain-Specific) Term Recognition
    Truica, Ciprian-Octavian
    Apostol, Elena-Simona
    [J]. IEEE ACCESS, 2021, 9 : 76624 - 76641
  • [2] TLATR: Automatic Topic Labeling Using Automatic (Domain-Specific) Term Recognition
    Truica, Ciprian-Octavian
    Apostol, Elena-Simona
    [J]. IEEE Access, 2021, 9 : 76624 - 76641
  • [3] An Empirical Cross Domain-Specific Entity Recognition with Domain Vector
    Chen, Wei
    Han, Songqiao
    Huang, Hailiang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3868 - 3872
  • [4] Methods for automatic term recognition in domain-specific text collections: A survey
    Astrakhantsev, N. A.
    Fedorenko, D. G.
    Turdakov, D. Yu.
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2015, 41 (06) : 336 - 349
  • [5] Methods for automatic term recognition in domain-specific text collections: A survey
    N. A. Astrakhantsev
    D. G. Fedorenko
    D. Yu. Turdakov
    [J]. Programming and Computer Software, 2015, 41 : 336 - 349
  • [6] A Novel Domain-Specific Feature Extraction Scheme For Arabic Handwritten Digits Recognition
    Abdelazeem, Sherif
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 247 - 252
  • [7] Corpus and Baseline Model for Domain-Specific Entity Recognition in German
    Torge, Sunna
    Hahn, Waldemar
    Jaekel, Rene
    Nagel, Wolfgang E.
    [J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 314 - 320
  • [8] Chinese Named Entity Recognition Method for Domain-Specific Text
    Liu, He
    Ma, Yuekun
    Gao, Chang
    Jia, Qi
    Zhang, Dezheng
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (06): : 1799 - 1808
  • [9] Named Entity Recognition and Normalization: A Domain-Specific Language Approach
    Vazquez, Miguel
    Chagoyen, Monica
    Pascual-Montano, Alberto
    [J]. 2ND INTERNATIONAL WORKSHOP ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (IWPACBB 2008), 2009, 49 : 147 - 155
  • [10] Transfer Learning for Domain-Specific Named Entity Recognition in German
    Torge, Sunna
    Hahn, Waldemar
    Jaekel, Rene
    [J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 321 - 327