Domain-specific Named Entity Recognition with Document-Level Optimization

被引:7
|
作者
Wang, Limin [1 ,2 ]
Li, Shoushan [1 ,2 ]
Yan, Qian [1 ,2 ]
Zhou, Guodong [1 ,2 ]
机构
[1] Soochow Univ, Nat Language Proc Lab, Suzhou, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, 1 Shizi St, Suzhou 215006, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Named entity recognition; Integer linear programming; Chinese language processing;
D O I
10.1145/3213544
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous studies normally formulate named entity recognition (NER) as a sequence labeling task and optimize the solution in the sentence level. In this article, we propose a document-level optimization approach to NER and apply it in a domain-specific document-level NER task. As a baseline, we apply a state-of-the-art approach, i.e., long-short-term memory (LSTM), to perform word classification. On this basis, we define a global objective function with the obtained word classification results and achieve global optimization via Integer Linear Programming (ILP). Specifically, in the ILP-based approach, we propose four kinds of constraints, i.e., label transition, entity length, label consistency, and domain-specific regulation constraints, to incorporate various entity recognition knowledge in the document level. Empirical studies demonstrate the effectiveness of the proposed approach to domain-specific document-level NER.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition
    Xu, Kai
    Yang, Zhenguo
    Kang, Peipei
    Wang, Qi
    Liu, Wenyin
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 108 : 122 - 132
  • [22] Explicitly Capturing Relations between Entity Mentions via Graph Neural Networks for Domain-specific Named Entity Recognition
    Chen, Pei
    Ding, Haibo
    Araki, Jun
    Huang, Ruihong
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 735 - 742
  • [23] An Empirical Cross Domain-Specific Entity Recognition with Domain Vector
    Chen, Wei
    Han, Songqiao
    Huang, Hailiang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3868 - 3872
  • [24] Quantifying the advantage of domain-specific pre-training on named entity recognition tasks in materials science
    Trewartha, Amalie
    Walker, Nicholas
    Huo, Haoyan
    Lee, Sanghoon
    Cruse, Kevin
    Dagdelen, John
    Dunn, Alexander
    Persson, Kristin A.
    Ceder, Gerbrand
    Jain, Anubhav
    [J]. PATTERNS, 2022, 3 (04):
  • [25] Corpus and Baseline Model for Domain-Specific Entity Recognition in German
    Torge, Sunna
    Hahn, Waldemar
    Jaekel, Rene
    Nagel, Wolfgang E.
    [J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 314 - 320
  • [26] Named entity recognition for Psychological domain: challenges in document annotation for the Arabic Language
    Lakel, Kheira
    Bendella, Fatima
    Benkhadda, Samira
    [J]. PROCEEDINGS OF 2017 FIRST INTERNATIONAL CONFERENCE ON EMBEDDED & DISTRIBUTED SYSTEMS (EDIS 2017), 2017, : 39 - 43
  • [27] Detection of Domain -Specific Hypernyms Using Named Entity Recognition and Extraction
    Prata, D. L.
    Mocan, C. M.
    Nandra, C. I.
    Chifu, E. S.
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS, AQTR, 2024, : 335 - 339
  • [28] Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction
    Zhang, Ziqi
    Iria, Jose
    Ciravegna, Fabio
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2606 - 2613
  • [29] Entity and Evidence Guided Document-Level Relation Extraction
    Huang, Kevin
    Qi, Peng
    Wang, Guangtao
    Ma, Tengyu
    Huang, Jing
    [J]. REPL4NLP 2021: PROCEEDINGS OF THE 6TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2021, : 307 - 315
  • [30] Document-level Relation Extraction With Entity and Context Information
    Huang, He-Yan
    Yuan, Chang-Sen
    Feng, Chong
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (10): : 1953 - 1962