Domain-specific Named Entity Recognition with Document-Level Optimization

被引:7
|
作者
Wang, Limin [1 ,2 ]
Li, Shoushan [1 ,2 ]
Yan, Qian [1 ,2 ]
Zhou, Guodong [1 ,2 ]
机构
[1] Soochow Univ, Nat Language Proc Lab, Suzhou, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, 1 Shizi St, Suzhou 215006, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Named entity recognition; Integer linear programming; Chinese language processing;
D O I
10.1145/3213544
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous studies normally formulate named entity recognition (NER) as a sequence labeling task and optimize the solution in the sentence level. In this article, we propose a document-level optimization approach to NER and apply it in a domain-specific document-level NER task. As a baseline, we apply a state-of-the-art approach, i.e., long-short-term memory (LSTM), to perform word classification. On this basis, we define a global objective function with the obtained word classification results and achieve global optimization via Integer Linear Programming (ILP). Specifically, in the ILP-based approach, we propose four kinds of constraints, i.e., label transition, entity length, label consistency, and domain-specific regulation constraints, to incorporate various entity recognition knowledge in the document level. Empirical studies demonstrate the effectiveness of the proposed approach to domain-specific document-level NER.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Domain-Specific Chinese Word Segmentation with Document-Level Optimization
    Yan, Qian
    Shen, Chenlin
    Li, Shoushan
    Xia, Fen
    Du, Zekai
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 353 - 365
  • [2] Document-Level Named Entity Recognition with Q-Network
    Lu, Tingming
    Gui, Yaocheng
    Gao, Zhiqiang
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 164 - 178
  • [3] Span Graph Transformer for Document-Level Named Entity Recognition
    Mao, Hongli
    Mao, Xian-Ling
    Tang, Hanlin
    Shang, Yu-Ming
    Huang, Heyan
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18769 - 18777
  • [4] Leveraging Document-Level Label Consistency for Named Entity Recognition
    Gui, Tao
    Ye, Jiacheng
    Zhang, Qi
    Zhou, Yaqian
    Gong, Yeyun
    Huang, Xuanjing
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3976 - 3982
  • [5] Exploiting global contextual information for document-level named entity recognition
    Yu, Yiting
    Wang, Zanbo
    Wei, Wei
    Zhang, Ruihan
    Mao, Xian-Ling
    Feng, Shanshan
    Wang, Fei
    He, Zhiyong
    Jiang, Sheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [6] Consistency enhancement of model prediction on document-level named entity recognition
    Jeong, Minbyul
    Kang, Jaewoo
    [J]. BIOINFORMATICS, 2023, 39 (06)
  • [7] Document-Level Named Entity Recognition by Incorporating Global and Neighbor Features
    Hu, Anwen
    Dou, Zhicheng
    Wen, Ji-rong
    [J]. INFORMATION RETRIEVAL (CCIR 2019), 2019, 11772 : 79 - 91
  • [8] Leveraging Multi-Token Entities in Document-Level Named Entity Recognition
    Hu, Anwen
    Dou, Zhicheng
    Nie, Jian-Yun
    Wen, Ji-Rong
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7961 - 7968
  • [9] Chinese Named Entity Recognition Method for Domain-Specific Text
    Liu, He
    Ma, Yuekun
    Gao, Chang
    Jia, Qi
    Zhang, Dezheng
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (06): : 1799 - 1808
  • [10] DocBAN: An Efficient Biaffine Attention Network for Document-Level Named Entity Recognition
    Wu, Hao
    Li, Xianxian
    Yang, Danping
    Zhou, Aoxiang
    Wang, Peng
    Liu, Peng
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 65 - 76