Soft-constrained inference for Named Entity Recognition

被引:18
|
作者
Fersini, E. [1 ]
Messina, E. [1 ]
Felici, G. [2 ]
Roth, D. [3 ]
机构
[1] Univ Milano Bicocca, DISCo, I-20126 Milan, Italy
[2] CNR, Inst Syst Anal & Comp Sci, I-00185 Rome, Italy
[3] Univ Illinois, Dept Comp Sci, Champaign, IL 61822 USA
关键词
Conditional Random Fields; Named Entity Recognition; Rule extraction; Integer linear programming;
D O I
10.1016/j.ipm.2014.04.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Much of the valuable information in supporting decision making processes originates in text-based documents. Although these documents can be effectively searched and ranked by modern search engines, actionable knowledge need to be extracted and transformed in a structured form before being used in a decision process. In this paper we describe how the discovery of semantic information embedded in natural language documents can be viewed as an optimization problem aimed at assigning a sequence of labels (hidden states) to a set of interdependent variables (textual tokens). Dependencies among variables are efficiently modeled through Conditional Random Fields, an indirected graphical model able to represent the distribution of labels given a set of observations. The Markov property of these models prevent them to take into account long-range dependencies among variables, which are indeed relevant in Natural Language Processing. In order to overcome this limitation we propose an inference method based on Integer Programming formulation of the problem, where long distance dependencies are included through non-deterministic soft constraints. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:807 / 819
页数:13
相关论文
共 50 条
  • [1] Named Entity Recognition: Resource Constrained Maximum Path
    Pugliese, Luigi Di Puglia
    Fersini, Elisabetta
    Guerriero, Francesca
    Messina, Enza
    [J]. 12TH INTERNATIONAL CONFERENCE APPLIED MATHEMATICAL PROGRAMMING AND MODELLING-APMOD 2016, 2017, 14
  • [2] Named Entity Recognition Networks Based on Syntactically Constrained Attention
    Sun, Weiwei
    Liu, Shengquan
    Liu, Yan
    Kong, Lingqi
    Jian, Zhaorui
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [3] Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers
    Lester, Brian
    Pressel, Daniel
    Hemmeter, Amy
    Choudhury, Sagnik Ray
    Bangalore, Srinivas
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1841 - 1848
  • [4] Named Entity Recognition in the Medical Domain with Constrained CRF Models
    Jochim, Charles
    Deleris, Lea A.
    [J]. 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 839 - 849
  • [5] Hard-constrained versus soft-constrained parameter estimation
    Benavoli, A.
    Chisci, L.
    Farina, A.
    Ortenzi, L.
    Zappa, G.
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2006, 42 (04) : 1224 - 1239
  • [6] Constrained Labeled Data Generation for Low-Resource Named Entity Recognition
    Guo, Ruohao
    Roth, Dan
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4519 - 4533
  • [7] Named Entity Recognition for Vietnamese
    Dat Ba Nguyen
    Son Huu Hoang
    Son Bao Pham
    Thai Phuong Nguyen
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 205 - 214
  • [8] Persian Named Entity Recognition
    Dashtipour, Kia
    Gogate, Mandar
    Adeel, Ahsan
    Algarafi, Abdulrahman
    Howard, Newton
    Hussain, Amir
    [J]. 2017 IEEE 16TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2017, : 79 - 83
  • [9] Named Entity Recognition for Tweets
    Liu, Xiaohua
    Wei, Furu
    Zhang, Shaodian
    Zhou, Ming
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (01)
  • [10] An Overview of Named Entity Recognition
    Sun, Peng
    Yang, Xuezhen
    Zhao, Xiaobing
    Wang, Zhijuan
    [J]. 2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 273 - 278