Multiobjective Approach for Feature Selection in Maximum Entropy based Named Entity Recognition

被引:3
|
作者
Ekbal, Asif [1 ]
Saha, Sriparna [1 ]
Hasanuzzaman, Md [2 ]
机构
[1] Univ Trento, Trento, Italy
[2] West Bengal Ind Dev Corp, Kolkata, India
关键词
D O I
10.1109/ICTAI.2010.54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the problem of appropriate feature selection for constructing a Maximum Entropy (ME) based Named Entity Recognition (NER) system under the multiobjective optimization (MOO) framework. Two conflicting objective functions are simultaneously optimized using the search capability of MOO. These objectives are (i). the dimensionality of features, which is tried to be minimized, and (ii). the corresponding F-measure value of the classifier, trained using the features present, is maximized. The features are encoded in the chromosomes. Thereafter, a multiobjective evolutionary algorithm in the steps of a popular MOO technique, NSGA-II, is developed to determine the appropriate feature subset. The proposed technique is evaluated to determine the suitable feature combinations for NER in a resource-constrained language, namely Bengali. Evaluation results yield the recall, precision and F-measure values of 72.45%, 82.39% and 77.11%, respectively.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Feature selection techniques for maximum entropy based biomedical named entity recognition
    Saha, Sujan Kumar
    Sarkar, Sudeshna
    Mitra, Pabitra
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 905 - 911
  • [2] Hungarian named entity recognition with a maximum entropy approach
    Varga, Daniel
    Simon, Eszter
    [J]. ACTA CYBERNETICA, 2007, 18 (02): : 293 - 301
  • [3] Improving feature extraction in named entity recognition based on maximum entropy model
    Jiang, Wei
    Guan, Yi
    Wang, Xiao-Long
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2630 - +
  • [4] A probabilistic feature based Maximum Entropy model for Chinese named entity recognition
    Zhang, Suxiang
    Wang, Xiaojie
    Wen, Juan
    Qin, Ying
    Zhong, Yixin
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 189 - +
  • [5] Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition
    Asif Ekbal
    Sriparna Saha
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 143 - 166
  • [6] Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition
    Ekbal, Asif
    Saha, Sriparna
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (02) : 143 - 166
  • [7] Hybrid Feature Selection Approach for Arabic Named Entity Recognition
    Shahine, Miran
    Sakre, Mohamed
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 452 - 464
  • [8] A Semi-supervised Approach for Maximum Entropy Based Hindi Named Entity Recognition
    Saha, Sujan Kumar
    Mitra, Pabitra
    Sarkar, Sudeshna
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 225 - 230
  • [9] Multiobjective Optimization Approach for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Garbe, Christoph S.
    [J]. PRICAI 2010: TRENDS IN ARTIFICIAL INTELLIGENCE, 2010, 6230 : 52 - +
  • [10] Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition
    Asif Ekbal
    Sriparna Saha
    [J]. International Journal of Machine Learning and Cybernetics, 2016, 7 : 597 - 611