Bengali Named Entity Recognition using Classifier Combination

被引:9
|
作者
Ekbal, Asif [1 ]
Bandyopadhyay, Sivaji [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
关键词
D O I
10.1109/ICAPR.2009.86
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports about the development of a Named Entity Recognition (NER) system for Bengali by combining the outputs of the classifiers like Maximum Entropy (ME), Conditional Random Field (CRF) and Support Vector Machine (SVM) using a majority voting approach. The training set consists of approximately 150K wordforms and has been manually annotated with the four major NE tags such as Person name, Location name, Organization name and Miscellaneous name tags. Lexical context patterns, generated from an unlabeled corpus of 3 million wordforms, have been used in order to improve the performance of the classifiers. Evaluation results of the voted system for the gold standard test set of 30K wordforms have demonstrated the overall recall, precision, and f-Score values of 87.11%, 83.61%, and 85.32%, respectively, which shows an improvement of 4.66% in f-Score over the best performing SVM based system and an improvement of 9.5% in f-score over the least performing ME based system.
引用
收藏
页码:259 / 262
页数:4
相关论文
共 50 条
  • [1] Named entity recognition in Bengali using system combination
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2014, 37 (01): : 1 - 22
  • [2] Named Entity Recognition and transliteration in Bengali
    Ekbal, Asif
    Naskar, Sudip Kumar
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2007, 30 (01): : 95 - 114
  • [3] Named entity recognition in Bengali and Hindi using support vector machine
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2011, 34 (01): : 35 - 67
  • [4] Three different models for named entity recognition in Bengali
    Ekbal, Asif
    PROGRESS IN PATTERN RECOGNITION, 2007, : 161 - 170
  • [5] Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods
    Akkasi, Abbas
    Varoglu, Ekrem
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (06) : 1327 - 1338
  • [6] Boosting drug named entity recognition using an aggregate classifier
    Korkontzelos, Ioannis
    Piliouras, Dimitrios
    Dowsey, Andrew W.
    Ananiadou, Sophia
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2015, 65 (02) : 145 - 153
  • [7] Classifier Ensemble using Multiobjective Optimization for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 783 - 788
  • [8] Named Entity Recognition in Bengali and Hindi Using MuRIL and Conditional Random Fields
    Kaushik Bose
    Kamal Sarkar
    SN Computer Science, 5 (7)
  • [9] Hindi named entity recognition using system combination
    Sarkar, Kamal
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2018, 5 (01) : 11 - 39
  • [10] Bengali Named Entity Recognition: A survey with deep learning benchmark
    Rifat, Md Jamiur Rahman
    Abujar, Sheikh
    Noori, Sheak Rashed Haider
    Hossain, Syed Akhter
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,