Bengali Named Entity Recognition using Classifier Combination

Cited by: 9
Authors
Ekbal, Asif [1]
Bandyopadhyay, Sivaji [1]
Affiliation
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
DOI
10.1109/ICAPR.2009.86
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper reports the development of a Named Entity Recognition (NER) system for Bengali that combines the outputs of Maximum Entropy (ME), Conditional Random Field (CRF), and Support Vector Machine (SVM) classifiers using a majority voting approach. The training set consists of approximately 150K wordforms and has been manually annotated with four major NE tags: Person name, Location name, Organization name, and Miscellaneous name. Lexical context patterns, generated from an unlabeled corpus of 3 million wordforms, have been used to improve the performance of the classifiers. Evaluated on a gold standard test set of 30K wordforms, the voted system achieves overall recall, precision, and F-score values of 87.11%, 83.61%, and 85.32%, respectively, an improvement of 4.66% in F-score over the best performing SVM-based system and of 9.5% in F-score over the least performing ME-based system.
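To make the combination step concrete, here is a minimal Python sketch of token-level majority voting over the tags produced by several classifiers, together with the F-score formula that relates the reported recall and precision. It is an illustration under stated assumptions, not the authors' implementation: the function names, the tag labels (`PER`, `LOC`, `ORG`, `MISC`, `O`), and the tie-breaking rule (fall back to the first classifier in a caller-supplied priority order) are all hypothetical, since the abstract does not specify them.

```python
from collections import Counter


def majority_vote(predictions, priority):
    """Combine per-token tags from several classifiers by majority vote.

    predictions: dict mapping classifier name -> list of tags, one per token.
    priority: classifier names in tie-break order. This tie-break rule is
              an assumption for illustration; the abstract does not give one.
    """
    lengths = {len(tags) for tags in predictions.values()}
    if len(lengths) != 1:
        raise ValueError("all classifiers must tag the same number of tokens")
    n_tokens = lengths.pop()

    voted = []
    for i in range(n_tokens):
        # Count how many classifiers proposed each tag for token i.
        counts = Counter(predictions[name][i] for name in priority)
        best = max(counts.values())
        tied = {tag for tag, count in counts.items() if count == best}
        # Take the tag of the highest-priority classifier among the winners.
        voted.append(
            next(predictions[name][i] for name in priority
                 if predictions[name][i] in tied)
        )
    return voted


def f_score(recall, precision):
    """Balanced F-score: harmonic mean of recall and precision."""
    return 2 * recall * precision / (recall + precision)


if __name__ == "__main__":
    # Toy tag sequences for three tokens (made-up data, not from the paper).
    preds = {
        "ME": ["PER", "O", "LOC"],
        "CRF": ["PER", "LOC", "ORG"],
        "SVM": ["O", "LOC", "MISC"],
    }
    print(majority_vote(preds, priority=["SVM", "CRF", "ME"]))
    # Reported recall and precision reproduce the reported F-score.
    print(round(f_score(87.11, 83.61), 2))
```

In this sketch the third token receives three different tags, so the vote falls back to the first classifier in `priority`; any other tie-breaking policy could be substituted without changing the rest of the code.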
Pages: 259-262
Number of pages: 4
Related papers
50 in total
  • [21] Teaching a weaker classifier: Named entity recognition on upper case text
    Chieu, HL
    Ng, HT
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 481 - 488
  • [22] Combining multiple classifiers using vote based classifier ensemble technique for named entity recognition
    Saha, Sriparna
    Ekbal, Asif
    DATA & KNOWLEDGE ENGINEERING, 2013, 85 : 15 - 39
  • [23] Blog Text Analysis Using Topic Modeling, Named Entity Recognition and Sentiment Classifier Combine
    Waila, Pranav
    Singh, V. K.
    Singh, M. K.
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1166 - 1171
  • [24] Weighted Vote Based Classifier Ensemble Selection Using Genetic Algorithm for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 256 - +
  • [25] Chemical named entity recognition in the texts of scientific publications using the naive Bayes classifier approach
    Tarasova, O. A.
    Rudik, A. V.
    Biziukova, N. Yu
    Filimonov, D. A.
    Poroikov, V. V.
    JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [26] Entity Recognition in Bengali Language
    Das, Sujit Kumar
    Dhar, Sourish
    2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 157 - 160
  • [27] Telugu named entity recognition using bert
    Gorla, SaiKiranmai
    Tangeda, Sai Sharan
    Neti, Lalita Bhanu Murthy
    Malapati, Aruna
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 127 - 140
  • [28] Named entity recognition through corpus transformation and system combination
    Troyano, JA
    Carrillo, V
    Enríquez, F
    Galán, FJ
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2004, 3230 : 255 - 266
  • [29] NECKAr: A Named Entity Classifier for Wikidata
    Geiss, Johanna
    Spitz, Andreas
    Gertz, Michael
    LANGUAGE TECHNOLOGIES FOR THE CHALLENGES OF THE DIGITAL AGE, GSCL 2017, 2018, 10713 : 115 - 129
  • [30] Named entity recognition by using maximum entropy
    SCSE, VIT University, Vellore, India
    Int. J. Database Theory Appl., 2 (43-50):