Thai Named Entity Recognition Based on Conditional Random Fields

被引:9
|
作者
Tirasaroj, Nutcha [1 ]
Aroonmanakun, Wirote [1 ]
机构
[1] Chulalongkorn Univ, Dept Linguist, Bangkok, Thailand
关键词
D O I
10.1109/SNLP.2009.5340913
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents the Thai named entity recognition (NER) systems using Conditional Random Fields (CRFs). In the previous studies of Thai NER, there are not any systems using syllable-segmented data as an input but word-segmented one. Since the results of some researches on NER in other languages such as Chinese show that the systems based on character are better than those based on word, this study is also conducted to find out if the syllable-segmented input helps improve Thai NER. In order to compare the system getting word-segmented input to that getting syllable-segmented input, there will be two sets of features used in the systems in this study. The results of the experiment show that the systems do not perform well enough due to few features used. However, it reveals that the syllable-based system is slightly better than the word-based one. The corpus, training data preparation and system overview are also included in this paper.
引用
收藏
页码:216 / 220
页数:5
相关论文
共 50 条
  • [21] Exploring unsupervised features in Conditional Random Fields for Spanish Named Entity Recognition
    Copara, Jenny
    Ochoa, Jose
    Thorne, Camilo
    Glavas, Goran
    [J]. PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 283 - 288
  • [22] Conditional Random Fields for Spanish Named Entity Recognition Using Unsupervised Features
    Copara, Jenny
    Ochoa, Jose
    Thorne, Camilo
    Glavas, Goran
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 : 175 - 186
  • [23] ADAPTIVE NAMED ENTITY RECOGNITION BASED ON CONDITIONAL RANDOM FIELDS WITH AUTOMATIC UPDATED DYNAMIC GAZETTEERS
    Wu, Xixin
    Wu, Zhiyong
    Jia, Jia
    Cai, Lianhong
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 363 - 367
  • [24] Disease Named Entity Recognition Using Semisupervised Learning and Conditional Random Fields
    Suakkaphong, Nichalin
    Zhang, Zhu
    Chen, Hsinchun
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (04): : 727 - 737
  • [25] Chinese Named Entity Recognition Based on Rules and Conditional Random Field
    Liu, Weiming
    Yu, Bin
    Zhang, Chen
    Wang, Han
    Pan, Ke
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 268 - 272
  • [26] Named Entity Recognition of Glioma Protein Based on Conditional Random Field
    Li, Shaowu
    He, Shuqian
    Jiang, Wenjuan
    Huang, Chuixue
    Qin, Lingling
    Liu, Qian
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 14 - 14
  • [27] Bacterial Named Entity Recognition based on Dictionary and Conditional Random Field
    Wang, Xiaoyan
    Jiang, Xingpeng
    Liu, Mengwen
    He, Tingting
    Hu, Xiaohua
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 439 - 444
  • [28] Named Entity Recognition on Code-Switched Data Using Conditional Random Fields
    Sikdar, Utpal Kumar
    Barik, Biswanath
    Gamback, Bjorn
    [J]. COMPUTATIONAL APPROACHES TO LINGUISTIC CODE-SWITCHING, 2018, : 115 - 119
  • [29] Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition
    Okanohara, Daisuke
    Miyao, Yusuke
    Tsuruoka, Yoshimasa
    Tsujii, Jun'ichi
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 465 - 472
  • [30] Named Entity Recognition with Conditional Random Fields on Turkish News Dataset: Revisiting the Features
    Cekinel, Recep Firat
    Agriman, Mustafa
    Karagoz, Pinar
    Yilmaz, Burcu
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,