Thai Named Entity Recognition Based on Conditional Random Fields

被引:9
|
作者
Tirasaroj, Nutcha [1 ]
Aroonmanakun, Wirote [1 ]
机构
[1] Chulalongkorn Univ, Dept Linguist, Bangkok, Thailand
关键词
D O I
10.1109/SNLP.2009.5340913
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents the Thai named entity recognition (NER) systems using Conditional Random Fields (CRFs). In the previous studies of Thai NER, there are not any systems using syllable-segmented data as an input but word-segmented one. Since the results of some researches on NER in other languages such as Chinese show that the systems based on character are better than those based on word, this study is also conducted to find out if the syllable-segmented input helps improve Thai NER. In order to compare the system getting word-segmented input to that getting syllable-segmented input, there will be two sets of features used in the systems in this study. The results of the experiment show that the systems do not perform well enough due to few features used. However, it reveals that the syllable-based system is slightly better than the word-based one. The corpus, training data preparation and system overview are also included in this paper.
引用
收藏
页码:216 / 220
页数:5
相关论文
共 50 条
  • [31] Incorporating dictionary features into conditional random fields for gene/protein named entity recognition
    Lin, Hongfei
    Li, Yanpeng
    Yang, Zhihao
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 162 - 173
  • [32] Extending Hybrid Conditional Random Fields Approach of Named Entity Recognition for Marathi Tweets
    Patawar, Maithilee L.
    Potey, M. A.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [33] Named entity recognition for Chinese construction documents based on conditional random field
    Qiqi ZHANG
    Cong XUE
    Xing SU
    Peng ZHOU
    Xiangyu WANG
    Jiansong ZHANG
    [J]. Frontiers of Engineering Management, 2023, 10 (02) : 237 - 249
  • [34] Named entity recognition for Chinese construction documents based on conditional random field
    Qiqi Zhang
    Cong Xue
    Xing Su
    Peng Zhou
    Xiangyu Wang
    Jiansong Zhang
    [J]. Frontiers of Engineering Management, 2023, 10 : 237 - 249
  • [35] Named entity recognition for Chinese construction documents based on conditional random field
    Zhang, Qiqi
    Xue, Cong
    Su, Xing
    Zhou, Peng
    Wang, Xiangyu
    Zhang, Jiansong
    [J]. FRONTIERS OF ENGINEERING MANAGEMENT, 2023, 10 (02) : 237 - 249
  • [36] SBLC: a hybrid model for disease named entity recognition based on semantic bidirectional LSTMs and conditional random fields
    Xu, Kai
    Zhou, Zhanfan
    Gong, Tao
    Hao, Tianyong
    Liu, Wenyin
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [37] SBLC: a hybrid model for disease named entity recognition based on semantic bidirectional LSTMs and conditional random fields
    Kai Xu
    Zhanfan Zhou
    Tao Gong
    Tianyong Hao
    Wenyin Liu
    [J]. BMC Medical Informatics and Decision Making, 18
  • [38] Named Entity Recognition in Biomedical Literature: A Comparison of Support Vector Machines and Conditional Random Fields
    Liu, Feng
    Chen, Yifei
    Manderick, Bernard
    [J]. ENTERPRISE INFORMATION SYSTEMS-BOOKS, 2008, 12 : 137 - 147
  • [39] Fine-grained Named Entity Recognition using Conditional Random Fields for Question Answering
    Lee, Changki
    Hwang, Yi-Gyu
    Oh, Hyo-Jung
    Lim, Soojong
    Heo, Jeong
    Lee, Chung-Hee
    Kim, Hyeon-Jin
    Wang, Ji-Hyun
    Jang, Myung-Gil
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 581 - 587
  • [40] Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
    Wei, Qikang
    Chen, Tao
    Xu, Ruifeng
    He, Yulan
    Gui, Lin
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016, : 1 - 8