Thai Named Entity Recognition Based on Conditional Random Fields

被引:9
|
作者
Tirasaroj, Nutcha [1 ]
Aroonmanakun, Wirote [1 ]
机构
[1] Chulalongkorn Univ, Dept Linguist, Bangkok, Thailand
关键词
D O I
10.1109/SNLP.2009.5340913
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents the Thai named entity recognition (NER) systems using Conditional Random Fields (CRFs). In the previous studies of Thai NER, there are not any systems using syllable-segmented data as an input but word-segmented one. Since the results of some researches on NER in other languages such as Chinese show that the systems based on character are better than those based on word, this study is also conducted to find out if the syllable-segmented input helps improve Thai NER. In order to compare the system getting word-segmented input to that getting syllable-segmented input, there will be two sets of features used in the systems in this study. The results of the experiment show that the systems do not perform well enough due to few features used. However, it reveals that the syllable-based system is slightly better than the word-based one. The corpus, training data preparation and system overview are also included in this paper.
引用
收藏
页码:216 / 220
页数:5
相关论文
共 50 条
  • [41] Advanced Feature-Driven Disease Named Entity Recognition Using Conditional Random Fields
    Rahman, Hidayat
    Hahn, Thomas
    Segall, Richard
    [J]. PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 469 - 469
  • [42] Hierarchical conditional random fields (HCRF) for chinese named entity tagging
    Lu, Peng
    Yang, Yiping
    Gao, Yibo
    Ren, He
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2007, : 24 - +
  • [43] Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts
    Lee, Wangjin
    Kim, Kyungmo
    Lee, Eun Young
    Choi, Jinwook
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 101 : 7 - 14
  • [44] Rich features based Conditional Random Fields for biological named entities recognition
    Sun, Chengjie
    Guan, Yi
    Wang, Xiaolong
    Lin, Lei
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (09) : 1327 - 1333
  • [45] Named Entity Recognition of Chinese Electronic Medical Records Based on Cascaded Conditional Random Field
    Chen, Xiaoyu
    Shi, Shenghui
    Zhan, Siyan
    Jiang, Daguang
    Lin, Xiaoyong
    [J]. 2019 4TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2019), 2019, : 364 - 368
  • [46] Early results for chinese named entity recognition using conditional random fields model, HMM and maximum entropy
    Feng, YY
    Sun, L
    Zhang, JL
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 549 - 552
  • [47] Cybersecurity Named Entity Recognition Using Bidirectional Long Short-Term Memory with Conditional Random Fields
    Ma, Pingchuan
    Jiang, Bo
    Lu, Zhigang
    Li, Ning
    Jiang, Zhengwei
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (03) : 259 - 265
  • [48] A combination of active learning and self-learning for named entity recognition on Twitter using conditional random fields
    Van Cuong Tran
    Ngoc Thanh Nguyen
    Fujita, Hamido
    Dinh Tuyen Hoang
    Hwang, Dosam
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 132 : 179 - 187
  • [49] Cybersecurity Named Entity Recognition Using Bidirectional Long Short-Term Memory with Conditional Random Fields
    PingchuanMa
    BoJiang
    ZhigangLu
    NingLi
    ZhengweiJiang
    [J]. Tsinghua Science and Technology, 2021, 26 (03) : 259 - 265
  • [50] Named Entity Recognition for Thai Historical Data
    Laosen, Nasith
    Laosen, Kanjana
    Paklao, Thummarat
    [J]. 2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024, 2024, : 528 - 533