Chunking using conditional random fields in Korean texts

被引:0
|
作者
Lee, YH
Kim, MY
Lee, JH
机构
[1] POSTECH, Div Elect & Comp Engn, Pohang 790784, South Korea
[2] AITrc, Pohang 790784, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method of chunking in Korean texts using conditional random fields (CRFs), a recently introduced probabilistic model for labeling and segmenting sequence of data. In agglutinative languages such as Korean and Japanese, a rule-based chunking method is predominantly used for its simplicity and efficiency. A hybrid of a rule-based and machine learning method was also proposed to handle exceptional cases of the rules. In this paper, we present how CRFs can be applied to the task of chunking in Korean texts. Experiments using the STEP 2000 dataset show that the proposed method significantly improves the performance as well as outperforms previous systems.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [1] Chunking Arabic Texts Using Conditional Random Fields
    Khoufi, Nabil
    Aloulou, Chafik
    Hadrich Belguith, Lamia
    [J]. 2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 428 - 432
  • [2] Chunking in Turkish with Conditional Random Fields
    Yildiz, Olcay Taner
    Solak, Ercan
    Ehsani, Razieh
    Gorgun, Onur
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 173 - 184
  • [3] Chinese chunking algorithm based on conditional random fields
    Sun, Guang-Lu
    Liu, Bing-Quan
    Wang, Xiao-Long
    Liu, Yuan-Chao
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2509 - 2513
  • [4] Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts
    Lee, Wangjin
    Kim, Kyungmo
    Lee, Eun Young
    Choi, Jinwook
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 101 : 7 - 14
  • [5] Chinese Chunking Algorithm Based on Cascaded Conditional Random Fields
    Sun, Guang-Lu
    Liu, Yuan-Chao
    Qiao, Pei-Li
    Lang, Fei
    [J]. PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [6] Vietnamese Noun Phrase Chunking based on Conditional Random Fields
    Nguyen Thi Huong Thao
    Nguyen Phuong Thai
    Nguyen Le Minh
    Ha Quang Thuy
    [J]. INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2009), 2009, : 172 - +
  • [7] Bengali Noun Phrase Chunking Based on Conditional Random Fields
    Sarkar, Kamal
    Gayen, Vivekananda
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON BUSINESS AND INFORMATION MANAGEMENT (ICBIM), 2014,
  • [8] Punctuation Prediction for Vietnamese Texts Using Conditional Random Fields
    Pham, Quang H.
    Nguyen, Binh T.
    Nguyen Viet Cuong
    [J]. SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 322 - 327
  • [9] Chinese chunking method based on conditional random fields and semantic classes
    Sun, Guang-Lu
    Lang, Fei
    Xue, Yi-Bo
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2011, 43 (07): : 135 - 139
  • [10] Extracting Terms from Texts with Conditional Random Fields
    Li YiXuan
    Lu Xun
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY, 2016, 37 : 293 - 296