Scaling conditional random field with application to Chinese word segmentation

Citations: 0
Authors
Zhao, Hai [1]
Kit, Chunyu [1]
Affiliation
[1] City Univ Hong Kong, Dept Chinese Translat & Linguist, 83 Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As a powerful sequence labeling model, the conditional random field (CRF) has been applied successfully to a number of natural language processing (NLP) tasks. However, the high complexity of CRF training permits only a very small tag (or label) set, because training becomes intractable as the tag set grows. This paper proposes an improved decomposed training and joint decoding algorithm for CRF learning. Instead of training a single CRF model for all tags, it trains a binary sub-CRF independently for each tag. A predicted tag sequence is then produced by a joint decoding algorithm based on the probabilistic output of all sub-CRFs involved. To test its effectiveness, the approach is applied to Chinese word segmentation (CWS), treated as a character tagging problem. Our evaluation shows that it reduces time and memory cost by 20-39% and 44-50%, respectively, without any significant performance loss on various large-scale data sets.
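The abstract does not spell out the joint decoding step, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes a four-tag B/M/E/S character tagging scheme, assumes each binary sub-model has already produced a per-character probability for its own tag, and swaps in a plain Viterbi search over hand-written transition constraints to combine those outputs. The names `TAGS`, `ALLOWED`, `START`, `END`, and `joint_decode` are hypothetical.

```python
import math

# Assumed tag set: Begin, Middle, End of a multi-character word, Single-character word.
TAGS = ["B", "M", "E", "S"]
# Assumed transition constraints between adjacent character tags.
ALLOWED = {
    "B": {"M", "E"},
    "M": {"M", "E"},
    "E": {"B", "S"},
    "S": {"B", "S"},
}
START = {"B", "S"}  # tags that may open a sentence
END = {"E", "S"}    # tags that may close a sentence


def joint_decode(probs):
    """Pick the best valid tag sequence from per-tag binary probabilities.

    probs[i][tag] is the probability, taken from the binary sub-model for
    `tag`, that character i carries that tag. The values need not sum to 1
    across tags because each sub-model is trained independently.
    """
    n = len(probs)
    if n == 0:
        return []
    eps = 1e-12  # avoids log(0) for zero probabilities
    score = [{} for _ in range(n)]  # best log-score of a path ending in tag at i
    back = [{} for _ in range(n)]   # back-pointers for path recovery

    for tag in TAGS:
        if tag in START:
            score[0][tag] = math.log(probs[0][tag] + eps)

    for i in range(1, n):
        for tag in TAGS:
            best_prev, best = None, -math.inf
            for prev, s in score[i - 1].items():
                if tag in ALLOWED[prev] and s > best:
                    best_prev, best = prev, s
            if best_prev is not None:
                score[i][tag] = best + math.log(probs[i][tag] + eps)
                back[i][tag] = best_prev

    # Only tags that can legally end a sentence are considered at the last position.
    last = max((t for t in score[-1] if t in END), key=lambda t: score[-1][t])
    path = [last]
    for i in range(n - 1, 0, -1):
        path.append(back[i][path[-1]])
    return path[::-1]
```

Taking the most probable tag at each position independently can yield an invalid sequence such as B followed by B; searching over constrained paths is one simple way to reconcile the independent sub-model outputs.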
Pages: 95 / +
Page count: 3