Scaling conditional random field with application to Chinese word segmentation

被引:0
|
作者
Zhao, Hai [1 ]
Kit, Chunyu [1 ]
机构
[1] City Univ Hong Kong, Dept Chinese Translat & Linguist, 83 Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a powerful sequence labeling model, conditional random field (CRF) has been applied to a number of natural language processing (NLP) tasks successfully. However, the high complexity of CRF training only allows a very small tag (or label)(1) set, because the training becomes intractable as the tag set enlarges. This paper proposes an improved decomposed training and joint decoding algorithm for CRF learning. Instead of training a single CRF model for all tags, it trains a binary sub-CRF independently for each tag. A predicted tag sequence is then produced by a joint decoding algorithm based on the probabilistic output of all sub-CRFs involved. To test its effectiveness, this approach is applied to tackle Chinese word segmentation (CWS) as a character tagging problem. Our evaluation shows that it can reduce time and memory cost by 20-39% and 44-50%, respectively, without any significant performance loss on various large-scale data sets.
引用
收藏
页码:95 / +
页数:3
相关论文
共 50 条
  • [41] A conditional random field-based model for joint sequence segmentation and classification
    Chatzis, Sotirios P.
    Kosmopoulos, Dimitrios I.
    Doliotis, Paul
    [J]. PATTERN RECOGNITION, 2013, 46 (06) : 1569 - 1578
  • [42] Fully convolutional networks semantic segmentation based on conditional random field optimization
    Wu, Qian
    Gu, Jinan
    Wu, Chen
    Li, Jin
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2021, 21 (05) : 1405 - 1415
  • [43] Ground Estimation and Point Cloud Segmentation using SpatioTemporal Conditional Random Field
    Rummelhard, Lukas
    Paigwar, Anshul
    Negre, Amaury
    Laugier, Christian
    [J]. 2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1105 - 1110
  • [44] Locally Shared Features: An Efficient Alternative to Conditional Random Field for Semantic Segmentation
    Yang, Zhengeng
    Yu, Hongshan
    Sun, Wei
    Mao, Zhihong
    Sun, Mingui
    [J]. IEEE ACCESS, 2019, 7 : 2263 - 2272
  • [45] Segmentation of Anatomical Branching Structures Based on Texture Features and Conditional Random Field
    Nuzhnaya, Tatyana
    Bakic, Predrag
    Kontos, Despina
    Megalooikonomou, Vasileios
    Ling, Haibin
    [J]. MEDICAL IMAGING 2012: IMAGE PROCESSING, 2012, 8314
  • [46] Automatic Segmentation of Cervical Nuclei Based on Deep learning and a Conditional Random Field
    Liu, Yiming
    Zhang, Pengcheng
    Song, Qingche
    Li, Andi
    Zhang, Peng
    Gui, Zhiguo
    [J]. IEEE ACCESS, 2018, 6 : 53709 - 53721
  • [47] Efficient Vessel Segmentation Based on Proposed Adaptive Conditional Random Field Model
    Math, Laxmi
    Fatima, Ruksar
    [J]. Recent Advances in Computer Science and Communications, 2022, 15 (05) : 794 - 804
  • [48] Signature Segmentation from Machine Printed Documents using Conditional Random Field
    Mandal, Ranju
    Roy, Partha Pratim
    Pal, Umapada
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1170 - 1174
  • [49] Conditional random field with high-order dependencies for sequence labeling and segmentation
    Cuong, Nguyen Viet
    Ye, Nan
    Lee, Wee Sun
    Chieu, Hai Leong
    [J]. Journal of Machine Learning Research, 2014, 15 : 981 - 1009
  • [50] EM Image Segmentation of Nerve Cells Based on Conditional Random Field Model
    He, Fuyun
    Liang, Yan
    Huang, Xiaoming
    [J]. ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 79 - 83