An Effective Joint Model for Chinese Word Segmentation and POS Tagging

被引:0
|
作者
Wang, Heng-Jun [1 ]
Si, Nian-Wen [1 ]
Chen, Cheng [1 ]
机构
[1] Zhengzhou Inst Informat Sci & Technol, Zhengzhou, Henan, Peoples R China
关键词
Word segmentation; POS tagging; Joint model; Long Short-Term Memory; Sequence labeling;
D O I
10.1145/3028842.3028877
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese word segmentation and Part-of-speech (POS) tagging have been studied for decades. However, most of the previous works mainly focus on pipeline method which will lead to error propagation. In order to make word segmentation and POS tagging jointly in one model, in this paper, we propose an effective neural network model to improve the accuracy of the segmentation and tagging. Our model works based on the hierarchical Long Short-Term Memory (LSTM) and trained jointly in one objective function. What's more, to better utilizing the transition features between tags, we further introduce the transition matrix which can help to search the best tagging sequence. Experiment on Chinese Treebank shows that our model achieves competitive accuracy on word segmentation and POS tagging.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging
    Zhang, Meishan
    Yu, Nan
    Fu, Guohong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1528 - 1538
  • [2] A Unified Model for Joint Chinese Word Segmentation and POS Tagging with Heterogeneous Annotation Corpora
    Zhao, Jiayi
    Qiu, Xipeng
    Huang, Xuanjing
    [J]. 2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 227 - 230
  • [3] Joint Chinese word segmentation and POS tagging system with undirected graphical models
    Zhu C.-H.
    Zhao T.-J.
    Zheng D.-Q.
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2010, 32 (03): : 700 - 704
  • [4] A Neural Joint Model with BERT for Burmese Syllable Segmentation, Word Segmentation, and POS Tagging
    Mao, Cunli
    Man, Zhibo
    Yu, Zhengtao
    Gao, Shengxiang
    Wang, Zhenhan
    Wang, Hongbin
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [5] Word segmentation and POS tagging for Chinese keyphrase extraction
    Huang, XC
    Chen, J
    Yan, PL
    Luo, X
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 364 - 369
  • [6] Character-Level Dependency Model for Joint Word Segmentation, POS Tagging, and Dependency Parsing in Chinese
    Guo, Zhen
    Zhang, Yujie
    Su, Chen
    Xu, Jinan
    Isahara, Hitoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (01): : 257 - 264
  • [7] Joint Chinese Word Segmentation and POS Tagging Using an Error-Driven Word-Character Hybrid Model
    Kruengkrai, Canasai
    Uchimoto, Kiyotaka
    Kazama, Jun'ichi
    Wang, Yiou
    Torisawa, Kentaro
    Isahara, Hitoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12) : 2298 - 2305
  • [8] LM Enhanced BiRNN-CRF for Joint Chinese Word Segmentation and POS Tagging
    Zhang, Jianhu
    Liu, Gongshen
    Zhou, Jie
    Zhou, Cheng
    Sun, Huanrong
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 105 - 116
  • [9] A Data-Driven Model for Automated Chinese Word Segmentation and POS Tagging
    Xu, Qing
    Wang, Zhiyou
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] A Chinese Radical Embedding for Joint Segmentation and POS Tagging
    Han, Xu
    Wang, Cong
    Liu, Jie
    Li, Zheng
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 578 - 582