Integrated Chinese word segmentation and part-of-speech tagging based on the divide-and-conquer strategy

被引:0
|
作者
Sun, MS [1 ]
Xu, DL [1 ]
Tsou, BK [1 ]
机构
[1] Tsing Hua Univ, Dept Comp Sci, Natl Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
关键词
Chinese word segmentation; part-of-speech tagging; integration; Divide-and-Conquer;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, various ways of integration of Chinese word segmentation and part-of-speech tagging, including the so-called true-integration and pseudo-integration, are tested and compared based on a test corpus consisting of 367,114 Chinese characters. A novel true-integration approach, named 'the divide-and-conquer integration', is originally proposed. Preliminary experiments show that this true integration achieves 98.72% accuracy of word segmentation, 95.65% accuracy of part-of-speech tagging, and 94.43% accuracy of word segmentation and part-of-speech tagging, outperforming all other kinds of combinations to some extent (though not very significant). The results demonstrate the potential for further improving the performance of Chinese word segmentation and part-of-speech tagging.
引用
收藏
页码:610 / 615
页数:6
相关论文
共 50 条
  • [1] An integrated approach to Chinese word segmentation and part-of-speech tagging
    Sun, Maosong
    Xu, Dongliang
    Tsou, Benjamin K.
    Lu, Huaming
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 299 - +
  • [2] Repairing errors for Chinese word segmentation and part-of-speech tagging
    Yao, TF
    Ding, W
    Erbach, G
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1881 - 1886
  • [3] Research on the model of integrating Chinese word segmentation with part-of-speech tagging
    Tong, Xiaojun
    Cui, Minggen
    Song, Guolong
    [J]. DCABES 2007 Proceedings, Vols I and II, 2007, : 1062 - 1065
  • [4] Unified Framework of Performing Chinese Word Segmentation and Part-of-Speech Tagging
    Zhang Kaixu
    Sun Maosong
    [J]. CHINA COMMUNICATIONS, 2012, 9 (03) : 1 - 9
  • [5] Research on the System of Jointing Chinese Word Segmentation with Part-of-speech Tagging
    Li, Qin
    Wei, Wei
    [J]. 2013 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2013, : 387 - 390
  • [6] Incorporating knowledge for joint Chinese word segmentation and part-of-speech tagging with SynSemGCN
    Tang, Xuemei
    Wang, Jun
    Su, Qi
    [J]. ASLIB JOURNAL OF INFORMATION MANAGEMENT, 2024,
  • [7] A joint method for Chinese word segmentation and part-of-speech tagging based on BiLSTM-CRF
    Yuan, Lichi
    [J]. Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2023, 54 (08): : 3145 - 3153
  • [8] Correcting word segmentation and part-of-speech tagging errors for Chinese named entity recognition
    Yao, TF
    Wei, D
    Erbach, G
    [J]. INTERNET CHALLENGE: TECHNOLOGY AND APPLICATIONS, 2002, : 29 - 36
  • [9] Petal segmentation in CT images based on divide-and-conquer strategy
    Naka, Yuki
    Utsumi, Yuzuko
    Iwamura, Masakazu
    Tsukaya, Hirokazu
    Kise, Koichi
    [J]. FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [10] A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text
    Xiong, Ying
    Wang, Zhongmin
    Jiang, Dehuan
    Wang, Xiaolong
    Chen, Qingcai
    Xu, Hua
    Yan, Jun
    Tang, Buzhou
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)