BosonNLP: An Ensemble Approach for Word Segmentation and POS Tagging

被引:16
|
作者
Min, Kerui [1 ]
Ma, Chenggang [1 ]
Zhao, Tianmei [1 ]
Li, Haiyan [1 ]
机构
[1] BosonData Inc, Shanghai, Peoples R China
关键词
D O I
10.1007/978-3-319-25207-0_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chineseword segmentation and POS tagging are arguably the most fundamental tasks in Chinese natural language processing. In this paper, we show an ensemble approach for segmentation and POS tagging, combining both discriminative and generative methods to get the advantage of both worlds. Our approach achieved the F1-score of 96.65% and 91.55% for segmentation and tagging respectively in the contest of NLPCC 2015 Shared Task 1, obtained the 1st place for both tasks.
引用
收藏
页码:520 / 526
页数:7
相关论文
共 50 条
  • [1] A hybrid approach to word segmentation and POS tagging
    Oki Electric Industry Co., Ltd., 2−5−7 Honmachi, Chuo-ku, Osaka
    541−0053, Japan
    不详
    619−0289, Japan
    [J]. Proc. Annu. Meet. Assoc. Comput Linguist, 1600, (217-220):
  • [2] Word segmentation and POS tagging for Chinese keyphrase extraction
    Huang, XC
    Chen, J
    Yan, PL
    Luo, X
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 364 - 369
  • [3] Exploiting Heterogeneous Annotations for Weibo Word Segmentation and POS Tagging
    Chao, Jiayuan
    Li, Zhenghua
    Chen, Wenliang
    Zhang, Min
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 495 - 506
  • [4] An Effective Joint Model for Chinese Word Segmentation and POS Tagging
    Wang, Heng-Jun
    Si, Nian-Wen
    Chen, Cheng
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP'16), 2016,
  • [5] Joint Word Segmentation, POS-Tagging and Syntactic Chunking
    Lyu, Chen
    Zhang, Yue
    Ji, Donghong
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3007 - 3014
  • [6] A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging
    Zhang, Meishan
    Yu, Nan
    Fu, Guohong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1528 - 1538
  • [7] A Neural Joint Model with BERT for Burmese Syllable Segmentation, Word Segmentation, and POS Tagging
    Mao, Cunli
    Man, Zhibo
    Yu, Zhengtao
    Gao, Shengxiang
    Wang, Zhenhan
    Wang, Hongbin
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [8] ACUT: An Associative Classifier Approach to Unknown Word POS Tagging
    Elahimanesh, Mohammad Hossein
    Minaei-Bidgoli, Behrouz
    Kermani, Fateme
    [J]. ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP 2013, 2014, 427 : 250 - +
  • [9] A Data-Driven Model for Automated Chinese Word Segmentation and POS Tagging
    Xu, Qing
    Wang, Zhiyou
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] Joint Chinese word segmentation and POS tagging system with undirected graphical models
    Zhu, Cong-Hui
    Zhao, Tie-Jun
    Zheng, De-Quan
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2010, 32 (03): : 700 - 704