Applying class triggers in Chinese pos tagging based on maximum entropy model

被引:0
|
作者
Zhao, Y [1 ]
Wang, XL [1 ]
Liu, BQ [1 ]
Guan, Y [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Chinese POS tagging; trigger; average mutual information; maximum entropy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method of applying class triggers in Chinese POS tagging based on Maximum Entropy model is proposed in this paper. First of all, Feature template of "word->word/tat" is used to extract the triggers from corpus and the triggers that we extracted are added into the Maximum Entropy model as a new kind of feature. Then, the average mutual information is applied to make feature selection and the semantic lexicon is used to build class triggers to overcome sparseness problem. Meanwhile, A solution based on experience to deal with over-fitting problem in model training is presented. Finally, the performance of the system is evaluated on a manually annotated POS tag corpus. The experiment demonstrates that the method can provide increase of accuracy of POS tagging from 94% to 96%, compared our new model with HMM model that is smoothed by absolute smoothing.
引用
下载
收藏
页码:1641 / 1645
页数:5
相关论文
共 50 条
  • [1] Chinese POS tagging based on maximum entropy model
    Zhao, R
    Wang, XL
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 601 - 605
  • [2] Chinese POS Tagging Using Restricted Maximum Entropy Model
    Zhang Hong
    Ren Fuji
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (01): : 39 - 42
  • [3] Some improvements on maximum entropy based Chinese POS tagging
    Center for Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China
    J. China Univ. Post Telecom., 2006, SUPPL. (99-103):
  • [4] Chinese POS tagging employing maximum entropy and word clustering
    Ma, Jianjun
    Huang, Degen
    Li, Zezhong
    Journal of Information and Computational Science, 2010, 7 (12): : 2420 - 2428
  • [5] An English POS Tagging Approach Based on Maximum Entropy
    Yi, Chen
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA AND SMART CITY (ICITBS), 2016, : 81 - 84
  • [6] Improving Persian POS Tagging Using the Maximum Entropy Model
    Kardan, Ahmad A.
    Imani, Maryam Bahojb
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [7] Semi-supervised maximum entropy based POS tagging for large scale Chinese corpus
    Yuan, Caixia
    Wang, Xiaojie
    Zhai, Junjie
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 385 - 389
  • [8] A Modified Markov-Based Maximum-Entropy Model for POS Tagging of Odia Text
    Pattnaik, Sagarika
    Nayak, Ajit Kumar
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [9] Chinese part of speech tagging based on maximum entropy method
    Lin, H
    Yuan, CF
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1447 - 1450
  • [10] The study of a nonstationary maximum entropy Markov model and its application on the pos-tagging task
    Xiao, Jinghui
    Wang, Xiaolong
    Liu, Bingquan
    ACM Transactions on Asian Language Information Processing, 2007, 6 (02):