Applying class triggers in Chinese pos tagging based on maximum entropy model

被引:0
|
作者
Zhao, Y [1 ]
Wang, XL [1 ]
Liu, BQ [1 ]
Guan, Y [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Chinese POS tagging; trigger; average mutual information; maximum entropy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method of applying class triggers in Chinese POS tagging based on Maximum Entropy model is proposed in this paper. First of all, Feature template of "word->word/tat" is used to extract the triggers from corpus and the triggers that we extracted are added into the Maximum Entropy model as a new kind of feature. Then, the average mutual information is applied to make feature selection and the semantic lexicon is used to build class triggers to overcome sparseness problem. Meanwhile, A solution based on experience to deal with over-fitting problem in model training is presented. Finally, the performance of the system is evaluated on a manually annotated POS tag corpus. The experiment demonstrates that the method can provide increase of accuracy of POS tagging from 94% to 96%, compared our new model with HMM model that is smoothed by absolute smoothing.
引用
收藏
页码:1641 / 1645
页数:5
相关论文
共 50 条
  • [31] Class Probability Distribution Based Maximum Entropy Model for Classification of Datasets with Sparse Instances
    Arumugam, Saravanan
    Damotharan, Anandhi
    Marudhachalam, Srividya
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (03) : 949 - 976
  • [32] A maximum entropy Chinese character-based parser
    Luo, XQ
    PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2003, : 192 - 199
  • [33] A Maximum Entropy Based Reordering Model for Mongolian-Chinese SMT with Morphological Information
    Yang, Zhenxin
    Li, Miao
    Zhu, Zede
    Chen, Lei
    Wei, Linyu
    Wang, Shaoqi
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 175 - 178
  • [34] Character-Level Dependency Model for Joint Word Segmentation, POS Tagging, and Dependency Parsing in Chinese
    Guo, Zhen
    Zhang, Yujie
    Su, Chen
    Xu, Jinan
    Isahara, Hitoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (01): : 257 - 264
  • [35] Improved POS Tagging Model for Malay Twitter Data based on Machine Learning Algorithm
    Ariffin, Siti Noor Allia Noor
    Tiun, Sabrina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 229 - 234
  • [36] Event classification based on maximum entropy model
    Yu J.-D.
    Li X.-Y.
    Fan X.-Z.
    Pang W.-B.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2010, 39 (04): : 612 - 616
  • [37] Joint POS Tagging and Transition-based Constituent Parsing in Chinese with Non-local Features
    Wang, Zhiguo
    Xue, Nianwen
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 733 - 742
  • [38] Audio classification based on maximum entropy model
    Feng, Z
    Zhou, YQ
    Wu, LD
    Li, ZG
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 745 - 748
  • [39] Contribution of predictive factors of land degradation occurrence applying maximum entropy model
    Abolhasani, Azam
    Khosravi, Hassan
    Zehtabian, Gholamreza
    Rahmati, Omid
    Alamdarloo, Esmaeil Heydari
    D'Odorico, Paolo
    ARID LAND RESEARCH AND MANAGEMENT, 2024, 38 (03) : 299 - 317
  • [40] A Novel Chinese Entity Relationship Extraction Method Based on the Bidirectional Maximum Entropy Markov Model
    Lv, Chengyao
    Pan, Deng
    Li, Yaxiong
    Li, Jianxin
    Wang, Zong
    COMPLEXITY, 2021, 2021