Research on Modern Chinese Multi-category Words Part of Speech Tagging Based on Hidden Markov Model

被引:0
|
作者
Song, Zhendong [1 ]
机构
[1] Heilongjiang Univ, Informat Sci & Technol Inst, Harbin, Peoples R China
来源
PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING | 2014年 / 5卷
关键词
Computer systems; Chinese information processing; Multi-category words; Part of speech tagging; Hidden Markov Model;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, computer systems are widely used in the modern Chinese part of speech tagging. Modern Chinese part of speech tagging is a basic subject in the natural language processing. It is widely used in machine translation, natural language understanding, establishing of the Chinese corpus, information retrieval, text classification, text proofreading and speech recognition, among others. In the part of speech tagging, multi-category words part of speech (POS) tagging is always a difficulty. Although the total number of multi-category words in the modern Chinese is not high, the usage is fairly widespread. This paper, proposes an algorithm of multi-category words part of speech tagging. First, it is word segmentation according to the traditional method. And then, on this basis, we introduce a method based on the rules of multi-category words part of speech tagging. Finally, a detailed description of the Hidden Markov Model (HMM) used in the words part of speech tagging, and a statistical algorithm based on Hidden Markov Model.
引用
收藏
页码:393 / 397
页数:5
相关论文
共 50 条
  • [31] A hidden Markov model for morphology of compound roles in Persian text part of tagging
    Rezaei H.
    Motameni H.
    Barzegar B.
    International Journal of Engineering, Transactions B: Applications, 2021, 34 (11): : 2494 - 2507
  • [32] Morphology Analysis for Hidden Markov Model based Indonesian Part-of-Speech Tagger
    Muljono
    Afini, Umriya
    Supriyanto, Catur
    2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 237 - 240
  • [33] Hidden Markov model-based Korean part-of-speech tagging considering high agglutinativity, word-spacing, and lexical correlativity
    Lee, SZ
    Tsujii, J
    Rim, HC
    38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 376 - 383
  • [34] Chinese part of speech tagging based on maximum entropy method
    Lin, H
    Yuan, CF
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1447 - 1450
  • [35] Identifying Verb-Preposition Multi-Category Words in Chinese-English Patent Machine Translation
    Li, Hongzheng
    Zhu, Yun
    Jin, Yaohong
    ARTIFICIAL LIFE AND COMPUTATIONAL INTELLIGENCE, 2015, 8955 : 409 - 421
  • [36] Part-of-speech tagging using word probability based on category patterns
    Kang, Mi-young
    Jung, Sung-won
    Park, Kyung-soon
    Kwon, Hyuk-chul
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 119 - +
  • [37] A Research of Chinese Sentence Constituent Labeling Based on History Hidden Markov Model
    Xu, ZhiJian
    Luo, SiMing
    Li, Dan
    Li, HongXia
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 65 - 68
  • [38] TWO-STAGE MODEL SELECTION WITH PARAMETERS WEIGHTED HIDDEN MARKOV MODELS AND LIKELIHOOD RATIO FOR PART-OF-SPEECH TAGGING
    Sun, Shichang
    Liu, Hongbo
    Zhao, Pixi
    Lin, Hongfei
    NEURAL NETWORK WORLD, 2012, 22 (03) : 245 - 262
  • [39] A hidden Markov model based visual speech synthesizer
    Williams, JJ
    Katsaggelos, AK
    Randolph, MA
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2393 - 2396
  • [40] EXPRESSIVE SPEECH IDENTIFICATIONS BASED ON HIDDEN MARKOV MODEL
    Lutfi, Syaheerah L.
    Montero, J. M.
    Barra-Chicote, R.
    Lucas-Cuesta, J. M.
    Gallardo-Antolin, A.
    HEALTHINF 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HEALTH INFORMATICS, 2009, : 488 - +