Research on Modern Chinese Multi-category Words Part of Speech Tagging Based on Hidden Markov Model

被引:0
|
作者
Song, Zhendong [1 ]
机构
[1] Heilongjiang Univ, Informat Sci & Technol Inst, Harbin, Peoples R China
关键词
Computer systems; Chinese information processing; Multi-category words; Part of speech tagging; Hidden Markov Model;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, computer systems are widely used in the modern Chinese part of speech tagging. Modern Chinese part of speech tagging is a basic subject in the natural language processing. It is widely used in machine translation, natural language understanding, establishing of the Chinese corpus, information retrieval, text classification, text proofreading and speech recognition, among others. In the part of speech tagging, multi-category words part of speech (POS) tagging is always a difficulty. Although the total number of multi-category words in the modern Chinese is not high, the usage is fairly widespread. This paper, proposes an algorithm of multi-category words part of speech tagging. First, it is word segmentation according to the traditional method. And then, on this basis, we introduce a method based on the rules of multi-category words part of speech tagging. Finally, a detailed description of the Hidden Markov Model (HMM) used in the words part of speech tagging, and a statistical algorithm based on Hidden Markov Model.
引用
收藏
页码:393 / 397
页数:5
相关论文
共 50 条
  • [1] Application of Cloud Desktop in Modern Chinese Multi-category Words Part of Speech Tagging
    Song, Zhendong
    13TH GLOBAL CONGRESS ON MANUFACTURING AND MANAGEMENT, 2017, 174 : 1215 - 1220
  • [2] Vietnamese Part of Speech Tagging Based on Multi-category Words Disambiguation Model
    Zhao Chen
    Liu Yanchao
    Guo Jianyi
    Chen Wei
    Yan Xin
    Yu Zhengtao
    Chen Xiuqin
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 267 - 277
  • [3] Application of Big Data and Intelligent Processing Technology in Modern Chinese Multi-category Words Part of Speech Tagging Corpus
    Song, Zhendong
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018), 2018, : 107 - 111
  • [4] A TENGRAM method based part-of-speech tagging of multi-category words in Hindi language
    Gupta, J. P.
    Tayal, Devendra K.
    Gupta, Arti
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 15084 - 15093
  • [6] Hidden Markov Model Based Part of Speech Tagging for Nepali Language
    Paul, Abhijit
    Purkayastha, Bipul Syam
    Sarkar, Sunita
    2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 149 - 156
  • [7] A Hidden Markov Model for Persian Part-of-Speech Tagging
    Okhovvat, Morteza
    Bidgoli, Behrouz Minaei
    WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
  • [8] A part-of-speech tagging method based on improved hidden Markov model
    Yuan, L.-C. (yuanlichi@sohu.com), 1600, Central South University of Technology (43):
  • [9] Part-of-speech tagging based on hidden Markov model assuming joint independence
    Lee, SZ
    Tsujii, J
    Rim, HC
    38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 263 - 269
  • [10] Hidden Markov Model with Rule Based Approach for Part of Speech Tagging of Myanmar Language
    Zin, Khine Khine
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND INFORMATION TECHNOLOGY, 2009, : 123 - +