Hidden Markov model-based Korean part-of-speech tagging considering high agglutinativity, word-spacing, and lexical correlativity

被引:0
|
作者
Lee, SZ [1 ]
Tsujii, J [1 ]
Rim, HC [1 ]
机构
[1] Univ Tokyo, Dept Informat Sci, Bunkyo Ku, Tokyo 1130033, Japan
来源
38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE | 2000年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present hidden Markov models for Korean part-of-speech tagging, which consider Korean characteristics such as high agglutinativity, word-spacing, and high lexical correlativity. In order ot consider rich information in contexts, the models adopt a less strict Markov assumption. In the models, sparse-data problem is very serious and their parameters tend to be estimated unreliably because they have a large number of parameters. To overcome sparse-data problem, our model uses a simplified version of the well-known back-off smoothing method. To mitigate unreliable estimation problem, our models assume joint independence instead of conditional independence because joint probabilities have the same degree of estimation reliability. Experimental results show that models with rich contexts perform even better than standard HMMs and that joint independent assumption is effective in some models.
引用
收藏
页码:376 / 383
页数:8
相关论文
共 16 条
  • [1] A Hidden Markov Model for Persian Part-of-Speech Tagging
    Okhovvat, Morteza
    Bidgoli, Behrouz Minaei
    WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
  • [2] A part-of-speech tagging method based on improved hidden Markov model
    Yuan, L.-C. (yuanlichi@sohu.com), 1600, Central South University of Technology (43):
  • [3] Named Entity Recognition Based On A Hidden Markov Model in Part-Of-Speech Tagging
    Ageishi, Ryohei
    Miura, Takao
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 404 - 409
  • [4] Part-of-speech tagging based on hidden Markov model assuming joint independence
    Lee, SZ
    Tsujii, J
    Rim, HC
    38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 263 - 269
  • [6] Twitter Part-Of-Speech Tagging Using Pre-classification Hidden Markov Model
    Sun, Shichang
    Liu, Hongbo
    Lin, Hongfei
    Abraham, Ajith
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1118 - 1123
  • [7] Hidden Markov Model Based Part of Speech Tagging for Nepali Language
    Paul, Abhijit
    Purkayastha, Bipul Syam
    Sarkar, Sunita
    2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 149 - 156
  • [8] A Comparative Study of Hidden Markov Model and Conditional Random Fields on a Yoruba Part-of-Speech Tagging Task
    Ayogu, Ikechukwu I.
    Adetunmbi, Adebayo O.
    Ojokoh, Bolanle A.
    Oluwadare, Samuel A.
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTING NETWORKING AND INFORMATICS (ICCNI 2017), 2017,
  • [9] Part-of-speech Tagging and Named Entity Recognition Using Improved Hidden Markov Model and Bloom Filter
    Ankita
    Nazeer, K. A. Abdul
    2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 1072 - 1077
  • [10] Morphology Analysis for Hidden Markov Model based Indonesian Part-of-Speech Tagger
    Muljono
    Afini, Umriya
    Supriyanto, Catur
    2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 237 - 240