Improving Persian POS Tagging Using the Maximum Entropy Model

被引:0
|
作者
Kardan, Ahmad A. [1 ]
Imani, Maryam Bahojb [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran, Iran
关键词
Natural Language Processing; Part of Speech Tagging; Persian Part of Speech Tagging; Maximum Entropy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Part of Speech (POS) tagging is one of the fundamental steps in various speech and text processing applications. POS tagging is the process of assigning the words in input sentences with their categories according to their contextual and grammatical properties. In addition to the general POS tagging difficulties such as the disambiguation of multi-category words and unknown words, the Persian language, unlike the English language, is a free order language and it has its own characteristics. These challenges can greatly affect the quality of the part-of-speech tagging process. An efficient POS tagging process has been developed for some languages, especially for the English language, but just a few researches have been done on the Persian language. To address these issues and achieve high POS tagging accuracy, we chose features which can show the important characteristics of words in a sentence, as well as maximum entropy as a machine learning classifier. Experimental results show that the proposed Persian POS tagging system outperforms the other state-of-the-art Persian taggers.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Chinese POS Tagging Using Restricted Maximum Entropy Model
    Zhang Hong
    Ren Fuji
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (01) : 39 - 42
  • [2] Chinese POS tagging based on maximum entropy model
    Zhao, R
    Wang, XL
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 601 - 605
  • [3] An English POS Tagging Approach Based on Maximum Entropy
    Yi, Chen
    [J]. 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA AND SMART CITY (ICITBS), 2016, : 81 - 84
  • [4] Applying class triggers in Chinese pos tagging based on maximum entropy model
    Zhao, Y
    Wang, XL
    Liu, BQ
    Guan, Y
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1641 - 1645
  • [5] Some improvements on maximum entropy based Chinese POS tagging
    Center for Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China
    [J]. J. China Univ. Post Telecom., 2006, SUPPL. (99-103):
  • [6] Chinese POS tagging employing maximum entropy and word clustering
    Ma, Jianjun
    Huang, Degen
    Li, Zezhong
    [J]. Journal of Information and Computational Science, 2010, 7 (12): : 2420 - 2428
  • [7] Persian POS tagging using probabilistic morphological analysis
    Fadaei, Hakimeh
    Shamsfard, Mehrnoush
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2010, 38 (04) : 264 - 273
  • [8] A Modified Markov-Based Maximum-Entropy Model for POS Tagging of Odia Text
    Pattnaik, Sagarika
    Nayak, Ajit Kumar
    [J]. INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [9] The study of a nonstationary maximum entropy Markov model and its application on the pos-tagging task
    Xiao, Jinghui
    Wang, Xiaolong
    Liu, Bingquan
    [J]. ACM Transactions on Asian Language Information Processing, 2007, 6 (02):
  • [10] Using a smoothing maximum entropy model for Chinese nominal entity tagging
    Chen, JY
    Xue, NW
    Palmer, M
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 493 - 499