ACUT: An Associative Classifier Approach to Unknown Word POS Tagging

Cited by: 0
Authors
Elahimanesh, Mohammad Hossein [1]
Minaei-Bidgoli, Behrouz [2]
Kermani, Fateme [1]
Affiliations
[1] Comp Res Ctr Islamic Sci, Qom, Iran
[2] Iran Univ Sci & Technol, Tehran, Iran
Keywords
Part-of-Speech tagging; Associative classifier; Hidden Markov Model; Unknown words
DOI
10.1007/978-3-319-10849-0_26
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This article focuses on Part-of-Speech (POS) tagging of unknown words. POS tagging is one of the fundamental requirements for intelligent, language-dependent text processing. The article therefore first aims to provide a high-accuracy POS tagger for the Persian language. To handle unknown words, it proposes combining a type of associative classifier with a Hidden Markov Model (HMM) algorithm. Associative classification is a new classification approach that integrates association rule mining and classification. The associative classifier used in this study is a variant introduced by this research: it uses not only sequence probability but also the CBA classifier. CBA first generates all association rules that meet given support and confidence thresholds as candidate rules. It then selects a small subset of these rules to form a classifier. To predict the class label of an example, it chooses the best rule whose body is satisfied by that example. According to the experimental results, the proposed algorithm raises the accuracy of Persian unknown-word POS tagging to 81.8%. The overall accuracy of the proposed tagger is 98% and its sentence-level accuracy is 63.1%.
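The three CBA steps described in the abstract (generate candidate rules above support and confidence thresholds, select a small rule set, predict with the best matching rule) can be illustrated with a minimal Python sketch. This is not the authors' ACUT classifier: the feature names (`suf=ing`, `lower`, `cap`), the toy training data, the thresholds, and the function names `mine_rules`, `select_rules`, and `predict` are all assumptions made for illustration. The rule selection is a simplified coverage-based pass, and the sequence-probability component and the HMM combination are not shown because the abstract does not specify them.

```python
from collections import Counter
from itertools import combinations

def mine_rules(examples, min_support=0.2, min_confidence=0.6, max_len=2):
    """Generate candidate rules (body -> label) that meet both thresholds."""
    n = len(examples)
    body_counts = Counter()   # occurrences of each body (itemset)
    rule_counts = Counter()   # co-occurrences of body and label
    for items, label in examples:
        for size in range(1, max_len + 1):
            for body in combinations(sorted(items), size):
                body_counts[body] += 1
                rule_counts[(body, label)] += 1
    rules = []
    for (body, label), count in rule_counts.items():
        support = count / n                     # P(body and label)
        confidence = count / body_counts[body]  # P(label | body)
        if support >= min_support and confidence >= min_confidence:
            rules.append((frozenset(body), label, support, confidence))
    # Rank: higher confidence, then higher support, then shorter body.
    rules.sort(key=lambda r: (-r[3], -r[2], len(r[0]), sorted(r[0])))
    return rules

def select_rules(rules, examples):
    """Simplified selection: keep a rule only if it correctly classifies
    at least one training example that no earlier rule has covered."""
    remaining = list(examples)
    selected = []
    for body, label, support, confidence in rules:
        covered = [ex for ex in remaining if body <= ex[0]]
        if any(lbl == label for _, lbl in covered):
            selected.append((body, label, support, confidence))
            remaining = [ex for ex in remaining if not body <= ex[0]]
        if not remaining:
            break
    return selected

def predict(selected, items, default):
    """Return the label of the first (best-ranked) rule whose body matches."""
    for body, label, _, _ in selected:
        if body <= items:
            return label
    return default

# Hypothetical toy data: surface features of words paired with POS tags.
examples = [
    ({"suf=ing", "lower"}, "VERB"),
    ({"suf=ing", "lower"}, "VERB"),
    ({"suf=ing", "lower"}, "NOUN"),
    ({"suf=ly", "lower"}, "ADV"),
    ({"suf=ly", "lower"}, "ADV"),
    ({"suf=ed", "lower"}, "VERB"),
    ({"suf=ed", "lower"}, "VERB"),
    ({"suf=s", "cap"}, "NOUN"),
    ({"suf=s", "cap"}, "NOUN"),
]

rules = mine_rules(examples)
selected = select_rules(rules, examples)
# Fall back to the most frequent training label when no rule matches.
default = Counter(label for _, label in examples).most_common(1)[0][0]

for body, label, support, confidence in selected:
    print(sorted(body), "->", label, round(support, 3), round(confidence, 3))
print(predict(selected, {"suf=ing", "lower"}, default))
```

In this sketch a rule with body `{"suf=ing"}` and label `VERB` survives even though one `-ing` training word is a noun, because its confidence (2 of 3) still clears the assumed 0.6 threshold. A word whose features match no selected rule receives the default label.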
Pages: 250 / +
Page count: 3