Transformation Rule Learning without Rule Templates: A Case Study in Part of Speech Tagging

Cited by: 1
Authors
Bach, Ngo Xuan [1]
Cuong, Le Anh [1]
Ha, Nguyen Viet [2]
Binh, Nguyen Ngoc [1]
Affiliations
[1] Vietnam Natl Univ, Coll Technol, Hanoi, Vietnam
[2] Vietnam Natl Univ, Informat Technol Inst, Hanoi, Vietnam
DOI
10.1109/ALPIT.2008.73
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Part of Speech (POS) tagging is an important problem and one of the first steps in many natural language processing tasks. It directly affects the accuracy of many other problems, such as syntactic parsing, word sense disambiguation, and machine translation. Stochastic models solve this problem relatively well, but they still make mistakes. Transformation-based learning (TBL) can be used to improve stochastic taggers by learning a set of transformation rules. However, its rule learning algorithm has two disadvantages: rule templates must be prepared by hand, and only rules that are instances of those templates can be generated. In this paper we propose a model that learns transformation rules without rule templates. The model treats rule learning as a feature selection problem. Experiments on the Penn Treebank showed that the proposed model reduces the errors of stochastic taggers for some tags.
Pages: 9 / +
Number of pages: 2
Related papers
50 in total
  • [1] Rule Based Part of Speech Tagging of Sindhi Language
    Mahar, Javed Ahmed
    Memon, Ghulam Qadir
    [J]. 2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS, 2010, : 101 - 106
  • [2] Combining rule-based and case-based learning for iterative part-of-speech tagging
    Lopes, AA
    Jorge, A
    [J]. ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2001, 1898 : 26 - 36
  • [3] The computational complexity of rule-based part-of-speech tagging
    Oliva, K
    Kveton, P
    Ondruska, R
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 82 - 89
  • [4] A method integrating rule and HMM for Chinese part-of-speech tagging
    Hui Ning
    Hua Yang
    Zhihui Li
    [J]. ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 723 - 725
  • [5] Rule Based Approach for Arabic Part of Speech Tagging and Name Entity Recognition
    Btoush, Mohammad Hjouj
    Alarabeyyat, Abdulsalam
    Olab, Isa
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (06) : 331 - 335
  • [6] Part-of-Speech Tagging with Rule-Based Data Preprocessing and Transformer
    Li, Hongwei
    Mao, Hongyan
    Wang, Jingzi
    [J]. ELECTRONICS, 2022, 11 (01)
  • [7] Hidden Markov Model with Rule Based Approach for Part of Speech Tagging of Myanmar Language
    Zin, Khine Khine
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND INFORMATION TECHNOLOGY, 2009, : 123 - +
  • [8] Constrained atomic term: Widening the reach of rule templates in transformation based learning
    dos Santos, CN
    Oliveira, C
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3808 : 622 - 633
  • [9] A Scalable Solution for Rule-Based Part-of-Speech Tagging on Novel Hardware Accelerators
    Sadredini, Elaheh
    Guo, Deyuan
    Bo, Chunkun
    Rahimi, Reza
    Skadron, Kevin
    Wang, Hongning
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 665 - 674