Parallel HMM-Based Approach for Arabic Part of Speech Tagging

被引:0
|
作者
Kadim, Ayoub [1 ]
Lazrek, Azzeddine [1 ]
机构
[1] Cadi Ayyad Univ, Fac Sci, Dept Comp Sci, Marrakech, Morocco
关键词
Part of speech tagging; hidden Markov model; Viterbi algorithm; natural language processing; corpus; arabic language; WORDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we try to go beyond the classical use of the Hidden Markov Model for Part Of Speech Tagging, particularly for the Arabic language. In fact, most available Arabic tagging systems and tagsets are derived from English and do not make use of the linguistic richness of Arabic. Our new proposed tagging system will consist of two Hidden Markov Models working in parallel: In addition to the main model, a second model is added to serve as a reference for low probabilities tags. Of course, a dual corpus is required to train both models. To do so, we restructure the Nemlar Arabic corpus and extract a new tagset from diacritics and grammatical rules. The approach is implemented by using Java programming environment and several experimentations are conducted to evaluate it. The results of this approach, which are promising, as well as its limitations, are deeply discussed and future possible enhancements are also highlighted. This work will open the door for new promising research perspectives, particularly for the Arabic language processing, and more generally for the applications of Hidden Markov Models.
引用
收藏
页码:341 / 351
页数:11
相关论文
共 50 条
  • [1] Bidirectional HMM-based Arabic POS tagging
    Kadim, Ayoub
    Lazrek, Azzeddine
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (02) : 303 - 312
  • [2] Arabic HMM-based Speech Synthesis
    Khalil, Krichi Mohamed
    Adnan, Cherif
    [J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
  • [3] Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic
    Houidhek, Amal
    Colotte, Vincent
    Mnasri, Zied
    Jouvet, Denis
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 895 - 906
  • [4] Usage of the HMM-Based Speech Synthesis for intelligent Arabic voice
    Fares, Tamer S.
    Khalil, Awad H.
    Hegazy, Abd El-Fatah A.
    [J]. INTELLIGENT SYSTEMS AND AUTOMATION, 2008, 1019 : 93 - +
  • [5] A BAYESIAN APPROACH TO HMM-BASED SPEECH SYNTHESIS
    Hashimoto, Kei
    Zen, Heiga
    Nankaku, Yoshihiko
    Masuko, Takashi
    Tokuda, Keiichi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4029 - +
  • [6] Part of speech tagging for Arabic
    Kuebler, Sandra
    Mohamed, Emad
    [J]. NATURAL LANGUAGE ENGINEERING, 2012, 18 : 521 - 548
  • [7] Arabic Part of Speech Tagging
    Mohamed, Emad
    Kuebler, Sandra
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2537 - 2543
  • [8] Rule Based Approach for Arabic Part of Speech Tagging and Name Entity Recognition
    Btoush, Mohammad Hjouj
    Alarabeyyat, Abdulsalam
    Olab, Isa
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (06) : 331 - 335
  • [9] SPEECH-LAUGHS: AN HMM-BASED APPROACH FOR AMUSED SPEECH SYNTHESIS
    El Haddad, Kevin
    Dupont, Stephane
    Urbain, Jerome
    Dutoit, Thierry
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4939 - 4943
  • [10] From Stochastic Speech Recognition to Understanding: An HMM-Based approach
    Boda, PP
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 57 - 64