Open Vocabulary Arabic Handwriting Recognition Using Morphological Decomposition

被引:16
|
作者
Hamdani, Mahdi [1 ]
Mousa, Amr El-Desoky [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Human Language Technol & Pattern Recognit Grp, Aachen, Germany
关键词
D O I
10.1109/ICDAR.2013.63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of Language Models (LMs) is a very important component in large and open vocabulary recognition systems. This paper presents an open-vocabulary approach for Arabic handwriting recognition. The proposed approach makes use of Arabic word decomposition based on morphological analysis. The vocabulary is a combination of words and subwords obtained by the decomposition process. Out Of Vocabulary (OOV) words can be recognized by combining different elements from the lexicon. The recognition system is based on Hidden Markov Models (HMMs) with position and context dependent character models. An n-gram LM trained on the decomposed text is used along with the HMMs during the search. The approach is evaluated using two Arabic handwriting datasets. The open vocabulary approach leads to a significant improvement in the system performance. Two different types experiments for two Arabic handwriting recognition tasks are conducted in this work. The proposed approach for open vocabulary allows to have an absolute improvement of up to 1% in the Word Error Rate (WER) for the constrained task and to keep the same performance of the baseline system for the unconstrained one.
引用
收藏
页码:280 / 284
页数:5
相关论文
共 50 条
  • [1] A large vocabulary system for Arabic online handwriting recognition
    Ibrahim Abdelaziz
    Sherif Abdou
    Hassanin Al-Barhamtoshy
    Pattern Analysis and Applications, 2016, 19 : 1129 - 1141
  • [2] A large vocabulary system for Arabic online handwriting recognition
    Abdelaziz, Ibrahim
    Abdou, Sherif
    Al-Barhamtoshy, Hassanin
    PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 1129 - 1141
  • [3] The RWTH Large Vocabulary Arabic Handwriting Recognition System
    Hamdani, Mahdi
    Doetsch, Patrick
    Kozielski, Michal
    Mousa, Amr El-Desoky
    Ney, Hermann
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 111 - 115
  • [4] New Morphological Markovian Approach for Analysis and Recognition of Open Arabic Canonical Vocabulary
    Ben Cheikh, Imen
    Laffet, Anas
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 183 - 188
  • [5] Large Vocabulary Hybrid DNN/HMM Arabic Online Handwriting Recognition System
    Khaled, Omar
    Fahmy, Aly
    Abdou, Sherif
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 876 - 881
  • [6] Arabic Handwriting Recognition using Sequential Minimal Optimization
    Hassen, Hanadi
    Al-Maadeed, Somaya
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 79 - 84
  • [7] Arabic handwriting recognition using variable duration HMM
    Kundu, Amlan
    Hines, Tom
    Phillips, Jon
    Huyck, Benjamin D.
    Van Guilder, Linda C.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 644 - 648
  • [8] Offline Arabic Handwriting Recognition Using BLSTMs Combination
    Jemni, Sana Khamekhem
    Kessentini, Yousri
    Kanoun, Slim
    Ogier, Jean-Marc
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 31 - 36
  • [9] Arabic handwriting recognition using structural and syntactic pattern attributes
    Parvez, Mohammad Tanvir
    Mahmoud, Sabri A.
    PATTERN RECOGNITION, 2013, 46 (01) : 141 - 154