Optical character recognition of handwritten Arabic using hidden Markov models

Cited by: 1
Authors
Aulama, Mohannad M. [1]
Natsheh, Asem M. [1]
Abandah, Gheith A. [1]
Olama, Mohammed M. [2]
Affiliations
[1] Univ Jordan, Dept Comp Engn, Amman 11942, Jordan
[2] CSED, Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
Keywords
Character recognition; OCR; Arabic OCR; hidden Markov models (HMMs); Viterbi algorithm
DOI
10.1117/12.884087
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The problem of optical character recognition (OCR) of handwritten Arabic has not yet received a satisfactory solution. In this paper, an Arabic OCR algorithm is developed based on hidden Markov models (HMMs) combined with the Viterbi algorithm, which results in improved and more robust recognition of characters at the sub-word level. Integrating HMMs represents another step in the overall OCR trends currently being researched in the literature. The proposed approach exploits the structure of characters in the Arabic language, in addition to their extracted features, to achieve improved recognition rates. Useful statistical information about the Arabic language is first extracted and then used to estimate the probabilistic parameters of the mathematical HMM. A new custom implementation of the HMM is developed in this study, in which the transition matrix is built from a large collected corpus and the emission matrix is built from the results obtained via the extracted character features. Recognition is performed using the Viterbi algorithm, which selects the most probable sequence of sub-words. The model was implemented to recognize the sub-word unit of Arabic text, so that the recognition rate is tied to the overall structure of the Arabic language rather than to the worst recognition rate of any single character. Numerical results show a potentially large recognition improvement from using the proposed algorithms.
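The abstract does not include the authors' implementation, so the following is only a minimal sketch of the general idea it describes: a transition matrix estimated from corpus counts, an emission matrix describing how a character classifier confuses characters, and Viterbi decoding over a sub-word. Everything concrete here is an assumption made for illustration: the Latin letters stand in for Arabic characters, the four-entry corpus and the emission probabilities are invented, and add-one smoothing is one possible choice that the abstract does not mention.

```python
import math

# Hypothetical stand-ins for Arabic characters and a tiny sub-word corpus.
ALPHABET = "abc"
CORPUS = ["ab", "abc", "ab", "cab"]

# Hypothetical emission matrix: EMIT[true_char][observed_label] is the
# probability that the feature-based classifier reports observed_label.
EMIT = {
    "a": {"a": 0.90, "b": 0.05, "c": 0.05},
    "b": {"a": 0.10, "b": 0.50, "c": 0.40},
    "c": {"a": 0.10, "b": 0.40, "c": 0.50},
}


def estimate_transitions(corpus, alphabet, smoothing=1.0):
    """Estimate start and character-to-character transition probabilities
    from sub-word counts, with additive smoothing (an assumption here)."""
    start_counts = {c: smoothing for c in alphabet}
    trans_counts = {a: {b: smoothing for b in alphabet} for a in alphabet}
    for word in corpus:
        start_counts[word[0]] += 1
        for prev, nxt in zip(word, word[1:]):
            trans_counts[prev][nxt] += 1
    start_total = sum(start_counts.values())
    start = {c: start_counts[c] / start_total for c in alphabet}
    trans = {}
    for a in alphabet:
        row_total = sum(trans_counts[a].values())
        trans[a] = {b: trans_counts[a][b] / row_total for b in alphabet}
    return start, trans


def viterbi(observations, alphabet, start, trans, emit):
    """Return the most probable hidden character sequence for the observed
    classifier labels, working in log space to avoid underflow."""
    # best[t][c] is the log-probability of the best path ending in c at step t.
    best = [{c: math.log(start[c]) + math.log(emit[c][observations[0]])
             for c in alphabet}]
    back = []
    for obs in observations[1:]:
        scores, pointers = {}, {}
        for cur in alphabet:
            prev = max(alphabet,
                       key=lambda p: best[-1][p] + math.log(trans[p][cur]))
            scores[cur] = (best[-1][prev] + math.log(trans[prev][cur])
                           + math.log(emit[cur][obs]))
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final state.
    path = [max(alphabet, key=lambda c: best[-1][c])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return "".join(reversed(path))


start, trans = estimate_transitions(CORPUS, ALPHABET)
# The classifier read "a" then "c"; the corpus statistics favour "a" then "b".
decoded = viterbi("ac", ALPHABET, start, trans, EMIT)
print(decoded)
```

In this toy setup the classifier's second label is ambiguous between the two stand-in characters, and the corpus-derived transition probabilities tip the decision. That is the effect the abstract attributes to decoding at the sub-word level instead of character by character.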
Pages: 12