Optical character recognition of handwritten Arabic using hidden Markov models

Cited by: 1
Authors
Aulama, Mohannad M. [1]
Natsheh, Asem M. [1]
Abandah, Gheith A. [1]
Olama, Mohammed M. [2]
Affiliations
[1] Univ Jordan, Dept Comp Engn, Amman 11942, Jordan
[2] CSED, Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
Source
Keywords
Character recognition; OCR; Arabic OCR; hidden Markov models (HMMs); Viterbi algorithm
DOI
10.1117/12.884087
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Optical character recognition (OCR) of handwritten Arabic has not yet received a satisfactory solution. In this paper, an Arabic OCR algorithm is developed based on hidden Markov models (HMMs) combined with the Viterbi algorithm, which results in improved and more robust recognition of characters at the sub-word level. Integrating HMMs represents another step in the OCR trends currently being researched in the literature. The proposed approach exploits the structure of characters in the Arabic language, in addition to their extracted features, to achieve improved recognition rates. Useful statistical information about the Arabic language is first extracted and then used to estimate the probabilistic parameters of the HMM. A new custom implementation of the HMM is developed in this study, in which the transition matrix is built from a large collected corpus and the emission matrix is built from the results obtained via the extracted character features. Recognition is carried out with the Viterbi algorithm, which selects the most probable sequence of sub-words. The model was implemented to recognize Arabic text at the sub-word level, so that the recognition rate is no longer tied to the worst recognition rate for any single character and instead depends on the overall structure of the Arabic language. Numerical results show a potentially large improvement in recognition from using the proposed algorithms.
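The pipeline the abstract outlines (transition probabilities estimated from a corpus, emission scores from a per-character classifier, Viterbi decoding over a sub-word) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the add-one smoothing, the toy three-letter alphabet (Latin letters standing in for Arabic characters), and the emission scores are all assumptions made for illustration.

```python
import math


def estimate_transitions(corpus, alphabet, smoothing=1.0):
    """Estimate start and bigram transition probabilities from sub-words.

    Add-one smoothing is an assumption of this sketch; it keeps every
    probability positive so that logarithms are always defined.
    """
    start = {ch: smoothing for ch in alphabet}
    trans = {a: {b: smoothing for b in alphabet} for a in alphabet}
    for sub_word in corpus:
        if not sub_word:
            continue
        start[sub_word[0]] += 1
        for prev, nxt in zip(sub_word, sub_word[1:]):
            trans[prev][nxt] += 1
    start_total = sum(start.values())
    start = {ch: count / start_total for ch, count in start.items()}
    for a in alphabet:
        row_total = sum(trans[a].values())
        trans[a] = {b: count / row_total for b, count in trans[a].items()}
    return start, trans


def viterbi(start, trans, emissions):
    """Return the most probable character sequence for one sub-word.

    emissions is a list with one dict per character position, mapping each
    character to a positive score from a (hypothetical) feature classifier.
    """
    alphabet = list(start)
    score = {s: math.log(start[s]) + math.log(emissions[0][s]) for s in alphabet}
    back = []  # back[t][s] = best predecessor of s at position t + 1
    for obs in emissions[1:]:
        pointers, new_score = {}, {}
        for s in alphabet:
            best_prev = max(alphabet, key=lambda p: score[p] + math.log(trans[p][s]))
            pointers[s] = best_prev
            new_score[s] = (score[best_prev] + math.log(trans[best_prev][s])
                            + math.log(obs[s]))
        back.append(pointers)
        score = new_score
    path = [max(alphabet, key=lambda s: score[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return "".join(reversed(path))


# Toy data: Latin letters stand in for Arabic characters (assumption).
alphabet = ["a", "b", "c"]
corpus = ["ab", "ab", "abc", "cab"]
start, trans = estimate_transitions(corpus, alphabet)

# Hypothetical classifier scores for a two-character sub-word; the second
# position is ambiguous and slightly favours "c" when taken in isolation.
emissions = [
    {"a": 0.8, "b": 0.1, "c": 0.1},
    {"a": 0.1, "b": 0.4, "c": 0.5},
]
decoded = viterbi(start, trans, emissions)
greedy = "".join(max(e, key=e.get) for e in emissions)
print(decoded, greedy)
```

In this toy case the per-character choice is "ac", while Viterbi decoding returns "ab" because the corpus makes the a-to-b transition far more likely. That is the effect the abstract attributes to decoding at the sub-word level rather than character by character.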
Pages: 12