Optical character recognition of handwritten Arabic using hidden Markov models

被引:1
|
作者
Aulama, Mohannad M. [1 ]
Natsheh, Asem M. [1 ]
Abandah, Gheith A. [1 ]
Olama, Mohammed M. [2 ]
机构
[1] Univ Jordan, Dept Comp Engn, Amman 11942, Jordan
[2] CSED, Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
来源
关键词
Character recognition; OCR; Arabic OCR; hidden Markov models (HMMs); Viterbi algorithm;
D O I
10.1117/12.884087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of optical character recognition (OCR) of handwritten Arabic has not received a satisfactory solution yet. In this paper, an Arabic OCR algorithm is developed based on Hidden Markov Models (HMMs) combined with the Viterbi algorithm, which results in an improved and more robust recognition of characters at the sub-word level. Integrating the HMMs represents another step of the overall OCR trends being currently researched in the literature. The proposed approach exploits the structure of characters in the Arabic language in addition to their extracted features to achieve improved recognition rates. Useful statistical information of the Arabic language is initially extracted and then used to estimate the probabilistic parameters of the mathematical HMM. A new custom implementation of the HMM is developed in this study, where the transition matrix is built based on the collected large corpus, and the emission matrix is built based on the results obtained via the extracted character features. The recognition process is triggered using the Viterbi algorithm which employs the most probable sequence of sub-words. The model was implemented to recognize the sub-word unit of Arabic text raising the recognition rate from being linked to the worst recognition rate for any character to the overall structure of the Arabic language. Numerical results show that there is a potentially large recognition improvement by using the proposed algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Structural hidden Markov models: An application to handwritten numeral recognition
    Bouchaffra, D.
    Tan, J.
    INTELLIGENT DATA ANALYSIS, 2006, 10 (01) : 67 - 79
  • [42] Recognising handwritten Arabic manuscripts using a single hidden Markov model
    Khorsheed, MS
    PATTERN RECOGNITION LETTERS, 2003, 24 (14) : 2235 - 2242
  • [43] Arabic Handwritten Character Recognition Using Machine Learning Approaches
    Ali, Amani Ali Ahmed
    Suresha, M.
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 187 - 192
  • [44] Recognition of off-line printed Arabic text using Hidden Markov Models
    Al-Muhtaseb, Husni A.
    Mahmoud, Sabri A.
    Qahwaji, Rami S.
    SIGNAL PROCESSING, 2008, 88 (12) : 2902 - 2912
  • [45] Appearance-based Arabic Sign Language Recognition using Hidden Markov Models
    Ahmed, A. Abdelbaky
    Aly, Saleh
    2014 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2014,
  • [46] Arabic and Latin handwritting recognition using multi-stream hidden Markov models
    Kessentini, Yousri
    Paquet, Thierry
    Ben Hamadou, AbdelMajid
    TRAITEMENT DU SIGNAL, 2009, 26 (05) : 395 - 407
  • [47] Off-line Farsi Arabic Handwritten Word Recognition Using Vector Quantization and Hidden Markov Model
    Vaseghi, Behroz
    Alirezaee, Shahpour
    Ahmadi, Majid
    Amirfattahi, Rasoul
    INMIC: 2008 INTERNATIONAL MULTITOPIC CONFERENCE, 2008, : 575 - +
  • [48] A Novel Character Recognition Algorithm Based on Hidden Markov Models
    Wang, Yu
    Wei, Xueye
    Han, Lei
    Wu, Xiaojin
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PROCEEDINGS, 2009, 5855 : 298 - 305
  • [49] Optical Handwritten with Character Recognition
    Zahra, Syeda Binish
    Moaen, Shanza
    Munir, Sundus
    Hassan, Arfa
    Nadeem, Afrozah
    Farooq, Muhammad Sajid
    4TH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING (IC)2, 2021, : 562 - 569
  • [50] Javanese character recognition using Hidden Markov Model
    Widiarti, Anastasia Rita
    Wastu, Phalita Nari
    World Academy of Science, Engineering and Technology, 2009, 57 : 261 - 264