An HMM-based OCR for Persian/Arabic texts

Cited by: 0
Authors
Ahmadi, A [1]
Omatu, S [1]
Yoshioka, M [1]
Affiliations
[1] Osaka Prefecture Univ, Div Comp & Syst Sci, Dept Engn, Sakai, Osaka 5998531, Japan
Keywords
OCR; Persian; Arabic; HMM; character recognition; word recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We present a system for recognizing Persian/Arabic text that uses hidden Markov models (HMMs) for character-level classification. The text is first segmented into words, and the words are then segmented into characters. A shadow coding mask is used to extract character features. A self-organizing map (SOM) then clusters the features and reduces the size of the inputs. Finally, using the HMM classifier and a level-building algorithm, words are composed from the recognized characters. The system is evaluated on 10 pages of printed Persian text; the mean correct classification rate is 95.1% at the word level and 99.4% at the character level.
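To make the character-level classification step concrete, here is a minimal sketch of scoring a sequence of discrete feature symbols against several character HMMs and picking the most likely one. It is not the authors' implementation: the function names, the two toy models ("alef", "beh"), the three-symbol codebook, and every probability below are hypothetical, and it only assumes that the clustering step maps each feature vector to a codebook index that serves as the HMM observation symbol. Segmentation, shadow coding, SOM training, and the level-building step are not shown.

```python
import math


def forward_log_likelihood(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm).

    obs   : non-empty list of observation symbol indices
    start : start[s] is the probability of starting in state s
    trans : trans[p][s] is the probability of moving from state p to state s
    emit  : emit[s][k] is the probability of state s emitting symbol k
    """
    n_states = len(start)
    # Initialise with the start distribution and the first symbol's emission.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_like = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate one step through the transitions, then apply emissions.
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        # Rescale to avoid underflow and accumulate the log of the scale factors.
        alpha = [a / scale for a in alpha]
        log_like += math.log(scale)
    return log_like


def classify(obs, models):
    """Return the name of the model with the highest log-likelihood for obs."""
    return max(models, key=lambda name: forward_log_likelihood(obs, *models[name]))


# Hypothetical two-state left-to-right models over a three-symbol codebook.
# All values are made up for illustration only.
LEFT_TO_RIGHT = [[0.6, 0.4], [0.0, 1.0]]
models = {
    "alef": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    "beh": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.1, 0.8], [0.1, 0.8, 0.1]]),
}

if __name__ == "__main__":
    print(classify([0, 0, 1, 1], models))
    print(classify([2, 2, 1], models))
```

With these toy parameters the first sequence starts with symbols that only the "alef" model emits with high probability, and the second starts with symbols favoured by "beh", so each is assigned to the corresponding model. A real system would estimate the parameters from training data rather than set them by hand.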
Pages: 824-828
Page count: 5