Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK)

Cited by: 46
Authors
Khorsheed, M. S. [1 ]
Affiliations
[1] King Abdulaziz Univ Sci & Technol, Riyadh 11442, Saudi Arabia
Keywords
document analysis; pattern analysis and recognition; machine vision; Arabic OCR; HTK;
DOI
10.1016/j.patrec.2007.03.014
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a cursive Arabic text recognition system. The system decomposes the document image into text line images and extracts a set of simple statistical features from a narrow window that slides along each text line. It then feeds the resulting feature vectors to the Hidden Markov Model Toolkit (HTK), a portable toolkit for building speech recognition systems. The proposed system is applied to a data corpus that includes Arabic text from more than 600 A4-size sheets typewritten in multiple computer-generated fonts. (c) 2007 Elsevier B.V. All rights reserved.
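The sliding-window step described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not say which statistical features are used, so the sketch assumes a binarized text line (1 = ink), a non-overlapping window that moves right to left (the Arabic reading direction), and ink density per vertical cell as the feature. The function name `sliding_window_features` and the parameters `window_width` and `cells` are hypothetical.

```python
def sliding_window_features(line_image, window_width=2, cells=3):
    """Return one feature vector per window position.

    line_image: list of pixel rows, each a list of 0/1 values (1 = ink).
    Assumption: each feature is the ink density of one vertical cell of
    the window; the paper's actual feature set is not given in the abstract.
    """
    height = len(line_image)
    width = len(line_image[0]) if height else 0
    vectors = []
    # Slide from the right edge to the left edge in non-overlapping steps.
    for right in range(width, window_width - 1, -window_width):
        left = right - window_width
        vector = []
        for c in range(cells):
            # Split the window height into `cells` vertical bands.
            top = c * height // cells
            bottom = (c + 1) * height // cells
            ink = sum(
                line_image[y][x]
                for y in range(top, bottom)
                for x in range(left, right)
            )
            area = (bottom - top) * window_width
            vector.append(ink / area if area else 0.0)
        vectors.append(vector)
    return vectors


# Tiny hand-made "text line": 3 rows by 4 columns.
demo_line = [
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]
demo_vectors = sliding_window_features(demo_line)
```

Each inner list corresponds to one window position, so the text line becomes a sequence of observation vectors, which is the kind of input an HMM toolkit such as HTK is built to model. Writing those vectors into HTK's own parameter file format is a separate step and is not shown here.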
Pages: 1563-1571
Number of pages: 9
Related papers
50 in total
  • [1] Using HMM Toolkit (HTK) for Recognition of Arabic Manuscripts Characters
    Maqqor, Ahlam
    Halli, Akram
    Satori, Khalid
    Tairi, Hamid
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 475 - 479
  • [2] Recognition of Off-line Arabic Handwriting words Using HMM Toolkit (HTK)
    El Moubtahij, Hicham
    Satori, Khalid
    Halli, Akram
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 167 - 171
  • [3] Offline arabic text recognition system
    Sarfraz, M
    Nawaz, SN
    Al-Khuraidly, A
    2003 INTERNATIONAL CONFERENCE ON GEOMETRIC MODELING AND GRAPHICS, PROCEEDINGS, 2003, : 30 - 35
  • [4] Offline Arabic Handwriting Recognition System based on HMM
    Xiang, Dong
    Yan, Huahua
    Chen, Xianqiao
    Cheng, Yanfen
    PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 526 - 529
  • [5] A Database for Offline Arabic Handwritten Text Recognition
    Mahmoud, Sabri A.
    Ahmad, Irfan
    Alshayeb, Mohammed
    Al-Khatib, Wasfi G.
    IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT II: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, 2011, 6754 : 397 - 406
  • [6] Offline Arabic Handwritten Text Recognition: A Survey
    Parvez, Mohammad Tanvir
    Mahmoud, Sabri A.
    ACM COMPUTING SURVEYS, 2013, 45 (02)
  • [7] An offline arabic text recognition system using syntactic approach
    Nawaz, SN
    Ahmed, MJ
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 294 - 299
  • [8] PC based offline Arabic text recognition system
    Zidouri, A
    Sarfraz, M
    Nawaz, SN
    Ahmad, MJ
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 2, PROCEEDINGS, 2003, : 431 - 434
  • [9] Speech Recognition using HTK Toolkit for Marathi Language
    Chavan, Supriya S.
    Handore, S. M.
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1591 - 1597
  • [10] Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models
    Espana-Boquera, Salvador
    Jose Castro-Bleda, Maria
    Gorbe-Moya, Jorge
    Zamora-Martinez, Francisco
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (04) : 767 - 779