Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK)

Cited by: 46
Author
Khorsheed, M. S. [1]
Affiliation
[1] King Abdulaziz Univ Sci & Technol, Riyadh 11442, Saudi Arabia
Keywords
document analysis; pattern analysis and recognition; machine vision; Arabic OCR; HTK
DOI
10.1016/j.patrec.2007.03.014
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a cursive Arabic text recognition system. The system decomposes the document image into text line images and extracts a set of simple statistical features from a narrow window that slides along each text line. It then feeds the resulting feature vectors to the Hidden Markov Model Toolkit (HTK), a portable toolkit for building speech recognition systems. The proposed system is applied to a data corpus of Arabic text comprising more than 600 A4-size sheets typewritten in multiple computer-generated fonts. (c) 2007 Elsevier B.V. All rights reserved.
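The sliding-window step described in the abstract can be sketched roughly as follows. This is only an illustration: the abstract does not name the features, so the three used here (ink density, vertical centre of gravity, and background-to-ink transitions in the centre column) are assumptions, as are the function name `sliding_window_features`, the window width, the step, and the right-to-left scan order. Writing the vectors out for HTK is not shown.

```python
from typing import List

# A binarised text line image: rows of 0 (background) and 1 (ink).
Image = List[List[int]]


def sliding_window_features(line: Image, width: int = 4, step: int = 1) -> List[List[float]]:
    """Return one illustrative feature vector per window position along a text line."""
    height = len(line)
    n_cols = len(line[0]) if height else 0
    vectors: List[List[float]] = []
    # Assumption: scan from the right edge, since Arabic is read right to left.
    for right in range(n_cols, width - 1, -step):
        left = right - width
        window = [[row[c] for c in range(left, right)] for row in line]
        ink = sum(sum(r) for r in window)
        # Feature 1: fraction of ink pixels in the window.
        density = ink / (width * height)
        # Feature 2: vertical centre of gravity of the ink, normalised to [0, 1]
        # (0.5 is used as a neutral value for an empty window).
        if ink:
            cog = sum(y * sum(r) for y, r in enumerate(window)) / (ink * max(height - 1, 1))
        else:
            cog = 0.5
        # Feature 3: background-to-ink transitions down the window's centre column.
        mid = left + width // 2
        column = [row[mid] for row in line]
        transitions = sum(1 for a, b in zip(column, column[1:]) if a == 0 and b == 1)
        vectors.append([density, cog, float(transitions)])
    return vectors
```

Each text line then becomes a sequence of short vectors, one per window position, which is the kind of frame sequence an HMM toolkit built for speech expects as input.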
Pages: 1563-1571
Number of pages: 9