Script-independent, HMM-based text line finding for OCR

Cited by: 0
Authors
Lu, ZD [1]
Schwartz, R [1]
Raphael, C [1]
Affiliation
[1] GTE Corp, BBN Technol, Cambridge, MA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a new, script-independent, HMM-based technique to locate text lines on images containing one or more paragraphs of single-column text. The parameters of the HMMs are trained on-line on each image using an unsupervised training procedure. We present results of line finding experiments in Arabic, Chinese and English to demonstrate the performance as well as the script-independent nature of the technique. Comparison of HMM-based line finding with manual line finding shows that the use of the HMM-based technique does not lead to a significant increase in the recognition error rate.
Pages: 551-554
Page count: 4
Related papers
  • [1] Adaptive Script-Independent Text Line Extraction
    Ziaratban, Majid
    Faez, Karim
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (04): : 866 - 877
  • [2] Script-independent text line segmentation in freestyle handwritten documents
    Li, Yi
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) : 1313 - 1329
  • [3] Script-Independent Text Segmentation from Document Images
    Sahare P.
    Tembhurne J.V.
    Parate M.R.
    Diwan T.
    Dhok S.B.
    [J]. International Journal of Ambient Computing and Intelligence, 2022, 13 (01)
  • [4] An HMM-based OCR for Persian/Arabic texts
    Ahmadi, A
    Omatu, S
    Yoshioka, M
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 824 - 828
  • [5] An experimental HMM-based postal OCR system
    Kornai, A
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3177 - 3180
  • [6] Font adaptation of an HMM-based OCR system
    Ait-Mohand, Kamel
    Heutte, Laurent
    Paquet, Thierry
    Ragot, Nicolas
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [7] Script Independent Text Pre-processing and Segmentation for OCR
    Sawant, Archana S.
    Chougule, D. G.
    [J]. 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
  • [8] TCBR-HMM: An HMM-based text classifier with a CBR system
    Borrajo, L.
    Seara Vieira, A.
    Iglesias, E. L.
    [J]. APPLIED SOFT COMPUTING, 2015, 26 : 463 - 473
  • [9] A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books
    Feng, Shaolei
    Manmatha, R.
    [J]. OPENING INFORMATION HORIZONS, 2006, : 109 - +
  • [10] Arabic OCR system analogous to HMM-based ASR systems; Implementation and evaluation
    Rashwan, M.A.
    Fakhr, M.W.
    Attia, M.
    EL-Mahallawy, M.S.
    [J]. Journal of Engineering and Applied Science, 2007, 54 (06): : 653 - 672