Hierarchical content classification and script determination for automatic document image processing

被引:9
|
作者
Chi, Z [1 ]
Wang, Q
Siu, WC
机构
[1] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
[2] Northwestern Polytech Univ, Dept Comp Sci & Engn, Xian 710072, Peoples R China
关键词
document image processing; page segmentation; content classification; script determination; background thinning; cross-correlation; Kolmogorov complexity; neural networks;
D O I
10.1016/S0031-3203(03)00128-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Page segmentation and image content classification play an important role in automatic image processing with applications to mixed-type document image compression, form and check reading, and automatic mail sorting. In this paper, we first present an enhanced background thinning based approach for fast page segmentation. After the analysis of three different methods individually, a hierarchical approach for document content classification is proposed, which classifies a sub-image into one of two categories: text and halftone. Our approach combines a neural network model, cross-correlation metric, and Kolmogorov complexity measure in a hierarchical structure. Considering the necessity of a recognition system, we also propose using a three-layer feedforward neural network to classify text regions into Chinese and English scripts. The classification accuracy on a number of document images reaches 100% and 97.1% for halftone region and text region, respectively. Meanwhile, the system can achieve a correct rate of 92.3% and 95.0% for Chinese and alphabetic script determination, respectively. (C) 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:2483 / 2500
页数:18
相关论文
共 50 条
  • [1] Hierarchical content classification and script determination for automatic document image processing
    Wang, Q
    Chi, Z
    Zhao, RC
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 77 - 80
  • [2] Page segmentation and content classification for automatic document image processing
    Yip, SK
    Chi, Z
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 279 - 282
  • [3] Determination of the script and language content of document images
    Spitz, AL
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (03) : 235 - 245
  • [4] AN EXPERIMENT IN AUTOMATIC HIERARCHICAL DOCUMENT CLASSIFICATION
    GARLAND, K
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1983, 19 (03) : 113 - 120
  • [5] Automatic hierarchical color image classification
    Huang, J
    Kumar, SR
    Zabih, R
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 151 - 159
  • [6] Automatic Hierarchical Color Image Classification
    Jing Huang
    S. Ravi Kumar
    Ramin Zabih
    [J]. EURASIP Journal on Advances in Signal Processing, 2003
  • [7] Hierarchical classification for automatic image annotation
    Dept of Computer Science, UNC-Charlotte, Charlotte, NC 28223, United States
    [J]. Proc. Annu. Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., 2007, (111-118):
  • [8] AUTOMATIC IMAGE ORIENTATION DETECTION WITH PRIOR HIERARCHICAL CONTENT-BASED CLASSIFICATION
    Cingovska, Ivana
    Ivanovski, Zoran
    Martin, Francois
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [9] Automatic detection of document script and orientation
    Lu, Shijian
    Tan, Chew Lim
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 237 - 241
  • [10] Script identification of document image analysis
    Cheng, Juan
    Ping, Xijian
    Zhou, Guanwei
    Yang, Yang
    [J]. ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 178 - +