Adaptive document block segmentation and classification

被引:26
|
作者
Shih, FY
Chen, SS
机构
[1] Computer Vision Laboratory, Department of Computer and Information Science, New Jersey Institule of Technology, Newark
关键词
D O I
10.1109/3477.537322
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This correspondence presents an adaptive block segmentation and classification technique for daily-received office documents having complex layout structures such as multiple columns and mixed-mode contents of text, graphics, and pictures. First, an improved two-step block segmentation algorithm is performed based on run-length smoothing for decomposing any document into single-mode blocks. Then, a rule-based block classification is used for classifying each block into the test, horizontal/vertical line, graphics, or picture type. The document features and rules used are independent of character font and size and the scanning resolution. Experimental results show that our algorithms are capable of correctly segmenting and classifying different types of mixed-mode printed documents.
引用
收藏
页码:797 / 802
页数:6
相关论文
共 50 条
  • [1] Hierarchical method for block segmentation and classification of general document images
    Park, Young Seak
    Ebina, Tsuyosi
    Ito, Akira
    Systems and Computers in Japan, 1993, 24 (09) : 84 - 96
  • [2] Adaptive image compression based on segmentation and block classification
    El-Sakka, MR
    Kamel, MS
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1999, 10 (01) : 33 - 46
  • [3] Adaptive image compression based on segmentation and block classification
    El-Sakka, MR
    Kamel, MS
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 2, 1998, : 555 - 559
  • [4] Adaptive segmentation of document images
    Sylwester, D
    Seth, S
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 827 - 831
  • [5] An edge-based block segmentation and classification for document analysis with automatic character string extraction
    Park, CJ
    Jeon, JH
    Koo, TM
    Choi, HM
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 707 - 712
  • [6] Document segmentation and classification into musical scores and text
    Pedersoli, Fabrizio
    Tzanetakis, George
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2016, 19 (04) : 289 - 304
  • [7] Document segmentation and classification into musical scores and text
    Fabrizio Pedersoli
    George Tzanetakis
    International Journal on Document Analysis and Recognition (IJDAR), 2016, 19 : 289 - 304
  • [8] Adaptive web document classification with MCRDR
    Kim, YS
    Park, SS
    Deards, E
    Kang, BH
    ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 1, PROCEEDINGS, 2004, : 476 - 480
  • [9] Document segmentation and classification with top-down approach
    Wang, K
    Li, SZ
    Ragupathi, S
    FIRST INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ELECTRONIC SYSTEMS, PROCEEDINGS 1997 - KES '97, VOLS 1 AND 2, 1997, : 243 - 247
  • [10] Adaptive window based uneven lighting document segmentation
    Gu, Guoqing
    Han, Wenwen
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 223 - 226