Color, complex document segmentation and compression

被引:0
|
作者
Fung, HT
Parker, KJ
机构
来源
DOCUMENT RECOGNITION IV | 1997年 / 3027卷
关键词
document; color documents; complex documents; segmentation; and compression;
D O I
10.1117/12.270071
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We propose a novel segmentation algorithm called SMART (Segmentation by subjecting Macroblocks of Active Regions to the binarizability Test) for color, complex documents. It decomposes a document image into ''binarizable'' and ''non-binarizable'' components. The segmentation procedure includes color transformation, halftone texture suppression, subdivision of the image into 8x8 blocks, classification of the 8x8 blocks as ''active'' or ''inactive,'' formation of macroblocks from the active blocks, and classification of the macroblocks as binarizable or non-binarizable. The classification processes involve the DCT coefficients and a histogram analysis. SMART is compared to three well-known segmentation algorithms: CRLA,(1) RXYC,(2) and SPACE.(3) SMART can handle image components of various shapes, multiple backgrounds of different gray levels, different relative grayness of text to its background, tilted image components, and text of different gray levels. To compress the segmented image, we apply JPEG(4) to the non-binarizable macroblocks and the Group 4 coding scheme(5) to the binary image representing the binarizable macroblocks and to the bitmap, storing the configuration of all macroblocks. Data about the representative gray values, the color information, and other descriptors of the binarizable macroblocks and the background regions are also sent to allow image reconstruction. The gain in using our compression algorithm over using JPEG for the whole image is significant. This gain increases as the proportion of the size of the binarizable macroblocks and the background regions to the image size increases. Psychovisual experiments also show that the subjects prefer the reconstructed images from our compression algorithm to those from the bitrate-matching JPEG images. In a series of test images, this document segmentation and compression system enables compression ratios two times to six times improved over standard methods.
引用
收藏
页码:180 / 191
页数:2
相关论文
共 50 条
  • [1] Color segmentation of complex document images
    Nikolaou, N.
    Papamarkos, N.
    [J]. VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2006, : 220 - +
  • [2] Color segmentation of complex document images
    Nikolaou, N.
    Papamarkos, N.
    [J]. ADVANCES IN COMPUTER GRAPHICS AND COMPUTER VISION, 2007, 4 : 251 - 263
  • [3] Foreground text segmentation in complex color document images using Gabor filters
    S. Nirmala
    P. Nagabhushan
    [J]. Signal, Image and Video Processing, 2012, 6 : 669 - 678
  • [4] Foreground text segmentation in complex color document images using Gabor filters
    Nirmala, S.
    Nagabhushan, P.
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2012, 6 (04) : 669 - 678
  • [5] Text Segmentation for MRC Document Compression
    Haneda, Eri
    Bouman, Charles A.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (06) : 1611 - 1626
  • [6] Color halftone document segmentation and descreening
    Kuo, CH
    Tewfik, AH
    Rao, AR
    [J]. 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2001, : 1065 - 1068
  • [7] Color document synthesis as a compression strategy
    Monte da Silva, Jodo Marcelo
    Lins, Rafael Duelre
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 466 - 470
  • [8] Wavelet-based images compression of color document by fuzzy picture-text segmentation
    Wu, BF
    Chiu, CC
    Lin, WL
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2003, 26 (01) : 113 - 118
  • [9] Color document image segmentation for automated document entry systems
    Suen, HM
    Wang, JF
    [J]. 1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 131 - 136
  • [10] A general segmentation scheme for DjVu document compression
    Haffner, P
    Bottou, L
    LeCun, Y
    Vincent, L
    [J]. MATHEMATICAL MORPHOLOGY, PROCEEDINGS, 2002, : 17 - 36