Color Reduction for Complex Document Images

被引:49
|
作者
Nikolaou, Nikos [1 ]
Papamarkos, Nikos [1 ]
机构
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Image Proc & Multimedia Lab, GR-67100 Xanthi, Greece
关键词
color reduction; text information extraction; mean-shift; edge preserving smoothing; MEAN SHIFT; QUANTIZATION; ALGORITHM; SPACE; TEXT; SEGMENTATION; EXTRACTION; MAP;
D O I
10.1002/ima.20174
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new technique for color reduction of complex document images is presented in this article. It reduces significantly the number of colors of the document image (less than 15 colors in most of the cases) so as to have solid characters and uniform local backgrounds. Therefore, this technique can be used as a preprocessing step by text information extraction applications. Specifically, using the edge map of the document image, a representative set of samples is chosen that constructs a 3D color histogram. Based on these samples in the 3D color space, a relatively large number of colors (usually no more than 100 colors) are obtained by using a simple clustering procedure. The final colors are obtained by applying a mean-shift based procedure. Also, an edge preserving smoothing filter is used as a preprocessing stage that enhances significantly the quality of the initial image. Experimental results prove the method's capability of producing correctly segmented complex color documents where the character elements can be easily extracted as connected components. (C) 2009 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 19, 14-26, 2009; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.20174
引用
收藏
页码:14 / 26
页数:13
相关论文
共 50 条
  • [1] Color segmentation of complex document images
    Nikolaou, N.
    Papamarkos, N.
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2006, : 220 - +
  • [2] Color segmentation of complex document images
    Nikolaou, N.
    Papamarkos, N.
    ADVANCES IN COMPUTER GRAPHICS AND COMPUTER VISION, 2007, 4 : 251 - 263
  • [3] Separation of Foreground Text from Complex Background in Color Document Images
    Shivananda, Nirmala
    Nagabhushan, P.
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 306 - 309
  • [4] Foreground text segmentation in complex color document images using Gabor filters
    S. Nirmala
    P. Nagabhushan
    Signal, Image and Video Processing, 2012, 6 : 669 - 678
  • [5] Foreground text segmentation in complex color document images using Gabor filters
    Nirmala, S.
    Nagabhushan, P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2012, 6 (04) : 669 - 678
  • [6] Stamp Detection in Color Document Images
    Micenkova, Barbora
    van Beusekom, Joost
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1125 - 1129
  • [7] Color, complex document segmentation and compression
    Fung, HT
    Parker, KJ
    DOCUMENT RECOGNITION IV, 1997, 3027 : 180 - 191
  • [8] A Character Degradation Model for Color Document Images
    Do Thi Luyen
    Carel, Elodie
    Ogier, Jean-Marc
    Burie, Jean-Christophe
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 806 - 810
  • [9] SEPARATION OF OVERLAPPED COLOR PLANES FOR DOCUMENT IMAGES
    Zheng, Danian
    Sun, Jun
    Naoi, Satoshi
    Suwa, Misako
    Takebe, Hiroaki
    Hotta, Yoshinobu
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1949 - 1952
  • [10] Restoration of Old Document Images using Different Color Spaces Restoration of Old Document Images
    Sgarbi, Ederson Marcos
    Della Mura, Wellington Aparecido
    Moya, Nikolas
    Facon, Jacques
    Legal Ayala, Horacio A.
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS (VISAPP), VOL 1, 2014, : 82 - 88