Color Reduction for Complex Document Images

Cited by: 49
Authors
Nikolaou, Nikos [1]
Papamarkos, Nikos [1]
Affiliation
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Image Proc & Multimedia Lab, GR-67100 Xanthi, Greece
Keywords
color reduction; text information extraction; mean-shift; edge preserving smoothing; MEAN SHIFT; QUANTIZATION; ALGORITHM; SPACE; TEXT; SEGMENTATION; EXTRACTION; MAP;
DOI
10.1002/ima.20174
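Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract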
This article presents a new technique for color reduction of complex document images. It significantly reduces the number of colors in the document image (to fewer than 15 in most cases), yielding solid characters and uniform local backgrounds, so it can serve as a preprocessing step for text information extraction applications. Specifically, the edge map of the document image is used to choose a representative set of samples, from which a 3D color histogram is constructed. From these samples in the 3D color space, a simple clustering procedure produces a relatively large number of colors (usually no more than 100). The final colors are then obtained by applying a mean-shift based procedure. In addition, an edge preserving smoothing filter is applied as a preprocessing stage, which significantly enhances the quality of the initial image. Experimental results show that the method can produce correctly segmented complex color documents in which the character elements can be easily extracted as connected components. © 2009 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 19, 14-26, 2009; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.20174
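The last stage described in the abstract, reducing a set of initial colors to a few final colors with a mean-shift based procedure, can be illustrated with a minimal sketch. This is not the authors' implementation: the flat kernel, the `bandwidth` value, the merge threshold of half the bandwidth, the function name `mean_shift_colors`, and the sample colors and pixel counts are all assumptions made for illustration.

```python
from math import dist  # Euclidean distance, Python 3.8+


def mean_shift_colors(colors, weights, bandwidth=40.0, max_iter=50, tol=1e-3):
    """Illustrative flat-kernel mean shift over a small set of RGB colors.

    colors  : list of (r, g, b) tuples, e.g. initial cluster centers
    weights : positive pixel counts, one per color (assumed input)
    Returns (final_colors, labels), where labels[i] indexes final_colors.
    """
    modes = []
    for start in colors:
        point = tuple(float(c) for c in start)
        for _ in range(max_iter):
            # Neighbors inside the kernel window around the current point.
            near = [(c, w) for c, w in zip(colors, weights)
                    if dist(c, point) <= bandwidth]
            total = sum(w for _, w in near)
            if total == 0:
                break
            # Shift to the weighted mean of those neighbors.
            new_point = tuple(sum(c[k] * w for c, w in near) / total
                              for k in range(3))
            moved = dist(new_point, point)
            point = new_point
            if moved < tol:
                break
        modes.append(point)

    # Merge modes that converged close to each other (assumed threshold).
    final_colors, labels = [], []
    for mode in modes:
        for i, kept in enumerate(final_colors):
            if dist(mode, kept) < bandwidth / 2:
                labels.append(i)
                break
        else:
            final_colors.append(mode)
            labels.append(len(final_colors) - 1)
    return final_colors, labels


# Hypothetical input: three dark and two light initial colors with pixel counts.
initial = [(0, 0, 0), (10, 10, 10), (5, 0, 5), (250, 250, 250), (240, 245, 250)]
counts = [100, 50, 20, 300, 100]
final_colors, labels = mean_shift_colors(initial, counts)
```

With these hypothetical inputs, the dark and light groups are much farther apart than the bandwidth, so each group collapses to its own weighted mean and two final colors remain. In practice the bandwidth would need tuning per image.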
Pages: 14-26
Number of pages: 13