Color Reduction for Complex Document Images

被引:49
|
作者
Nikolaou, Nikos [1 ]
Papamarkos, Nikos [1 ]
机构
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Image Proc & Multimedia Lab, GR-67100 Xanthi, Greece
关键词
color reduction; text information extraction; mean-shift; edge preserving smoothing; MEAN SHIFT; QUANTIZATION; ALGORITHM; SPACE; TEXT; SEGMENTATION; EXTRACTION; MAP;
D O I
10.1002/ima.20174
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new technique for color reduction of complex document images is presented in this article. It reduces significantly the number of colors of the document image (less than 15 colors in most of the cases) so as to have solid characters and uniform local backgrounds. Therefore, this technique can be used as a preprocessing step by text information extraction applications. Specifically, using the edge map of the document image, a representative set of samples is chosen that constructs a 3D color histogram. Based on these samples in the 3D color space, a relatively large number of colors (usually no more than 100 colors) are obtained by using a simple clustering procedure. The final colors are obtained by applying a mean-shift based procedure. Also, an edge preserving smoothing filter is used as a preprocessing stage that enhances significantly the quality of the initial image. Experimental results prove the method's capability of producing correctly segmented complex color documents where the character elements can be easily extracted as connected components. (C) 2009 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 19, 14-26, 2009; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.20174
引用
收藏
页码:14 / 26
页数:13
相关论文
共 50 条
  • [21] LOCATING TEXT IN COMPLEX COLOR IMAGES
    ZHONG, Y
    KARU, K
    JAIN, AK
    PATTERN RECOGNITION, 1995, 28 (10) : 1523 - 1535
  • [22] Verification of color characteristics of document images captured in uncontrolled conditions
    Kunina, I. A.
    Padas, O. A.
    Kolomyttseva, O. A.
    COMPUTER OPTICS, 2024, 48 (04) : 554 - 561
  • [23] Foreground Text Extraction in Color Document Images for Enhanced Readability
    Nirmala, S.
    Nagabhushan, P.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 387 - 392
  • [24] Color Quantization in Document Images Using Biogeography Based Optimization
    Gupta, Surbhi
    Bhardwaj, Deepti
    Sandhu, Parvinder S.
    SOFTWARE AND COMPUTER APPLICATIONS, 2011, 9 : 72 - 78
  • [25] Skew detection and reconstruction of color-printed document images
    Chen, YK
    Wang, JF
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (08): : 1018 - 1024
  • [26] Skew detection and reconstruction of color-printed document images
    Chen, Yi-Kai
    Wang, Jhing-Fa
    IEICE Transactions on Information and Systems, 2001, E84-D (08) : 1018 - 1024
  • [27] Fast Integral MeanShift : Application to Color Segmentation of Document Images
    LeBourgeois, Frank
    Drira, Fadoua
    Gaceb, Djamel
    Jean Duong
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 52 - 56
  • [29] Robust filter for noise reduction in color images
    Szczepanski, M
    Smolka, B
    Plataniotis, KN
    Venetsanopoulos, AN
    CGIV'2002: FIRST EUROPEAN CONFERENCE ON COLOUR IN GRAPHICS, IMAGING, AND VISION, CONFERENCE PROCEEDINGS, 2002, : 517 - 522
  • [30] Nonparametric technique of noise reduction in color images
    Smolka, B
    Lukac, R
    Plataniotis, K
    Venetsanopoulos, A
    OPTICAL METROLOGY FOR ARTS AND MULTIMEDIA, 2003, 5146 : 254 - 265