Text-Graphics Separation to Detect Logo and Stamp from Color Document Images: A Spectral Approach

被引:0
|
作者
Nandedkar, Amit Vijay [1 ]
Mukhopadhyay, Jayanta [2 ]
Sural, Shamik [1 ]
机构
[1] IIT Kharagpur, Sch Informat Technol, Kharagpur, W Bengal, India
[2] IIT Kharagpur, Dept Comp Sci & Engn, Kharagpur, W Bengal, India
关键词
BLOCK SEGMENTATION; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text and graphics separation is an important task in the field of document image processing. This work aims at detecting graphics such as logos and stamps in a scanned document image. A novel spectral filtering based text-graphics separation algorithm (SFTGS) is presented here. The property of text that it is the major source of high spatial frequency components in a document image, is exploited in this algorithm. Accordingly high frequency filtering is used to separate the text symbols. This is followed by a segmentation process for delineating residual text and the graphics. The main advantage of SFTGS is that it works in a single pass, and can discriminate graphics and text without supervised training. Subsequently, the graphics segments are further categorized into two different classes, namely logos and stamps. In this case, we assume that these are the two classes of graphical objects present in the documents. The technique is evaluated using publicly available document dataset consisting of graphics as stamps and logos. The result is compared with existing approaches reported in the literature, and it is found that the proposed method performs superior to them. An overall performance of 89.1% recall and 96.9% precision is obtained for SFTGS.
引用
收藏
页码:571 / 575
页数:5
相关论文
共 14 条
  • [1] Stamp and Logo Detection from Document Images by Finding Outliers
    Dey, Soumyadeep
    Mukherjee, Jayanta
    Sural, Shamik
    [J]. 2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
  • [2] Segmentation of text and graphics from document images
    Chowdhury, S. P.
    Mandal, S.
    Das, A. K.
    Chanda, Bhabatosh
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 619 - +
  • [3] Separation of Foreground Text from Complex Background in Color Document Images
    Shivananda, Nirmala
    Nagabhushan, P.
    [J]. ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 306 - 309
  • [4] A ROBUST ALGORITHM FOR TEXT STRING SEPARATION FROM MIXED TEXT GRAPHICS IMAGES
    FLETCHER, LA
    KASTURI, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) : 910 - 918
  • [5] A knowledge-based approach for Textual Information Extraction from Mixed Text/Graphics Complex Document Images
    Chen, Yen-Lin
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3270 - 3277
  • [6] Handwritten and Machine Printed Text Separation from Kannada Document Images
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Doddamani, Srikanth
    Santosh, K. C.
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [7] Novel Approach to Background-Text-Nontext Separation in Ancient Degraded Document Images
    Asatryan, David
    Sazhumyan, Grigor
    Aznauryan, Lusine
    [J]. 2017 ELEVENTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGIES (CSIT), 2017, : 154 - 157
  • [8] Learning to detect, localize and recognize many text objects in document images from few examples
    Moysset, Bastien
    Kermorvant, Christopher
    Wolf, Christian
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (03) : 161 - 175
  • [9] Learning to detect, localize and recognize many text objects in document images from few examples
    Bastien Moysset
    Christopher Kermorvant
    Christian Wolf
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 161 - 175
  • [10] A knowledge-based system for extracting text-lines from mixed and overlapping text/graphics compound document images
    Chen, Yen-Lin
    Hong, Zeng-Wei
    Chuang, Cheng-Hung
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 494 - 507