Touching Text Character Localization in Graphical Documents Using SIFT

被引:0
|
作者
Pratim Roy, Partha [1 ]
Pal, Umapada [2 ]
Llados, Josep [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Bellaterra 08193, Barcelona, Spain
[2] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 108, India
关键词
TEXT/GRAPHICS SEPARATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Interpretation of graphical document images is a challenging task as it requires proper understanding of text/graphics symbols present in such documents. Difficulties arise in graphical document recognition when text and symbol overlapped/touched. Intersection of text and symbols with graphical lines and curves occur frequently in graphical documents and hence separation of such symbols is very difficult. Several pattern recognition and classification techniques exist to recognize isolated text/symbol. But, the touching/overlapping text and symbol recognition has not yet been dealt successfully. An interesting technique, Scale Invariant Feature Transform (SIFT), originally devised for object recognition can take care of overlapping problems. Even if SIFT features have emerged as a very powerful object descriptors, their employment in graphical documents context has not been investigated much. In this paper we present the adaptation of the SIFT approach in the context of text character localization (spotting) in graphical documents. We evaluate the applicability of this technique in such documents and discuss the scope of improvement by combining some state-of-the-art approaches.
引用
收藏
页码:199 / +
页数:2
相关论文
共 50 条
  • [1] Multi-oriented touching text character segmentation in graphical documents using dynamic programming
    Pratim Roy, Partha
    Pal, Umapada
    Llados, Josep
    Delalandre, Mathieu
    PATTERN RECOGNITION, 2012, 45 (05) : 1972 - 1983
  • [2] Touching Character Segmentation Method for Chinese Historical Documents
    Sun, Xiaolu
    Peng, Liangrui
    Ding, Xiaoqing
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [3] Localization Of Touching Letters In Arabic Handwritten Documents
    Nabil, Aouadi
    Echi, Afef Kacem
    Belaid, Abdel
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 501 - 506
  • [4] Recognition of Multi-Oriented Touching Characters in Graphical Documents
    Roy, Partha Pratim
    Pal, Umapada
    Llados, Josep
    SIXTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS & IMAGE PROCESSING ICVGIP 2008, 2008, : 297 - +
  • [5] Multi-Oriented Text Recognition in Graphical Documents using HMM
    Roy, Partha Pratim
    Roy, Sangheeta
    Pal, Umapada
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 136 - 140
  • [6] Text line extraction in graphical documents using background and foreground information
    Partha Pratim Roy
    Umapada Pal
    Josep Lladós
    International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 227 - 241
  • [7] Text line extraction in graphical documents using background and foreground information
    Pratim Roy, Partha
    Pal, Umapada
    Llados, Josep
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (03) : 227 - 241
  • [8] Text localization in color documents
    Nikolaou, N.
    Badekas, E.
    Papamarkos, N.
    Strouthopoulos, C.
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2006, : 181 - +
  • [9] A COMPLETE SYSTEM FOR DETECTION AND RECOGNITION OF TEXT IN GRAPHICAL DOCUMENTS USING BACKGROUND INFORMATION
    Pratim Roy, Partha
    Llados, Josep
    Pal, Umapada
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 209 - +
  • [10] Optical character recognition for degraded text documents
    Sanyal, Sudip
    Dhingra, Kapil Dev
    Sharma, Pramod Kumar
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 1988 - +