Segmentation-free word spotting with exemplar SVMs

Cited by: 45
Authors
Almazan, Jon [1]
Gordo, Albert [2]
Fornes, Alicia [1]
Valveny, Ernest [1]
Affiliations
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, Bellaterra 08193, Barcelona, Spain
[2] INRIA Grenoble, Rhone Alpes Res Ctr, F-38330 Montbonnot St Martin, France
Keywords
Word spotting; Segmentation-free; Unsupervised learning; Reranking; Query expansion; Compression; RECOGNITION; MODEL
DOI
10.1016/j.patcog.2014.06.005
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
In this paper we propose an unsupervised, segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vectors to rerank the best retrieved regions, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages. First, a large number of documents can be kept in RAM at the same time. Second, the sliding window becomes significantly faster, since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature in accuracy, speed, and memory usage. (C) 2014 Elsevier Ltd. All rights reserved.
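The speed-up from Product Quantization described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a dot-product score between per-cell query weights and document cell descriptors, random vectors in place of real HOG descriptors, and random codebooks in place of ones learned with k-means. All names and sizes (`D`, `M`, `K`, `build_tables`, `sliding_window_scores`, and so on) are hypothetical. The point it shows is that, once each document cell is stored as a few centroid indices, scoring a window needs only table lookups and additions.

```python
import random

random.seed(0)

# All sizes below are illustrative assumptions, not values from the paper.
D, M, K = 8, 4, 16   # descriptor dimension, sub-quantizers, centroids per sub-quantizer
H, W = 6, 10         # document grid size, in cells
QH, QW = 2, 3        # query window size, in cells


def rand_vec(n):
    """Random vector standing in for a real descriptor."""
    return [random.gauss(0.0, 1.0) for _ in range(n)]


def split(vec, m):
    """Split a vector into m equal-length sub-vectors."""
    step = len(vec) // m
    return [vec[i * step:(i + 1) * step] for i in range(m)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def encode(vec, codebooks):
    """PQ-encode a vector: per sub-vector, the index of the nearest centroid."""
    codes = []
    for sub, book in zip(split(vec, len(codebooks)), codebooks):
        dists = [sum((s - c) ** 2 for s, c in zip(sub, cen)) for cen in book]
        codes.append(dists.index(min(dists)))
    return codes


def decode(codes, codebooks):
    """Reconstruct the quantized vector from its codes."""
    out = []
    for code, book in zip(codes, codebooks):
        out.extend(book[code])
    return out


def build_tables(query_cells, codebooks):
    """tables[cell][j][k] = dot(sub-vector j of that query cell, centroid k of codebook j)."""
    m = len(codebooks)
    return [[[dot(sub, cen) for cen in book]
             for sub, book in zip(split(q, m), codebooks)]
            for q in query_cells]


def sliding_window_scores(doc_codes, tables, qh, qw):
    """Score every window position using table lookups only."""
    rows, cols = len(doc_codes), len(doc_codes[0])
    scores = {}
    for r in range(rows - qh + 1):
        for c in range(cols - qw + 1):
            total = 0.0
            for i in range(qh):
                for j in range(qw):
                    table = tables[i * qw + j]
                    codes = doc_codes[r + i][c + j]
                    total += sum(table[k][code] for k, code in enumerate(codes))
            scores[(r, c)] = total
    return scores


# Offline step: codebooks and the compressed document grid.
codebooks = [[rand_vec(D // M) for _ in range(K)] for _ in range(M)]
doc_codes = [[encode(rand_vec(D), codebooks) for _ in range(W)] for _ in range(H)]

# Query time: one weight vector per query cell (random here), then lookups.
query_cells = [rand_vec(D) for _ in range(QH * QW)]
tables = build_tables(query_cells, codebooks)
scores = sliding_window_scores(doc_codes, tables, QH, QW)
best = max(scores, key=scores.get)
print("best window (row, col):", best)
```

Each document cell here is stored as `M` small integers instead of `D` floats, which is what lets many documents fit in memory. The lookup tables are built once per query and depend only on the query and the codebooks, never on the number of documents.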
Pages: 3967-3978
Number of pages: 12
Related papers
50 in total
  • [21] Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature
    Sugata Das
    Sekhar Mandal
    [J]. Pattern Analysis and Applications, 2020, 23 : 593 - 610
  • [22] Efficient segmentation-free keyword spotting in historical document collections
    Rusinol, Marcal
    Aldavert, David
    Toledo, Ricardo
    Llados, Josep
    [J]. PATTERN RECOGNITION, 2015, 48 (02) : 545 - 555
  • [23] Efficient Exemplar Word Spotting
    Almazan, Jon
    Gordo, Albert
    Fornes, Alicia
    Valveny, Ernest
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [24] A segmentation free Word Spotting for handwritten documents
    Ghorbel, Adam
    Ogier, Jean-Marc
    Vincent, Nicole
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 346 - 350
  • [25] An Application-Independent and Segmentation-Free Approach for Spotting Queries in Document Images
    Chatbri, Houssem
    Kwan, Paul
    Kameyama, Keisuke
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2891 - 2896
  • [26] Segmentation-free Keyword Spotting for Handwritten Documents based on Heat Kernel Signature
    Zhang, Xi
    Tan, Chew Lim
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 827 - 831
  • [27] Online Handwritten Cursive Word Recognition Using Segmentation-free and Segmentation-based Methods
    Zhu, Bilan
    Shivram, Arti
    Govindaraju, Venu
    Nakagawa, Masaki
    [J]. Proceedings 3rd IAPR Asian Conference on Pattern Recognition ACPR 2015, 2015, : 161 - 165
  • [28] Online handwritten cursive word recognition by combining segmentation-free and segmentation-based methods
    Zhu, Bilan
    Shivram, Arti
    Govindaraju, Venu
    Nakagawa, Masaki
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 417 - 422
  • [29] Segmentation-Free Dynamic Scene Deblurring
    Kim, Tae Hyun
    Lee, Kyoung Mu
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2766 - 2773
  • [30] Neural Ctrl-F: Segmentation-free Query-by-String Word Spotting in Handwritten Manuscript Collections
    Wilkinson, Tomas
    Lindstrom, Jonas
    Brun, Anders
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4443 - 4452