Segmentation-free word spotting with exemplar SVMs

被引:45
|
作者
Almazan, Jon [1 ]
Gordo, Albert [2 ]
Fornes, Alicia [1 ]
Valveny, Ernest [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, Bellaterra 08193, Barcelona, Spain
[2] INRIA Grenoble, Rhone Alpes Res Ctr, F-38330 Montbonnot St Martin, France
关键词
Word spotting; Segmentation-free; Unsupervised learning; Reranking; Query expansion; Compression; RECOGNITION; MODEL;
D O I
10.1016/j.patcog.2014.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vector to rerank the best regions retrieved, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3967 / 3978
页数:12
相关论文
共 50 条
  • [1] Segmentation-free Word Spotting for Handwritten Arabic Documents
    Khaissidi, G.
    Elfakir, Y.
    Mrabti, M.
    Lakhliai, Z.
    Chenouni, D.
    El Yacoubi, M.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (01): : 6 - 10
  • [2] Word Hypotheses for Segmentation-free Word Spotting in Historic Document Images
    Rothacker, Leonard
    Sudholt, Sebastian
    Rusakov, Eugen
    Kasperidus, Matthias
    Fink, Gernot A.
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1174 - 1179
  • [3] A segmentation-free word spotting method for historical printed documents
    Konidaris, Thomas
    Kesidis, Anastasios L.
    Gatos, Basilis
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 963 - 976
  • [4] Onmilingual segmentation-free word spotting for ancient manuscripts indexation
    Leydier, Y
    Le Bourgeois, F
    Emptoz, H
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 533 - 537
  • [5] A segmentation-free word spotting method for historical printed documents
    Thomas Konidaris
    Anastasios L. Kesidis
    Basilis Gatos
    [J]. Pattern Analysis and Applications, 2016, 19 : 963 - 976
  • [6] On Evaluation of Segmentation-Free Word Spotting Approaches Without Hard Decisions
    Pantke, Werner
    Maergner, Volker
    Fingscheidt, Tim
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1300 - 1304
  • [7] Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document
    Das, Sugata
    Mandal, Sekhar
    [J]. 2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 76 - 81
  • [8] A Segmentation-free Handwritten Word Spotting Approach by Relaxed Feature Matching
    Hast, Anders
    Fornes, Alicia
    [J]. PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 150 - 155
  • [9] R-PHOC: Segmentation-Free Word Spotting using CNN
    Ghosh, Suman K.
    Valveny, Ernest
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 801 - 806
  • [10] Word Spotting as a Service: An Unsupervised and Segmentation-Free Framework for Handwritten Documents
    Zagoris, Konstantinos
    Amanatiadis, Angelos
    Pratikakis, Ioannis
    [J]. JOURNAL OF IMAGING, 2021, 7 (12)