Word Spotting in Historical Document Collections with Online-Handwritten Queries

被引:1
|
作者
Wieprecht, Christian [1 ]
Rothacker, Leonard [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
word spotting; pen-based systems; online handwriting representations; common subspaces;
D O I
10.1109/DAS.2016.41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pen-based systems are becoming more and more important due to the growing availability of touch sensitive devices in various forms and sizes. Their interfaces offer the possibility to directly interact with a system by natural handwriting. In contrast to other input modalities it is not required to switch to special modes, like software-keyboards. In this paper we propose a new method for querying digital archives of historical documents. Word images are retrieved with respect to search terms that users write on a pen-based system by hand. The captured trajectory is used as a query which we call query-by-online-trajectory word spotting. By using attribute embeddings for both online-trajectory and visual features, word images are retrieved based on their distance to the query in a common subspace. The system is therefore robust, as no explicit transcription for queries or word images is required. We evaluate our approach for writer-dependent as well as writer-independent scenarios, where we present highly accurate retrieval results in the former and compelling retrieval results in the latter case. Our performance is very competitive in comparison to related methods from the literature.
引用
收藏
页码:162 / 167
页数:6
相关论文
共 50 条
  • [41] Improving Handwritten Word Synthesis for Annotation-free Word Spotting
    Wolf, Fabian
    Brandenbusch, Kai
    Fink, Gernot A.
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 61 - 66
  • [42] Synthesizing queries for handwritten word image retrieval
    Rodriguez-Serrano, Jose A.
    Perronnin, Florent
    PATTERN RECOGNITION, 2012, 45 (09) : 3270 - 3276
  • [43] Deep Features Representation of Word Image for Keyword Spotting in Historical Mongolian Document Images
    Wei, Hongxi
    Zhang, Jing
    Zhang, Hui
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 413 - 417
  • [44] Unsupervised writer style adaptation for handwritten word spotting
    Rodriguez, Jose A.
    Perronnin, Florent
    Sanchez, Gemma
    Llados, Josep
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3039 - 3042
  • [45] Handwritten Word Spotting Based on A Hybrid Optimal Distance
    Wang, Peng
    Eglin, Veronique
    Largeron, Christine
    Garcia, Christophe
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2580 - 2584
  • [46] Word Spotting and Regular Expression Detection in Handwritten Documents
    Kessentini, Yousri
    Chatelain, Clement
    Paquet, Thierry
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 516 - 520
  • [47] An Historical Handwritten Arabic Dataset for Segmentation-Free Word Spotting-HADARA80P
    Pantke, Werner
    Dennhardt, Martin
    Fecker, Daniel
    Maergner, Volker
    Fingscheidt, Tim
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 15 - 20
  • [48] Features for word spotting in historical manuscripts
    Rath, TM
    Manmatha, R
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 218 - 222
  • [49] Spotting relevant information in extremely large document collections
    Kohonen, T
    COMPUTATIONAL INTELLIGENCE: THEORY AND APPLICATIONS, 1999, 1625 : 59 - 61
  • [50] A survey of document image word spotting techniques
    Giotis, Angelos P.
    Sfikas, Giorgos
    Gatos, Basilis
    Nikou, Christophoros
    PATTERN RECOGNITION, 2017, 68 : 310 - 332