Word Spotting in Historical Document Collections with Online-Handwritten Queries

被引:1
|
作者
Wieprecht, Christian [1 ]
Rothacker, Leonard [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
word spotting; pen-based systems; online handwriting representations; common subspaces;
D O I
10.1109/DAS.2016.41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pen-based systems are becoming more and more important due to the growing availability of touch sensitive devices in various forms and sizes. Their interfaces offer the possibility to directly interact with a system by natural handwriting. In contrast to other input modalities it is not required to switch to special modes, like software-keyboards. In this paper we propose a new method for querying digital archives of historical documents. Word images are retrieved with respect to search terms that users write on a pen-based system by hand. The captured trajectory is used as a query which we call query-by-online-trajectory word spotting. By using attribute embeddings for both online-trajectory and visual features, word images are retrieved based on their distance to the query in a common subspace. The system is therefore robust, as no explicit transcription for queries or word images is required. We evaluate our approach for writer-dependent as well as writer-independent scenarios, where we present highly accurate retrieval results in the former and compelling retrieval results in the latter case. Our performance is very competitive in comparison to related methods from the literature.
引用
收藏
页码:162 / 167
页数:6
相关论文
共 50 条
  • [21] Ridgelet-DTW-Based Word Spotting for Arabic Historical Document
    Brik, Youcef
    Chibani, Youcef
    Zemouri, Et-Tahir
    Sehad, Abdenour
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 194 - +
  • [22] SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents
    Zhong, Zhuoyao
    Pan, Weishen
    Jin, Lianwen
    Mouchere, Harold
    Viard-Gaudin, Christian
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 295 - 300
  • [23] Historical Handwritten Text Images Word Spotting Through Sliding Window HOG Features
    Bolelli, Federico
    Borghi, Guido
    Grana, Costantino
    IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 729 - 738
  • [24] Multilingual Word Spotting in Offline Handwritten Documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 310 - 313
  • [25] Attribute CNNs for word spotting in handwritten documents
    Sebastian Sudholt
    Gernot A. Fink
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 199 - 218
  • [26] A segmentation free Word Spotting for handwritten documents
    Ghorbel, Adam
    Ogier, Lean-Marc
    Vincent, Nicole
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 346 - 350
  • [27] Attribute CNNs for word spotting in handwritten documents
    Sudholt, Sebastian
    Fink, Gernot A.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (03) : 199 - 218
  • [28] Asking questions on handwritten document collections
    Mathew, Minesh
    Gomez, Lluis
    Karatzas, Dimosthenis
    Jawahar, C., V
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (03) : 235 - 249
  • [29] Asking questions on handwritten document collections
    Minesh Mathew
    Lluis Gomez
    Dimosthenis Karatzas
    C. V. Jawahar
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 235 - 249
  • [30] Word spotting for historical documents
    Tony M. Rath
    R. Manmatha
    International Journal of Document Analysis and Recognition (IJDAR), 2007, 9 : 139 - 152