Word Spotting in Historical Document Collections with Online-Handwritten Queries

被引:1
|
作者
Wieprecht, Christian [1 ]
Rothacker, Leonard [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
word spotting; pen-based systems; online handwriting representations; common subspaces;
D O I
10.1109/DAS.2016.41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pen-based systems are becoming more and more important due to the growing availability of touch sensitive devices in various forms and sizes. Their interfaces offer the possibility to directly interact with a system by natural handwriting. In contrast to other input modalities it is not required to switch to special modes, like software-keyboards. In this paper we propose a new method for querying digital archives of historical documents. Word images are retrieved with respect to search terms that users write on a pen-based system by hand. The captured trajectory is used as a query which we call query-by-online-trajectory word spotting. By using attribute embeddings for both online-trajectory and visual features, word images are retrieved based on their distance to the query in a common subspace. The system is therefore robust, as no explicit transcription for queries or word images is required. We evaluate our approach for writer-dependent as well as writer-independent scenarios, where we present highly accurate retrieval results in the former and compelling retrieval results in the latter case. Our performance is very competitive in comparison to related methods from the literature.
引用
收藏
页码:162 / 167
页数:6
相关论文
共 50 条
  • [1] Local Binary Pattern for Word Spotting in Handwritten Historical Document
    Dey, Sounak
    Nicolaou, Anguelos
    Llados, Josep
    Pal, Umapada
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 574 - 583
  • [2] Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document
    Das, Sugata
    Mandal, Sekhar
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 76 - 81
  • [3] Sequential Word Spotting in Historical Handwritten Documents
    Fernandez-Mota, David
    Llados, Josep
    Fornes, Alicia
    Manmatha, R.
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 101 - 105
  • [4] Unsupervised Word Spotting in Historical Handwritten Document Images Using Document-Oriented Local Features
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    Gatos, Basilis
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) : 4032 - 4041
  • [5] ON THE INFLUENCE OF WORD REPRESENTATIONS FOR HANDWRITTEN WORD SPOTTING IN HISTORICAL DOCUMENTS
    Llados, Josep
    Rusinol, Marcal
    Fornes, Alicia
    Fernandez, David
    Dutta, Anjan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)
  • [6] Shape-based Word Spotting in Handwritten Document Images
    Giotis, Angelos P.
    Sfikas, Giorgos
    Nikou, Christophoros
    Gatos, Basilis
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 561 - 565
  • [7] Learning-free handwritten word spotting method for historical handwritten documents
    Mohammed, Hanadi Hassen
    Subramanian, Nandhini
    Al-Madeed, Somaya
    IET IMAGE PROCESSING, 2021, 15 (10) : 2332 - 2341
  • [8] Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature
    Das, Sugata
    Mandal, Sekhar
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (02) : 593 - 610
  • [9] Segmentation-based Historical Handwritten Word Spotting using Document-Specific Local Features
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    Gatos, Basil. Is
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 9 - 14
  • [10] Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature
    Sugata Das
    Sekhar Mandal
    Pattern Analysis and Applications, 2020, 23 : 593 - 610