Hierarchical representation learning using spherical k-means for segmentation-free word spotting

被引:6
|
作者
Mhiri, Mohamed [1 ]
Abuelwafa, Sherif [1 ]
Desrosiers, Christian [1 ]
Cheriet, Mohamed [1 ]
机构
[1] Ecole Technol Super, Synchromedia Lab Multimedia Commun Telepresence, Montreal, PQ H3C 1K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Segmentation-free and training-free word spotting; Handwriting representation; Representation learning; VLAD representation;
D O I
10.1016/j.patrec.2017.11.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic segmentation-free and training-free word spotting is a challenging task due to the large intra-class variability of handwritten shapes and the need to process the whole document image. In this work, a novel unsupervised hierarchical handwriting representation is introduced, where the spherical k-means algorithm is used to learn a hierarchy of features for representing document images. A matching system is then employed for word spotting, which consists of two stages: (1) a fast pre-selection stage applying a sliding-window approach over compressed document image representations, and (2) a re-ranking stage based on a discriminative description that encodes the spatial layout of local features. The proposed approach is evaluated using three well-known benchmark datasets, the Lord Byron (LB), the George Washington (GW) and the IAM datasets. Results show our method to yield competitive performance compared to state-of-the-art approaches for segmentation-free and training-free word spotting. In addition, since our proposed framework has a low computational and memory complexity, it can be applied to large datasets. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:52 / 59
页数:8
相关论文
共 50 条
  • [1] Unsupervised Pathology Image Segmentation Using Representation Learning with Spherical K-means
    Moriya, Takayasu
    Roth, Holger H.
    Nakamura, Shota
    Oda, Hirohisa
    Nagara, Kai
    Oda, Masahiro
    Mori, Kensaku
    [J]. MEDICAL IMAGING 2018: DIGITAL PATHOLOGY, 2018, 10581
  • [2] Segmentation-free word spotting with exemplar SVMs
    Almazan, Jon
    Gordo, Albert
    Fornes, Alicia
    Valveny, Ernest
    [J]. PATTERN RECOGNITION, 2014, 47 (12) : 3967 - 3978
  • [3] R-PHOC: Segmentation-Free Word Spotting using CNN
    Ghosh, Suman K.
    Valveny, Ernest
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 801 - 806
  • [4] Segmentation-free Word Spotting for Handwritten Arabic Documents
    Khaissidi, G.
    Elfakir, Y.
    Mrabti, M.
    Lakhliai, Z.
    Chenouni, D.
    El Yacoubi, M.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (01): : 6 - 10
  • [5] Word Hypotheses for Segmentation-free Word Spotting in Historic Document Images
    Rothacker, Leonard
    Sudholt, Sebastian
    Rusakov, Eugen
    Kasperidus, Matthias
    Fink, Gernot A.
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1174 - 1179
  • [6] A segmentation-free word spotting method for historical printed documents
    Konidaris, Thomas
    Kesidis, Anastasios L.
    Gatos, Basilis
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 963 - 976
  • [7] Onmilingual segmentation-free word spotting for ancient manuscripts indexation
    Leydier, Y
    Le Bourgeois, F
    Emptoz, H
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 533 - 537
  • [8] A segmentation-free word spotting method for historical printed documents
    Thomas Konidaris
    Anastasios L. Kesidis
    Basilis Gatos
    [J]. Pattern Analysis and Applications, 2016, 19 : 963 - 976
  • [9] Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document
    Das, Sugata
    Mandal, Sekhar
    [J]. 2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 76 - 81
  • [10] On Evaluation of Segmentation-Free Word Spotting Approaches Without Hard Decisions
    Pantke, Werner
    Maergner, Volker
    Fingscheidt, Tim
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1300 - 1304