A segmentation-free word spotting method for historical printed documents

被引:7
|
作者
Konidaris, Thomas [1 ]
Kesidis, Anastasios L. [2 ]
Gatos, Basilis [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Computat Intelligence Lab, Patriarchou Grigoriou St, Athens 15310, Greece
[2] Technol Educ Inst Athens, Dept Surveying Engn, Athens 12210, Greece
关键词
Segmentation-free; Word spotting; Historical documents; RETRIEVAL; ALIGNMENT;
D O I
10.1007/s10044-015-0476-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a two-step segmentation-free word spotting method for historical printed documents is presented. The first step involves a minimum distance matching between a query keyword image and a document page image using keypoint correspondences. In the second step of the method, the matched keypoints on the document image serve as indicators for creating candidate image areas. The query keyword image is matched against the candidate image areas in order to properly estimate the bounding boxes of the detected word instances. The method is evaluated using two datasets of different languages and is compared against segmentation-free state-of-the-art methods. The experimental results show that the proposed method outperforms significantly the competitive approaches.
引用
收藏
页码:963 / 976
页数:14
相关论文
共 50 条
  • [1] A segmentation-free word spotting method for historical printed documents
    Thomas Konidaris
    Anastasios L. Kesidis
    Basilis Gatos
    [J]. Pattern Analysis and Applications, 2016, 19 : 963 - 976
  • [2] Segmentation-free Word Spotting for Handwritten Arabic Documents
    Khaissidi, G.
    Elfakir, Y.
    Mrabti, M.
    Lakhliai, Z.
    Chenouni, D.
    El Yacoubi, M.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (01): : 6 - 10
  • [3] Word Spotting as a Service: An Unsupervised and Segmentation-Free Framework for Handwritten Documents
    Zagoris, Konstantinos
    Amanatiadis, Angelos
    Pratikakis, Ioannis
    [J]. JOURNAL OF IMAGING, 2021, 7 (12)
  • [4] Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document
    Das, Sugata
    Mandal, Sekhar
    [J]. 2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 76 - 81
  • [5] Bag-of-Features HMMs for Segmentation-free Word Spotting in Handwritten Documents
    Rothacker, Leonard
    Rusinol, Marcal
    Fink, Gernot A.
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1305 - 1309
  • [6] Segmentation-free word spotting with exemplar SVMs
    Almazan, Jon
    Gordo, Albert
    Fornes, Alicia
    Valveny, Ernest
    [J]. PATTERN RECOGNITION, 2014, 47 (12) : 3967 - 3978
  • [7] Browsing Heterogeneous Document Collections by a Segmentation-free Word Spotting Method
    Rusinol, Marcal
    Aldavert, David
    Toledo, Ricardo
    Llados, Josep
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 63 - 67
  • [8] Segmentation-free Keyword Spotting for Bangla Handwritten Documents
    Zhang, Xi
    Pal, Umapada
    Tan, Chew Lim
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 381 - 386
  • [9] OCR-independent and Segmentation-free Word-Spotting in Handwritten Arabic Archive Documents
    Aouadi, N.
    Kacem, A.
    [J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 36 - 41
  • [10] Segmentation-free pattern spotting in historical document images
    En, Sovann
    Petitjean, Caroline
    Nicolas, Stephane
    Heutte, Laurent
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 606 - 610