Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document

被引:0
|
作者
Das, Sugata [1 ]
Mandal, Sekhar [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur, India
关键词
segmentation-free word spotting; SIFT keypoint detector; HOG features; Normalized Cross Correlation; Cosine distance; RETRIEVAL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content-Based Image Retrieval (CBIR) for historical handwritten documents is more challenging due to the large variety of writing style and degradation of historical manuscripts due to ageing. In this paper, we propose a segmentation-free word spotting method for historical handwritten binarized documents. The query word and the document image are converted into gray-scale images using distance transform followed by Gaussian smoothing. SIFT detector is used to locate the keypoints in both the query word and the document image. Histogram of Oriented Gradient (HOG) feature vector is used to describe each keypoint. We use an efficient search technique which calculates distance between query-word and the word (or part of a word) present in document image to spot the zone of interest in the document. The proposed method is tested on three historical handwritten Bengali data-sets and one historical English handwritten data-set. The performance is measured using standard evaluation metric which shows the efficiency of the proposed method.
引用
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [1] Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature
    Das, Sugata
    Mandal, Sekhar
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (02) : 593 - 610
  • [2] Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature
    Sugata Das
    Sekhar Mandal
    [J]. Pattern Analysis and Applications, 2020, 23 : 593 - 610
  • [3] Segmentation-free Keyword Spotting for Bangla Handwritten Documents
    Zhang, Xi
    Pal, Umapada
    Tan, Chew Lim
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 381 - 386
  • [4] Segmentation-free Word Spotting for Handwritten Arabic Documents
    Khaissidi, G.
    Elfakir, Y.
    Mrabti, M.
    Lakhliai, Z.
    Chenouni, D.
    El Yacoubi, M.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (01): : 6 - 10
  • [5] Segmentation-free pattern spotting in historical document images
    En, Sovann
    Petitjean, Caroline
    Nicolas, Stephane
    Heutte, Laurent
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 606 - 610
  • [6] An Historical Handwritten Arabic Dataset for Segmentation-Free Word Spotting-HADARA80P
    Pantke, Werner
    Dennhardt, Martin
    Fecker, Daniel
    Maergner, Volker
    Fingscheidt, Tim
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 15 - 20
  • [7] A Segmentation-free Handwritten Word Spotting Approach by Relaxed Feature Matching
    Hast, Anders
    Fornes, Alicia
    [J]. PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 150 - 155
  • [8] Word Hypotheses for Segmentation-free Word Spotting in Historic Document Images
    Rothacker, Leonard
    Sudholt, Sebastian
    Rusakov, Eugen
    Kasperidus, Matthias
    Fink, Gernot A.
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1174 - 1179
  • [9] Word Spotting as a Service: An Unsupervised and Segmentation-Free Framework for Handwritten Documents
    Zagoris, Konstantinos
    Amanatiadis, Angelos
    Pratikakis, Ioannis
    [J]. JOURNAL OF IMAGING, 2021, 7 (12)
  • [10] A segmentation-free word spotting method for historical printed documents
    Konidaris, Thomas
    Kesidis, Anastasios L.
    Gatos, Basilis
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 963 - 976