Segmentation-free Word Spotting in Historical Bangla Handwritten Binarized Document

被引:0
|
作者
Das, Sugata [1 ]
Mandal, Sekhar [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur, India
关键词
segmentation-free word spotting; SIFT keypoint detector; HOG features; Normalized Cross Correlation; Cosine distance; RETRIEVAL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content-Based Image Retrieval (CBIR) for historical handwritten documents is more challenging due to the large variety of writing style and degradation of historical manuscripts due to ageing. In this paper, we propose a segmentation-free word spotting method for historical handwritten binarized documents. The query word and the document image are converted into gray-scale images using distance transform followed by Gaussian smoothing. SIFT detector is used to locate the keypoints in both the query word and the document image. Histogram of Oriented Gradient (HOG) feature vector is used to describe each keypoint. We use an efficient search technique which calculates distance between query-word and the word (or part of a word) present in document image to spot the zone of interest in the document. The proposed method is tested on three historical handwritten Bengali data-sets and one historical English handwritten data-set. The performance is measured using standard evaluation metric which shows the efficiency of the proposed method.
引用
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [41] Bootstrapping Weakly Supervised Segmentation-free Word Spotting through HMM-based Alignment
    Wilkinson, Tomas
    Nettelblad, Carl
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 49 - 54
  • [42] Hierarchical representation learning using spherical k-means for segmentation-free word spotting
    Mhiri, Mohamed
    Abuelwafa, Sherif
    Desrosiers, Christian
    Cheriet, Mohamed
    [J]. PATTERN RECOGNITION LETTERS, 2018, 101 : 52 - 59
  • [43] Shape-based Word Spotting in Handwritten Document Images
    Giotis, Angelos P.
    Sfikas, Giorgos
    Nikou, Christophoros
    Gatos, Basilis
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 561 - 565
  • [44] Segmentation-Free Approaches for Handwritten Numeral String Recognition
    Hochuli, Andre G.
    Oliveira, Luiz S.
    Britto Jr, Alceu de Souza
    Sabourin, Robert
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [45] Fusion of explicit segmentation based system and segmentation-free based system for on-line Arabic handwritten word recognition
    Khlif, Hanen
    Prum, Sophea
    Kessentini, Yousri
    Kanoun, Slim
    Ogier, Jean Marc
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 399 - 404
  • [46] Handwritten word recognition using segmentation-free hidden Markov modeling and segmentation-based dynamic programming techniques
    Mohamed, M
    Gader, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (05) : 548 - 554
  • [47] Improving Handwritten Word Synthesis for Annotation-free Word Spotting
    Wolf, Fabian
    Brandenbusch, Kai
    Fink, Gernot A.
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 61 - 66
  • [48] A voting-based technique for word spotting in handwritten document images
    Shamik Majumder
    Subhrangshu Ghosh
    Samir Malakar
    Ram Sarkar
    Mita Nasipuri
    [J]. Multimedia Tools and Applications, 2021, 80 : 12411 - 12434
  • [49] A voting-based technique for word spotting in handwritten document images
    Majumder, Shamik
    Ghosh, Subhrangshu
    Malakar, Samir
    Sarkar, Ram
    Nasipuri, Mita
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12411 - 12434
  • [50] Word Spotting Application in Historical Mongolian Document Images
    Wei, Hongxi
    Gao, Guanglai
    [J]. INTELLIGENT COMPUTING THEORIES, 2013, 7995 : 265 - 274