Enabling Indexing and Retrieval of Historical Arabic Manuscripts through Template Matching Based Word Spotting

Cited by: 0
Authors
Faisal, Tayyeba [1 ]
AlMaadeed, Somaya [1 ]
Affiliations
[1] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar
Keywords
word spotting; template matching; correlation similarity; historical; Arabic; handwritten documents; system
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
We present a holistic, segmentation-free, query-by-example word spotting technique based on template matching, and apply it to a dataset of historical Arabic handwritten manuscript images. First, the document images and the query word images are pre-processed to separate text from the noisy background and to convert them to binary form. A pixel-based approach then computes the similarity between the pre-processed query word template and the document images using the correlation similarity measure. Slight variations in font size are tolerated by adjusting the similarity threshold. Our robust pre-processing algorithm significantly enhances the performance of the learning-free, template-matching-based word spotting approach. The proposed technique is simple and efficient, as it involves no time-consuming learning steps. Experiments on a historical Arabic dataset yield promising results. The technique can generate the locations at which query word images occur, which is a fundamental step towards building searchable indexes for historical manuscripts.
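The pipeline described in the abstract (binarize, slide the query template over the page, score each position by correlation, keep positions above a threshold) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the pre-processing algorithm or the threshold values, so the global threshold in `binarize`, the Pearson correlation in `correlation`, and the names `binarize`, `correlation`, `spot_word`, and `min_score` are all assumptions made for illustration.

```python
from math import sqrt

def binarize(gray, threshold=128):
    """Assumed stand-in for the paper's pre-processing: a global
    threshold where dark (ink) pixels become 1 and background 0."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def correlation(a, b):
    """Pearson correlation between two equal-length pixel sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) * (x - mean_a) for x in a)
    var_b = sum((y - mean_b) * (y - mean_b) for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # flat patch: correlation undefined, treat as no match
    return cov / sqrt(var_a * var_b)

def spot_word(page, template, min_score=0.8):
    """Slide the binary template over the binary page and return a list
    of (row, col, score) for every window scoring at least min_score."""
    th, tw = len(template), len(template[0])
    flat_t = [px for row in template for px in row]
    hits = []
    for r in range(len(page) - th + 1):
        for c in range(len(page[0]) - tw + 1):
            window = [page[r + i][c + j]
                      for i in range(th) for j in range(tw)]
            score = correlation(flat_t, window)
            if score >= min_score:
                hits.append((r, c, score))
    return hits

# Toy usage: a 3x3 "word" pasted into an otherwise blank 6x8 page.
gray_template = [[0, 255, 0], [255, 0, 255], [0, 255, 0]]
gray_page = [[255] * 8 for _ in range(6)]
for i in range(3):
    for j in range(3):
        gray_page[2 + i][3 + j] = gray_template[i][j]

hits = spot_word(binarize(gray_page), binarize(gray_template), min_score=0.99)
```

Lowering `min_score` admits windows that match the template only approximately, which is one way to read the abstract's remark that slight font-size variations are tolerated by adjusting the similarity threshold. The exhaustive pure-Python scan is only suitable for toy inputs; a practical system would likely use an optimized correlation routine.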
Pages: 57-63
Page count: 7
Related papers
50 in total
  • [1] Keyword Spotting in Historical Devanagari Manuscripts by Word Matching
    Sharada, B.
    Sushma, S. N.
    Bharathlal
    DATA ANALYTICS AND LEARNING, 2019, 43 : 65 - 76
  • [2] Features for word spotting in historical manuscripts
    Rath, TM
    Manmatha, R
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 218 - 222
  • [3] Prior Segmentation of Old Arabic Manuscripts by Separator Word Spotting
    Aouadi, Nabil
    Echi, Afef Kacem
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 31 - 36
  • [4] Word Retrieval System for Ancient Arabic Manuscripts
    Al-Maadeed, Somaya
    Issawi, Fatima
    Bouridan, Ahmed
    2017 9TH IEEE-GCC CONFERENCE AND EXHIBITION (GCCCE), 2018, : 412 - 416
  • [5] Ridgelet-DTW-Based Word Spotting for Arabic Historical Document
    Brik, Youcef
    Chibani, Youcef
    Zemouri, Et-Tahir
    Sehad, Abdenour
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 194 - +
  • [6] Template-free word spotting in low-quality manuscripts
    Cao, Huaigu
    Govindaraju, Venu
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 135 - +
  • [7] Indexing and Retrieval of Malayalam News Videos Based on Word Image Matching
    Gangan, Manjary P.
    Anoop, K.
    Lajish, V. L.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1103 - 1108
  • [8] Automatic Synthesis of Historical Arabic Text for Word-Spotting
    Kassis, Majeed
    El-Sana, Jihad
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 239 - 244
  • [9] Multi-word term indexing for Arabic document retrieval
    Boulaknadel, Siham
    Daille, Beatrice
    Driss, Aboutajdine
    2008 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1-3, 2008, : 480 - +
  • [10] Keyword Spotting based on the Analysis of Template Matching Distances
    Barakat, M. S.
    Ritz, C. H.
    Stirling, D. A.
    5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, ICSPCS'2011, 2011,