Word Spotting for Handwritten Documents using Chamfer Distance and Dynamic Time Warping

被引:7
|
作者
Saabni, Raid M. [1 ,2 ]
El-Sana, Jihad A. [2 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
[2] Triangle Res & Dev Ctr, IL-30075 Kafr Qarea, Israel
来源
关键词
Word Spotting; Handwriting Recognition; Dynamic Time Warping; Chamfer Distance;
D O I
10.1117/12.873392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large amount of handwritten historical documents are located in libraries around the world. The desire to access, search, and explore these documents paves the way for a new age of knowledge sharing and promotes collaboration and understanding between human societies. Currently, the indexes for these documents are generated manually, which is very tedious and time consuming. Results produced by state of the art techniques, for converting complete images of handwritten documents into textual representations, are not yet sufficient. Therefore, word-spotting methods have been developed to archive and index images of handwritten documents in order to enable efficient searching within documents. In this paper, we present a new matching algorithm to be used in word-spotting tasks for historical Arabic documents. We present a novel algorithm based on the Chamfer Distance to compute the similarity between shapes of word-parts. Matching results are used to cluster images of Arabic word-parts into different classes using the Nearest Neighbor rule. To compute the distance between two word-part images, the algorithm subdivides each image into equal-sized slices (windows). A modified version of the Chamfer Distance, incorporating geometric gradient features and distance transform data, is used as a similarity distance between the different slices. Finally, the Dynamic Time Warping (DTW) algorithm is used to measure the distance between two images of word-parts. By using the DTW we enabled our system to cluster similar word-parts, even though they are transformed non-linearly due to the nature of handwriting. We tested our implementation of the presented methods using various documents in different writing styles, taken from Juma'a Al Majid Center - Dubai, and obtained encouraging results.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] HoG based Two-Directional Dynamic Time Warping for Handwritten Word Spotting
    Yao, Shunyi
    Wen, Ying
    Lu, Yue
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 161 - 165
  • [2] Word Spotting as a Service for Handwritten Documents
    Amanatiadis, Angelos
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [3] A Survey on handwritten documents word spotting
    Ahmed R.
    Al-Khatib W.G.
    Mahmoud S.
    [J]. International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 31 - 47
  • [4] An overview on handwritten documents word spotting
    Boualam, Manal
    Khaissidi, Ghizlane
    Mrabti, Mostafa
    Elfakir, Youssef
    [J]. 2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [5] An Adaptive Zoning Technique for Word Spotting Using Dynamic Time Warping
    Papandreou, A.
    Gatos, B.
    Zagoris, K.
    [J]. PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 387 - 392
  • [6] Word spotting for Handwritten Arabic documents using Harris detector
    Elfakiri, Youssef
    Chenouni, Driss
    Khaissidi, Ghizlane
    El Yacoubi, Mounim
    Mrabti, Mostafa
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [7] Multilingual Word Spotting in Offline Handwritten Documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 310 - 313
  • [8] Attribute CNNs for word spotting in handwritten documents
    Sebastian Sudholt
    Gernot A. Fink
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 199 - 218
  • [9] A segmentation free Word Spotting for handwritten documents
    Ghorbel, Adam
    Ogier, Lean-Marc
    Vincent, Nicole
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 346 - 350
  • [10] ON THE USE OF DYNAMIC TIME WARPING FOR WORD SPOTTING AND CONNECTED WORD RECOGNITION
    MYERS, CS
    RABINER, LR
    ROSENBERG, AE
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1981, 60 (03): : 303 - 325