Word matching using single closed contours for indexing handwritten historical documents

被引:50
|
作者
Adamek, Tornasz [1 ]
O'Connor, Noel E. [1 ]
Smeaton, Alan F. [1 ]
机构
[1] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland
关键词
historical manuscripts; holistic word recognition; contour matching; annotation indexing;
D O I
10.1007/s10032-006-0024-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective indexing is crucial for providing convenient access to scanned versions of large collections of historically valuable handwritten manuscripts. Since traditional handwriting recognizers based on optical character recognition (OCR) do not perform well on historical documents, recently a holistic word recognition approach has gained in popularity as an attractive and more straightforward solution (Lavrenko et al. in proc. document Image Analysis for Libraries (DIAL'04), pp. 278-287,2004). Such techniques attempt to recognize words based on scalar and profile-based features extracted from whole word images. In this paper, we propose a new approach to holistic word recognition for historical handwritten manuscripts based on matching word contours instead of whole images or word profiles. The new method consists of robust extraction of closed word contours and the application of an elastic contour matching technique proposed originally for general shapes (Adamek and O'Connor in IEEE Trans Circuits Syst Video Technol 5: 2004). We demonstrate that multiscale contour-based descriptors can effectively capture intrinsic word features avoiding any segmentation of words into smaller subunits. Our experiments show a recognition accuracy of 83%, which considerably exceeds the performance of other systems reported in the literature.
引用
收藏
页码:153 / 165
页数:13
相关论文
共 50 条
  • [21] Lanna Handwritten Character Recognition on Historical Documents Using Feature Extraction
    Khankasikam, Krisda
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2553 - 2560
  • [22] Page Segmentation for Historical Handwritten Documents Using Fully Convolutional Networks
    Xu, Yue
    He, Wenhao
    Yin, Fei
    Liu, Cheng-Lin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 541 - 546
  • [23] Word Spotting for Handwritten Documents using Chamfer Distance and Dynamic Time Warping
    Saabni, Raid M.
    El-Sana, Jihad A.
    DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [24] Keyword spotting in unconstrained handwritten Chinese documents using contextual word model
    Huang, Liang
    Yin, Fei
    Chen, Qing-Hu
    Liu, Cheng-Lin
    IMAGE AND VISION COMPUTING, 2013, 31 (12) : 958 - 968
  • [25] Word Spotting in Cursive Handwritten Documents Using Modified Character Shape Codes
    Sarkar, Sayantan
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 : 269 - 278
  • [26] Word segmentation of handwritten dates in historical documents by combining semantic a-priori-knowledge with local features
    Feldbach, Markus
    Tönnies, Klaus D.
    Proc. Int. Conf. Doc. Anal. Recognit., (333-337):
  • [27] Word segmentation of handwritten dates in historical documents by combining semantic a-priori-knowledge with local features
    Feldbach, M
    Tönnies, KD
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 333 - 337
  • [28] Enabling Indexing and Retrieval of Historical Arabic Manuscripts through Template Matching Based Word Spotting
    Faisal, Tayyeba
    AlMaadeed, Somaya
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 57 - 63
  • [29] A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology
    Roy, Bishakha
    Chatterjee, Rohit Kamal
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 198 - 203
  • [30] A Method of Recognizing Handwritten Characters in Japanese Historical Documents by Using Feature Graphs
    Nakata, Mitsuru
    Nishida, Shuichi
    Fukuda, Ryuzo
    Ge, Qi-Wei
    Yoshimura, Makoto
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (3B): : 953 - 966