CHARACTER PROTOTYPE SELECTION FOR HANDWRITING RECOGNITION IN HISTORICAL DOCUMENTS

被引:0
|
作者
Fischer, Andreas [1 ]
Bunke, Horst [1 ]
机构
[1] Univ Bern, Inst Comp Sci & Appl Math, Neubruckstr 10, CH-3012 Bern, Switzerland
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Handwriting recognition in historical documents is vital for making scanned manuscript images amenable to searching and browsing in digital libraries. A valuable source of information is given by the basic character shapes that vary greatly for different manuscripts. Typically, character prototype images are extracted manually for bootstrapping a recognition system. This process, however, is time-consuming and the resulting prototypes may not cover all writing styles. In this paper, we propose an automatic character prototype selection method based on a forced alignment using Hidden Markov Models (HMM) and graph matching. Besides the predominant character shape given by the median or center graph, structurally different additional prototypes are retrieved with spanning and k-centers prototype selection. On the historical Parzival data set, it is demonstrated that the proposed automatic selection outperforms a manual selection for handwriting recognition with graph similarity features.
引用
收藏
页码:1435 / 1439
页数:5
相关论文
共 50 条
  • [41] On-line handwriting character recognition using stroke information
    Shin, J
    DEVELOPMENTS IN APPLIED ARTIFICAIL INTELLIGENCE, PROCEEDINGS, 2002, 2358 : 703 - 714
  • [42] Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records
    schone, Patrick
    Nielson, Heath
    Ward, Mark
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [43] ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers
    Fornes, Alicia
    Chen, Jialuo
    Torras, Pau
    Badal, Carles
    Megyesi, Beata
    Waldispuhl, Michelle
    Kopal, Nils
    Lasry, George
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809 : 332 - 344
  • [44] Interactive training for handwriting recognition in historical document collections
    Kennard, Douglas J.
    Barrett, William A.
    DOCUMENT RECOGNITION AND RETRIEVAL XIV, 2007, 6500
  • [45] Online Arabic Handwriting Character Recognition Using Matching Algorithm
    Omer, Marwan Ali. H.
    Ma, Shi Long
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 2, 2010, : 259 - 262
  • [46] Structural Offline Handwriting Character Recognition Using Levenshtein Distance
    Putra, Made Edwin Wira
    Supriana, Iping
    5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 31 - 36
  • [47] Multi-character field recognition for Arabic and Chinese handwriting
    Lopresti, Daniel
    Nagy, George
    Seth, Sharad
    Zhang, Xiaoli
    ARABIC AND CHINESE HANDWRITING RECOGNITION, 2008, 4768 : 218 - +
  • [48] Offline Chinese Handwriting Character Recognition through Feature Extraction
    Luo, Yuchen
    Xia, Rui
    Abdulghafour, M.
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 394 - 398
  • [49] Touching Character Segmentation Method for Chinese Historical Documents
    Sun, Xiaolu
    Peng, Liangrui
    Ding, Xiaoqing
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [50] Character Segmentation for Classical Mongolian Words in Historical Documents
    Su, Xiangdong
    Gao, Guanglai
    Wang, Weihua
    Bao, Feilong
    Wei, Hongxi
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 464 - 473