CHARACTER PROTOTYPE SELECTION FOR HANDWRITING RECOGNITION IN HISTORICAL DOCUMENTS

被引:0
|
作者
Fischer, Andreas [1 ]
Bunke, Horst [1 ]
机构
[1] Univ Bern, Inst Comp Sci & Appl Math, Neubruckstr 10, CH-3012 Bern, Switzerland
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Handwriting recognition in historical documents is vital for making scanned manuscript images amenable to searching and browsing in digital libraries. A valuable source of information is given by the basic character shapes that vary greatly for different manuscripts. Typically, character prototype images are extracted manually for bootstrapping a recognition system. This process, however, is time-consuming and the resulting prototypes may not cover all writing styles. In this paper, we propose an automatic character prototype selection method based on a forced alignment using Hidden Markov Models (HMM) and graph matching. Besides the predominant character shape given by the median or center graph, structurally different additional prototypes are retrieved with spanning and k-centers prototype selection. On the historical Parzival data set, it is demonstrated that the proposed automatic selection outperforms a manual selection for handwriting recognition with graph similarity features.
引用
收藏
页码:1435 / 1439
页数:5
相关论文
共 50 条
  • [1] Transfer Learning for Handwriting Recognition on Historical Documents
    Granet, Adeline
    Morin, Emmanuel
    Mouchere, Harold
    Quiniou, Solen
    Viard-Gaudin, Christian
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 432 - 439
  • [2] Fast handwriting recognition for indexing historical documents
    Govindaraju, V
    Xue, HH
    FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, : 314 - 320
  • [3] Handwriting Recognition of Historical Documents with few labeled data
    Chammas, Edgard
    Mokbel, Chafic
    Likforman-Sulem, Laurence
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 43 - 48
  • [4] Combining Handwriting and Speech Recognition for Transcribing Historical Handwritten Documents
    Granell, Emilio
    Martinez-Hinarejos, Carlos-D.
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 126 - 130
  • [5] A Complete Optical Character Recognition Methodology for Historical Documents
    Vamvakas, G.
    Gatos, B.
    Stamatopoulos, N.
    Perantonis, S. J.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 525 - 532
  • [6] Improving Handwriting Recognition for Historical Documents Using Synthetic Text Lines
    Spoto, Martin
    Wolf, Beat
    Fischer, Andreas
    Scius-Bertrand, Anna
    INTERTWINING GRAPHONOMICS WITH HUMAN MOVEMENTS, IGS 2021, 2022, 13424 : 61 - 75
  • [7] A Combined System for Text Line Extraction and Handwriting Recognition in Historical Documents
    Fischer, Andreas
    Baechler, Micheal
    Garz, Angelika
    Liwicki, Marcus
    Ingold, Rolf
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 71 - 75
  • [8] An Automatic Method for Enhancing Character Recognition in Degraded Historical Documents
    Pereira e Silva, Gabriel
    Lins, Rafael Dueire
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 553 - 557
  • [9] Optical Character Recognition Techniques for Restoration of Thai Historical Documents
    Tangwongsan, Supachai
    Sumetphong, Chaivatna
    ICCEE 2008: PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2008, : 531 - 535
  • [10] Handwriting Character Recognition System in Documents Containing Abbreviations Using Artificial Neural Networks
    Kartono
    Nuzulita, Nania
    Karnama, Kenny Everest
    Werdiningsih, Indah
    INTERNATIONAL CONFERENCE ON MATHEMATICS, COMPUTATIONAL SCIENCES AND STATISTICS 2020, 2021, 2329