A graph-based approach for segmenting touching lines in historical handwritten documents

被引:0
|
作者
David Fernández-Mota
Josep Lladós
Alicia Fornés
机构
[1] Universitat Autònoma de Barcelona,Computer Vision Center—Computer Science Department
关键词
Text line segmentation; Handwritten documents; Document image processing; Historical document analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Text line segmentation in handwritten documents is an important task in the recognition of historical documents. Handwritten document images contain text lines with multiple orientations, touching and overlapping characters between consecutive text lines and different document structures, making line segmentation a difficult task. In this paper, we present a new approach for handwritten text line segmentation solving the problems of touching components, curvilinear text lines and horizontally overlapping components. The proposed algorithm formulates line segmentation as finding the central path in the area between two consecutive lines. This is solved as a graph traversal problem. A graph is constructed using the skeleton of the image. Then, a path-finding algorithm is used to find the optimum path between text lines. The proposed algorithm has been evaluated on a comprehensive dataset consisting of five databases: ICDAR2009, ICDAR2013, UMD, the George Washington and the Barcelona Marriages Database. The proposed method outperforms the state-of-the-art considering the different types and difficulties of the benchmarking data.
引用
收藏
页码:293 / 312
页数:19
相关论文
共 50 条
  • [1] A graph-based approach for segmenting touching lines in historical handwritten documents
    Fernandez-Mota, David
    Llados, Josep
    Fornes, Alicia
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (03) : 293 - 312
  • [2] Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 564 - 573
  • [3] Ensembles for Graph-based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 714 - 720
  • [4] Filters for graph-based keyword spotting in historical handwritten documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    PATTERN RECOGNITION LETTERS, 2020, 134 : 125 - 134
  • [5] Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION (GBRPR 2017), 2017, 10310 : 83 - 93
  • [6] Cross-Evaluation of Graph-Based Keyword Spotting in Handwritten Historical Documents
    Stauffer, Michael
    Maergner, Paul
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 : 45 - 55
  • [7] A scale space approach for automatically segmenting words from historical handwritten documents
    Manmatha, R
    Rothfeder, JL
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) : 1212 - 1225
  • [8] A Language Model for Improving the Graph-Based Transcription Approach for Historical Documents
    Lecireth Meza-Lovon, Graciela
    ADVANCES IN ARTIFICIAL INTELLIGENCE (IBERAMIA 2014), 2014, 8864 : 229 - 241
  • [9] Retrieval of handwritten lines in historical documents
    Schomaker, L. R. B.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 594 - 598
  • [10] Graph Based Keyword Spotting in Handwritten Historical Slavic Documents
    Riesen, Kaspar
    Brodic, Darko
    ERCIM NEWS, 2013, (95): : 37 - 38