A graph-based approach for segmenting touching lines in historical handwritten documents

被引:0
|
作者
David Fernández-Mota
Josep Lladós
Alicia Fornés
机构
[1] Universitat Autònoma de Barcelona,Computer Vision Center—Computer Science Department
关键词
Text line segmentation; Handwritten documents; Document image processing; Historical document analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Text line segmentation in handwritten documents is an important task in the recognition of historical documents. Handwritten document images contain text lines with multiple orientations, touching and overlapping characters between consecutive text lines and different document structures, making line segmentation a difficult task. In this paper, we present a new approach for handwritten text line segmentation solving the problems of touching components, curvilinear text lines and horizontally overlapping components. The proposed algorithm formulates line segmentation as finding the central path in the area between two consecutive lines. This is solved as a graph traversal problem. A graph is constructed using the skeleton of the image. Then, a path-finding algorithm is used to find the optimum path between text lines. The proposed algorithm has been evaluated on a comprehensive dataset consisting of five databases: ICDAR2009, ICDAR2013, UMD, the George Washington and the Barcelona Marriages Database. The proposed method outperforms the state-of-the-art considering the different types and difficulties of the benchmarking data.
引用
收藏
页码:293 / 312
页数:19
相关论文
共 50 条
  • [31] A Morphology based Approach for Binarization of Handwritten Documents
    Papavassiliou, Vassilis
    Simistira, Fotini
    Katsouros, Vassilis
    Carayannis, George
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 577 - 581
  • [32] TSGVi: a graph-based summarization system for Vietnamese documents
    Tu-Anh Nguyen-Hoang
    Khai Nguyen
    Quang-Vinh Tran
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2012, 3 (04) : 305 - 313
  • [33] Graph-based Word Sense Disambiguation of biomedical documents
    Agirre, Eneko
    Soroa, Aitor
    Stevenson, Mark
    BIOINFORMATICS, 2010, 26 (22) : 2889 - 2896
  • [34] Comparison of distance measures for graph-based clustering of documents
    Schenker, A
    Last, M
    Bunke, H
    Kandel, A
    GRAPH BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2726 : 202 - 213
  • [35] TSGVi: a graph-based summarization system for Vietnamese documents
    Tu-Anh Nguyen-Hoang
    Khai Nguyen
    Quang-Vinh Tran
    Journal of Ambient Intelligence and Humanized Computing, 2012, 3 : 305 - 313
  • [36] An Innovative Graph-Based Approach to Advance Feature Selection from Multiple Textual Documents
    Giarelis, Nikolaos
    Kanakaris, Nikos
    Karacapilidis, Nikos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 96 - 106
  • [37] Graph-Based Keyword Spotting in Historical Documents Using Context-Aware Hausdorff Edit Distance
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 49 - 54
  • [38] Spectral Graph-based Features for Recognition of Handwritten Characters: A Case Study on Handwritten Devanagari Numerals
    Bhat, Mohammad Idrees
    Sharada, B.
    JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 799 - 813
  • [39] A segmentation method for touching Japanese handwritten characters based on connecting condition of lines
    Yamaguchi, T
    Yoshikawa, T
    Shinogi, T
    Tsuruoka, S
    Teramoto, M
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 837 - 841
  • [40] A graph-based approach to feature selection
    Zhang Z.
    Hancock E.R.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 6658 LNCS : 205 - 214