Text Line Segmentation in Images of Handwritten Historical Documents

Cited by: 0
Authors
Sanchez, A. [1]
Suarez, P. D. [1]
Melloz, C. A. B. [2]
Oliveira, A. L. I. [2]
Alves, V. M. O. [2]
Affiliations
[1] Univ Rey Juan Carlos, Dept Ciencias Computac, Madrid 28933, Spain
[2] Univ Pernambuco, Dept Comp Syst, BR-720001 Recife, PE, Brazil
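Keywords
Image processing; segmentation; handwriting; document processing; historical documents; line extraction
DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract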
This paper describes an original method for segmenting handwritten text lines in images of historical documents. After an initial preprocessing step, we compute a black/white transition map to obtain a rough detection of the line regions in the image. From this map, the corresponding line axes are extracted with a skeletonization algorithm, and conflicts between adjacent cutting lines are resolved by heuristics. The approach was tested on a set of digitized handwritten documents (from the PROHIST Project database) dating from the end of the 19th century onwards. The proposed method worked well even on difficult images, correctly segmenting 82.18% of the lines in our database. Compared with another recent proposal for automatic line extraction on the same test images, our method improved correct segmentation by more than 38%.
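As a rough illustration of the black/white transition idea mentioned in the abstract, the sketch below counts ink/background transitions along each pixel row of a binarized image and groups consecutive "busy" rows into candidate line bands. It is not the authors' implementation: the function names, the `min_transitions` threshold, and the synthetic page are assumptions made for this example, and the skeletonization and conflict-resolution heuristics are not covered.

```python
import numpy as np

def row_transition_counts(binary):
    """Count black/white transitions along each row (1 = ink, 0 = background)."""
    img = np.asarray(binary, dtype=np.int8)
    # A transition occurs wherever two horizontally adjacent pixels differ.
    return np.abs(np.diff(img, axis=1)).sum(axis=1)

def rough_line_bands(binary, min_transitions=2):
    """Group consecutive rows with enough transitions into (top, bottom) bands."""
    active = row_transition_counts(binary) >= min_transitions
    bands, start = [], None
    for y, on in enumerate(active):
        if on and start is None:
            start = y                      # a band begins
        elif not on and start is not None:
            bands.append((start, y - 1))   # the band ended on the previous row
            start = None
    if start is not None:                  # band reaching the last row
        bands.append((start, len(active) - 1))
    return bands

# Synthetic "page": two bands of dotted strokes separated by blank rows.
page = np.zeros((12, 20), dtype=np.uint8)
page[2:4, 1::2] = 1
page[7:10, 1::3] = 1
bands = rough_line_bands(page)
print(bands)
```

On a real scan the input would first need binarization, and a per-row count is only a coarse stand-in for a full two-dimensional transition map; it will not separate touching or strongly skewed lines.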
Pages: 232 / +
Number of pages: 3