Seam carving, horizontal projection profile and contour tracing for line and word segmentation of language independent handwritten documents

Cited by: 1
Authors
Das, Mamatarani [1]
Panda, Mrutyunjaya [1]
Affiliations
[1] Utkal Univ, Dept Comp Sci & Applict, Bhubaneswar, Odisha, India
Keywords
Line segmentation; Word segmentation; Seam carving; Horizontal projection profile; Handwritten documents; Connected components;
DOI
10.1016/j.rineng.2023.101110
Chinese Library Classification
T [Industrial Technology]
Subject classification code
08
Abstract
Handwritten documents remain considerably harder to recognise than printed ones. Practical documents are made up of words or character strings rather than isolated characters, so line and word segmentation plays a pivotal role in any handwriting recognition task: the output of this stage can drastically affect recognition performance. This paper proposes a line segmentation approach that combines two distinct techniques, the horizontal projection profile and seam carving. The horizontal projection profile first gives a rough estimate of where the lines lie in the document. On its own this method works well for printed documents but is not sufficient for handwritten ones, where the line separation distance varies from writer to writer, so seam carving is then applied to segment the lines finely. Dynamic programming is used to build an energy matrix from the input image and to determine the minimum-energy paths from left to right. For word segmentation, contour points are traced before the seam carving algorithm is applied to find candidate paths, and paths that intersect the characters of the text are removed. The text-line and word segmentation technique is evaluated on the standard, publicly available IAM English handwritten dataset and the Bangla Writing dataset, and the results show promising recognition accuracy.
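The two line-segmentation ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a binary page image stored as a list of rows (1 = ink, 0 = background) and, as a simplification, uses the ink value itself as the energy; the paper's actual energy function is not given in the abstract. The function names and the toy page are hypothetical.

```python
def horizontal_projection_profile(image):
    """Count ink pixels (value 1) in each row of a binary image."""
    return [sum(row) for row in image]


def min_energy_horizontal_seam(energy):
    """Return, for each column, the row of the minimum-energy left-to-right seam.

    Dynamic programming: cost[y][x] is the cheapest cumulative energy of any
    connected path that starts in column 0 and ends at pixel (y, x).
    """
    rows, cols = len(energy), len(energy[0])
    cost = [[energy[y][0]] + [0] * (cols - 1) for y in range(rows)]
    back = [[0] * cols for _ in range(rows)]
    for x in range(1, cols):
        for y in range(rows):
            # A seam may arrive from row y-1, y or y+1 of the previous column.
            candidates = [(cost[py][x - 1], py)
                          for py in (y - 1, y, y + 1) if 0 <= py < rows]
            best_cost, best_y = min(candidates)
            cost[y][x] = energy[y][x] + best_cost
            back[y][x] = best_y
    # Start from the cheapest end point and follow the back-pointers.
    y = min(range(rows), key=lambda r: cost[r][cols - 1])
    seam = [0] * cols
    for x in range(cols - 1, -1, -1):
        seam[x] = y
        y = back[y][x]
    return seam


# Toy page: two text lines (rows 0 and 3) whose strokes reach into the gap,
# so no row is completely free of ink.
page = [
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 1, 1, 0],  # descender of the upper line
    [1, 1, 0, 0, 0, 0],  # ascender of the lower line
    [1, 1, 1, 1, 1, 1],
]

profile = horizontal_projection_profile(page)
seam = min_energy_horizontal_seam(page)
print(profile)  # [6, 2, 2, 6]
print(seam)     # [1, 1, 1, 2, 2, 1]
```

In this toy page the profile only shows a low-ink valley around rows 1 and 2 and no empty row, so a straight horizontal cut would slice through a stroke. The seam bends around the descender and the ascender and touches no ink, which is the behaviour the abstract attributes to the seam carving refinement.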
Pages: 11
Related papers
15 in total
  • [1] Text Line Segmentation for Handwritten Documents Using Constrained Seam Carving
    Zhang, Xi
    Tan, Chew Lim
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 98 - 103
  • [2] Line and word Segmentation of Kannada Handwritten Text documents using Projection Profile Technique
    Banumathi, K. L.
    Chandra, Jagadeesh A. P.
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 196 - 201
  • [3] Handwritten Arabic Documents Segmentation into Text Lines using Seam Carving
    Daldali, M.
    Souhar, A.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (05): : 89 - 96
  • [4] Text line and word segmentation of handwritten documents
    Louloudis, G.
    Gatos, B.
    Pratikakis, I.
    Halatsis, C.
    PATTERN RECOGNITION, 2009, 42 (12) : 3169 - 3183
  • [5] Seam carving-based Arabic handwritten sub-word segmentation
    Berriche, Lamia
    Al-Mutairy, Abeer
    COGENT ENGINEERING, 2020, 7 (01):
  • [6] Word segmentation of off-line handwritten documents
    Huang, Chen
    Srihari, Sargur N.
    DOCUMENT RECOGNITION AND RETRIEVAL XV, 2008, 6815
  • [7] Segmentation of touching Arabic characters in Handwritten documents by overlapping set theory and contour tracing
    Ullah I.
    Azmi M.S.
    Desa M.I.
    Alomari Y.M.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (05): : 155 - 160
  • [8] Segmentation of Touching Arabic Characters in Handwritten Documents by Overlapping Set Theory and Contour Tracing
    Ullah, Inam
    Azmi, Mohd Sanusi
    Desa, Mohamad Ishak
    Alomari, Yazan M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 155 - 160
  • [9] Line Segmentation in Handwritten Assamese and Meetei Mayek Script Using Seam Carving Based Algorithm
    Kumar, Chandan Jyoti
    Kalita, Sanjib Kr.
    ADVANCES IN OPTICAL SCIENCE AND ENGINEERING, 2015, 166 : 399 - 408
  • [10] Robust text-line and word segmentation for handwritten documents images
    Stafylakis, Themos
    Papavassiliou, Vassilis
    Katsouros, Vassilis
    Carayannis, George
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3393 - 3396