ON SEGMENTATION OF TOUCHING CHARACTERS AND OVERLAPPING LINES IN DEGRADED PRINTED GURMUKHI SCRIPT

被引:9
|
作者
Jindal, Manish Kumar [1 ]
Lehal, Gurpreet Singh [2 ]
Sharma, Rajendra Kumar [3 ]
机构
[1] Panjab Univ, Reg Ctr, Dept Comp Sci & Applicat, Muktsar 152026, Punjab, India
[2] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India
[3] Thapar Univ, Sch Math & Comp Applicat, Patiala 147002, Punjab, India
关键词
Gurmukhi script; touching characters; horizontally overlapping lines; top zone; character segmentation;
D O I
10.1142/S0219467809003460
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Character segmentation plays a very important role in a text recognition system. The simple technique of using inter-character gap for segmentation is useful for fine printed documents, but this technique fails to give satisfactory results if the input text contains touching characters. In this paper, we have proposed two algorithms to segment touching characters, and one algorithm to segment overlapping lines in degraded printed Gurmukhi document. Various categories of touching characters in different zones, along with their solutions, have been proposed. The solution methodology extensively uses the structural properties of Gurmukhi script. The algorithm proposed for segmenting horizontally overlapping lines uses a heuristics based upon the height of a character. The problem of multiple horizontally overlapping lines may occur in a number of situations such as printed newspapers, old magazines and books etc. Similarity among Indian scripts allows us to use these algorithms for solving the segmentation problems in other Indian languages also.
引用
收藏
页码:321 / 353
页数:33
相关论文
共 50 条
  • [1] Segmentation of horizontally overlapping lines in printed Gurmukhi script
    Jindal, M. K.
    Sharma, R. K.
    Lehal, G. S.
    2006 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, VOLS 1 AND 2, 2007, : 219 - +
  • [2] A Study of Touching Characters in Degraded Gurmukhi Text
    Jindal, M. K.
    Lehal, G. S.
    Sharma, R. K.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 4, 2005, 4 : 121 - 124
  • [3] Structural features for recognizing degraded printed Gurmukhi script
    Jindal, M. K.
    Sharma, R. K.
    Lehal, G. S.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2008, : 668 - +
  • [4] Text segmentation of machine printed Gurmukhi script
    Lehal, GS
    Singh, C
    DOCUMENT RECOGNITION AND RETRIEVAL VIII, 2001, 4307 : 223 - 231
  • [5] A hybrid approach to character segmentation of Gurmukhi script characters
    Davessar, NM
    Madan, S
    Singh, H
    32ND APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP, PROCEEDINGS, 2004, : 169 - 173
  • [6] Zone Segmentation of a Text Line Printed in Gurmukhi Script Newspaper
    Kaur, Rupinder Pal
    Jindal, M. K.
    Kumar, Munish
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 330 - 334
  • [7] Segmentation of Overlapping and Touching Sinhala Handwritten Characters
    Walawage, K. S. A.
    Ranathunga, L.
    2018 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY RESEARCH (ICITR), 2018,
  • [8] SEGMENTATION OF TOUCHING CHARACTERS IN PRINTED DOCUMENT RECOGNITION
    LIANG, S
    SHRIDHAR, M
    AHMADI, M
    PATTERN RECOGNITION, 1994, 27 (06) : 825 - 840
  • [9] Text and graphics segmentation of newspapers printed in Gurmukhi script: a hybrid approach
    Kaur, Rupinder Pal
    Jindal, M. K.
    Kumar, Munish
    VISUAL COMPUTER, 2021, 37 (07): : 1637 - 1659
  • [10] Text and graphics segmentation of newspapers printed in Gurmukhi script: a hybrid approach
    Rupinder Pal Kaur
    M. K. Jindal
    Munish Kumar
    The Visual Computer, 2021, 37 : 1637 - 1659