ON SEGMENTATION OF TOUCHING CHARACTERS AND OVERLAPPING LINES IN DEGRADED PRINTED GURMUKHI SCRIPT

被引:9
|
作者
Jindal, Manish Kumar [1 ]
Lehal, Gurpreet Singh [2 ]
Sharma, Rajendra Kumar [3 ]
机构
[1] Panjab Univ, Reg Ctr, Dept Comp Sci & Applicat, Muktsar 152026, Punjab, India
[2] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India
[3] Thapar Univ, Sch Math & Comp Applicat, Patiala 147002, Punjab, India
关键词
Gurmukhi script; touching characters; horizontally overlapping lines; top zone; character segmentation;
D O I
10.1142/S0219467809003460
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Character segmentation plays a very important role in a text recognition system. The simple technique of using inter-character gap for segmentation is useful for fine printed documents, but this technique fails to give satisfactory results if the input text contains touching characters. In this paper, we have proposed two algorithms to segment touching characters, and one algorithm to segment overlapping lines in degraded printed Gurmukhi document. Various categories of touching characters in different zones, along with their solutions, have been proposed. The solution methodology extensively uses the structural properties of Gurmukhi script. The algorithm proposed for segmenting horizontally overlapping lines uses a heuristics based upon the height of a character. The problem of multiple horizontally overlapping lines may occur in a number of situations such as printed newspapers, old magazines and books etc. Similarity among Indian scripts allows us to use these algorithms for solving the segmentation problems in other Indian languages also.
引用
收藏
页码:321 / 353
页数:33
相关论文
共 50 条
  • [21] SEGMENTATION OF TOUCHING LANNA CHARACTERS
    Pravesjit, Sakkayaphop
    Thammano, Arit
    2011 PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP 2011), 2011,
  • [22] Segmentation of touching characters in formulas
    Okamoto, M
    Sakaguchi, S
    Suzuki, T
    DOCUMENT ANALYSIS SYSTEMS: THEORY AND PRACTICE, 1999, 1655 : 151 - 156
  • [23] Segmentation of degraded characters
    Beheim, L
    Milgram, M
    DOCUMENT RECOGNITION AND RETRIEVAL VII, 2000, 3967 : 11 - 22
  • [24] Segmentation of Touching Arabic Characters in Handwritten Documents by Overlapping Set Theory and Contour Tracing
    Ullah, Inam
    Azmi, Mohd Sanusi
    Desa, Mohamad Ishak
    Alomari, Yazan M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 155 - 160
  • [25] Segmentation of touching Arabic characters in Handwritten documents by overlapping set theory and contour tracing
    Ullah I.
    Azmi M.S.
    Desa M.I.
    Alomari Y.M.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (05): : 155 - 160
  • [26] Automatic segmentation of overlapping and touching chromosomes
    Yuan, ZQ
    Chen, XH
    Zhang, RL
    Yu, C
    IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 334 - 339
  • [27] An iterative algorithm for segmentation of isolated handwritten words in Gurmukhi script
    Sharma, Dharam Veer
    Lehal, Gurpreet Singh
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 1022 - +
  • [28] A segmentation method for touching Japanese handwritten characters based on connecting condition of lines
    Yamaguchi, T
    Yoshikawa, T
    Shinogi, T
    Tsuruoka, S
    Teramoto, M
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 837 - 841
  • [29] Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy multifactorial analysis
    Garain, U
    Chaudhuri, BB
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 805 - 809
  • [30] Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy, multifactorial analysis
    Garain, U
    Chaudhuri, BB
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04): : 449 - 459