ON SEGMENTATION OF TOUCHING CHARACTERS AND OVERLAPPING LINES IN DEGRADED PRINTED GURMUKHI SCRIPT

被引:9
|
作者
Jindal, Manish Kumar [1 ]
Lehal, Gurpreet Singh [2 ]
Sharma, Rajendra Kumar [3 ]
机构
[1] Panjab Univ, Reg Ctr, Dept Comp Sci & Applicat, Muktsar 152026, Punjab, India
[2] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India
[3] Thapar Univ, Sch Math & Comp Applicat, Patiala 147002, Punjab, India
关键词
Gurmukhi script; touching characters; horizontally overlapping lines; top zone; character segmentation;
D O I
10.1142/S0219467809003460
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Character segmentation plays a very important role in a text recognition system. The simple technique of using inter-character gap for segmentation is useful for fine printed documents, but this technique fails to give satisfactory results if the input text contains touching characters. In this paper, we have proposed two algorithms to segment touching characters, and one algorithm to segment overlapping lines in degraded printed Gurmukhi document. Various categories of touching characters in different zones, along with their solutions, have been proposed. The solution methodology extensively uses the structural properties of Gurmukhi script. The algorithm proposed for segmenting horizontally overlapping lines uses a heuristics based upon the height of a character. The problem of multiple horizontally overlapping lines may occur in a number of situations such as printed newspapers, old magazines and books etc. Similarity among Indian scripts allows us to use these algorithms for solving the segmentation problems in other Indian languages also.
引用
收藏
页码:321 / 353
页数:33
相关论文
共 50 条
  • [41] A RECURSIVE ALGORITHM FOR SEGMENTATION OF DEGRADED DEVANAGARI SCRIPT
    Habib, Sobia
    Shukla, Manoj K.
    Kapoor, Rajiv
    MECHATRONIC SYSTEMS AND CONTROL, 2023, 51 (03): : 133 - 142
  • [42] Segmentation of horizontal and vertical touching Thai characters
    Premchaiswadi, N
    Premchaiswadi, W
    Narita, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2000, E83A (06) : 987 - 995
  • [43] A segmentation system for touching handwritten Japanese characters
    Yamaguchi, T
    Tsuruoka, S
    Yoshikawa, T
    Shinogi, T
    Makimoto, E
    Ogata, H
    Shridhar, M
    EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 407 - 412
  • [44] A segmentation method for touching handwritten Japanese characters
    Nishimura, H
    Ikeda, H
    Nakano, Y
    DOCUMENT ANALYSIS SYSTEMS: THEORY AND PRACTICE, 1999, 1655 : 130 - 139
  • [45] Detection and Segmentation of Lines and Words in Gurmukhi Handwritten Text
    Kumar, Rajiv
    Singh, Amardeep
    2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 353 - +
  • [46] Preprocessing for Identification of Degraded Urdu and Devanagari Printed Script
    Habib, Sobia
    Shukla, Manoj Kumar
    Kapoor, Rajiv
    COMPUTATIONAL INTELLIGENCE IN PATTERN RECOGNITION, CIPR 2020, 2020, 1120 : 519 - 528
  • [47] A Semantic Segmentation Based Approach for Segmentation and Recognition of Touching and Overlapping Digits
    Demir, Ali Alper
    Ozkaya, Ufuk
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [48] Touching character segmentation for printed Odia document
    Pattanayak, Sanjibani Sudha
    Malik, Ramesh Chandra
    Pradhan, Sateesh Kumar
    Kar, Aradhana
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 387 - 392
  • [49] Analysis and recognition of highly degraded printed characters
    Anna Tonazzini
    Stefano Vezzosi
    Luigi Bedini
    Document Analysis and Recognition, 2003, 6 (4): : 236 - 247
  • [50] A Novel Approach for Segmentation of Touching Characters on the License Plate
    Wang, Ran
    Wang, Guoyou
    Liu, Jianguo
    Tian, Jiangmin
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2012), 2013, 8768